A Multi-Label Multi-View Learning Framework for In-App Service Usage Analysis
Abstract
The service usage analysis, aiming at identifying customers' messaging behaviors based on encrypted App traffic flows, has become a challenging and emergent task for service providers. Prior literature usually starts from segmenting a traffic sequence into single-usage subsequences, and then classify the subsequences into different usage types. However, they could suffer from inaccurate traffic segmentations and mixed-usage subsequences. To address this challenge, we exploit a multi-label multi-view learning strategy and develop an enhanced framework for in-App usage analytics. Specifically, we first devise an enhanced traffic segmentation method to reduce mixed-usage subsequences. Besides, we develop a multi-label multi-view logistic classification method, which comprises two alignments. The first alignment is to make use of the classification consistency between packet-length view and time-delay view of traffic subsequences and improve classification accuracy. The second alignment is to combine the classification of single-usage subsequence and the post-classification of mixed-usage subsequences into a unified multi-label logistic classification problem. Finally, we present extensive experiments with real-world datasets to demonstrate the effectiveness of our approach. We find that the proposed multi-label multi-view framework can help overcome the pain of mixed-usage subsequences and can be generalized to latent activity analysis in sequential data, beyond in-App usage analytics.
Recommended Citation
Y. Fu et al., "A Multi-Label Multi-View Learning Framework for In-App Service Usage Analysis," ACM Transactions on Intelligent Systems and Technology, vol. 9, no. 4, Association for Computing Machinery (ACM), Feb 2018.
The definitive version is available at https://doi.org/10.1145/3151937
Department(s)
Computer Science
Research Center/Lab(s)
Intelligent Systems Center
Keywords and Phrases
Alignment; Classification accuracy; Classification consistency; Classification methods; Internet traffic; Multi-label; Multi-views; Segmentation methods; Service usage; E-learning; In-app analytics
International Standard Serial Number (ISSN)
2157-6904; 2157-6912
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2018 Association for Computing Machinery (ACM), All rights reserved.
Publication Date
01 Feb 2018