Abstract
With recent advances in technology and the vast amount of information now available, research in pattern mining has attracted growing attention. In particular, various techniques have been developed for clickstream mining, a specific type of sequential pattern mining, to discover the underlying patterns in Internet users' clickstreams. Because clickstream patterns are complex, many existing works apply sequential pattern algorithms that generate a candidate space of patterns that is exponential with respect to the pattern letters. Furthermore, those patterns were generated in a noiseless environment. To address these problems, we focus on a nonoverlapping clickstream pattern mining task with noisy clicks interleaved between the letters of the clickstream patterns. Additionally, we are interested in labeling the extracted patterns in the user browsing history. A modified suffix tree is proposed to extract those patterns, together with their exact occurrences, from the noisy user database. Following this, we model user browsing behavior with a Hidden Markov Model (HMM) to capture the dependencies between the extracted patterns and then predict future clickstream patterns. Experimental results on both real-life and synthetic datasets show that our proposed algorithms outperform state-of-the-art benchmarks in efficiency and prediction accuracy.
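
To make the task in the abstract concrete, the following is a minimal illustrative sketch, not the authors' method. It replaces the modified suffix tree with a simple greedy scan that finds nonoverlapping occurrences of a known pattern while tolerating noise clicks between its letters, and it replaces the HMM with a plain first-order Markov chain over pattern labels (no hidden states). The function names, the `max_gap` parameter, and the sample data are all assumptions introduced for illustration.

```python
from collections import Counter, defaultdict


def find_nonoverlapping(clicks, pattern, max_gap=2):
    """Return (start, end) index pairs of nonoverlapping occurrences of
    `pattern` in `clicks`, allowing at most `max_gap` noise clicks between
    two consecutive pattern letters (hypothetical noise model)."""
    occurrences = []
    i, n = 0, len(clicks)
    while i < n:
        if clicks[i] != pattern[0]:
            i += 1
            continue
        pos = i        # index of the most recently matched letter
        matched = 1    # number of pattern letters matched so far
        j = i + 1
        # Scan forward while the gap since the last match stays within max_gap.
        while matched < len(pattern) and j < n and j - pos - 1 <= max_gap:
            if clicks[j] == pattern[matched]:
                pos = j
                matched += 1
            j += 1
        if matched == len(pattern):
            occurrences.append((i, pos))
            i = pos + 1  # resume after this occurrence so matches never overlap
        else:
            i += 1
    return occurrences


def predict_next_pattern(labels):
    """Predict the next pattern label from first-order transition counts.
    This is a visible Markov chain, a deliberate simplification of an HMM."""
    transitions = defaultdict(Counter)
    for prev, nxt in zip(labels, labels[1:]):
        transitions[prev][nxt] += 1
    candidates = transitions[labels[-1]]
    if not candidates:
        return None  # the last label was never followed by anything
    return candidates.most_common(1)[0][0]


clicks = ["a", "x", "b", "c", "a", "b", "n", "c", "a", "n", "n", "n", "b", "c"]
occurrences = find_nonoverlapping(clicks, ["a", "b", "c"], max_gap=2)
prediction = predict_next_pattern(["P1", "P2", "P1", "P2", "P1", "P3", "P1"])
```

In this sample, the third candidate occurrence is rejected because three noise clicks separate `a` from `b`, which exceeds the assumed `max_gap` of 2. A greedy scan like this handles one known pattern at a time; it does not discover patterns or avoid the exponential candidate space the abstract mentions.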
Original language | English |
---|---|
Title of host publication | 2022 IEEE Global Communications Conference, GLOBECOM 2022 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 3083-3088 |
Number of pages | 6 |
ISBN (Electronic) | 9781665435406 |
State | Published - 2022 |
Event | 2022 IEEE Global Communications Conference, GLOBECOM 2022 - Virtual, Online, Brazil. Duration: 4 Dec 2022 → 8 Dec 2022 |
Publication series
Name | 2022 IEEE Global Communications Conference, GLOBECOM 2022 - Proceedings |
---|
Conference
Conference | 2022 IEEE Global Communications Conference, GLOBECOM 2022 |
---|---|
Country/Territory | Brazil |
City | Virtual, Online |
Period | 4/12/22 → 8/12/22 |
Bibliographical note
Publisher Copyright: © 2022 IEEE.
Keywords
- Clickstream
- HMM
- Mining Patterns
- Predicting Users Patterns
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Hardware and Architecture
- Signal Processing
- Renewable Energy, Sustainability and the Environment
- Safety, Risk, Reliability and Quality