Abstract
High utility itemset mining is a well-studied data mining task for analyzing customer transactions. It consists of finding the sets of items purchased together that yield a profit that is greater than a minutil threshold, set by the user. To find more precise patterns with purchase quantities, that task was recently generalized as high utility quantitative itemset mining. But an important drawback of current algorithms is that finding an appropriate minutil value is not intuitive and can greatly influence the output. A too small minutil value may lead to very long runtimes and finding millions of patterns, while a too high value, may result in missing many important patterns. To address this issue, this paper redefines the task as top-k quantitative high utility itemset mining and proposes a novel algorithm named TKQ (Top K Quantitative itemset miner), which let the user directly specify the number k of patterns to be found. The algorithm includes three strategies to improve its performance. Experiments on benchmark datasets show that TKQ has excellent performance.
| Original language | English |
|---|---|
| Title of host publication | Advanced Data Mining and Applications - 17th International Conference, ADMA 2021, Proceedings |
| Editors | Bohan Li, Lin Yue, Jing Jiang, Weitong Chen, Xue Li, Guodong Long, Fei Fang, Han Yu |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 16-28 |
| Number of pages | 13 |
| ISBN (Print) | 9783030954079 |
| DOIs | |
| State | Published - 2022 |
| Externally published | Yes |
| Event | 17th International Conference on Advanced Data Mining and Applications, ADMA 2021 - Sydney, Australia Duration: 2 Feb 2022 → 4 Feb 2022 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 13088 LNAI |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 17th International Conference on Advanced Data Mining and Applications, ADMA 2021 |
|---|---|
| Country/Territory | Australia |
| City | Sydney |
| Period | 2/02/22 → 4/02/22 |
Bibliographical note
Publisher Copyright:© 2022, Springer Nature Switzerland AG.
Keywords
- High utility itemset mining
- Quantitative itemsets
- Top-k Pattern mining
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science