TKQ: Top-K Quantitative High Utility Itemset Mining

Mourad Nouioua, Philippe Fournier-Viger*, Wensheng Gan, Youxi Wu, Jerry Chun Wei Lin, Farid Nouioua

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

High utility itemset mining is a well-studied data mining task for analyzing customer transactions. It consists of finding the sets of items purchased together that yield a profit that is greater than a minutil threshold, set by the user. To find more precise patterns with purchase quantities, that task was recently generalized as high utility quantitative itemset mining. But an important drawback of current algorithms is that finding an appropriate minutil value is not intuitive and can greatly influence the output. A too small minutil value may lead to very long runtimes and finding millions of patterns, while a too high value, may result in missing many important patterns. To address this issue, this paper redefines the task as top-k quantitative high utility itemset mining and proposes a novel algorithm named TKQ (Top K Quantitative itemset miner), which let the user directly specify the number k of patterns to be found. The algorithm includes three strategies to improve its performance. Experiments on benchmark datasets show that TKQ has excellent performance.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 17th International Conference, ADMA 2021, Proceedings
EditorsBohan Li, Lin Yue, Jing Jiang, Weitong Chen, Xue Li, Guodong Long, Fei Fang, Han Yu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages16-28
Number of pages13
ISBN (Print)9783030954079
DOIs
StatePublished - 2022
Externally publishedYes
Event17th International Conference on Advanced Data Mining and Applications, ADMA 2021 - Sydney, Australia
Duration: 2 Feb 20224 Feb 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13088 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th International Conference on Advanced Data Mining and Applications, ADMA 2021
Country/TerritoryAustralia
CitySydney
Period2/02/224/02/22

Bibliographical note

Publisher Copyright:
© 2022, Springer Nature Switzerland AG.

Keywords

  • High utility itemset mining
  • Quantitative itemsets
  • Top-k Pattern mining

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'TKQ: Top-K Quantitative High Utility Itemset Mining'. Together they form a unique fingerprint.

Cite this