Embedded feature selection approach based on TSK fuzzy system with sparse rule base for high-dimensional classification problems

Xiaoling Gong, Jian Wang*, Qilin Ren, Kai Zhang, El Sayed M. El-Alfy, Jacek Mańdziuk

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In high-dimensional problems of fuzzy rule-based embedded feature selection, the challenges include loss of interpretability, curse of dimensionality, and arithmetic underflow, among others. The primary reason for these problems is the exponential increase in the number of fuzzy rules with an increase in input dimension. In this study, an embedded feature selection approach, the Takagi–Sugeno–Kang (TSK) fuzzy system with sparse rule base (TSK-SRB), is proposed for high-dimensional data. Based on clustering, a broader initial rule base is designed with a suitable number of rules. In the rule layer, refined softmin (Ref-softmin) is introduced to calculate the firing strength, which can approximate the minimum T-norm while avoiding arithmetic underflow. Two Group Lasso regularization terms are used to realize feature and rule selection. In addition, an automatic threshold segmentation is introduced to determine the appropriate number of selected features/rules. Extensive experiments on 17 classification datasets showed that TSK-SRB is effective and competitive in high-dimensional feature selection.

Original languageEnglish
Article number111809
JournalKnowledge-Based Systems
Volume295
DOIs
StatePublished - 8 Jul 2024

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

Keywords

  • Feature selection
  • High-dimensional data
  • Rule selection
  • Sparse rule base
  • TSK fuzzy system

ASJC Scopus subject areas

  • Software
  • Management Information Systems
  • Information Systems and Management
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Embedded feature selection approach based on TSK fuzzy system with sparse rule base for high-dimensional classification problems'. Together they form a unique fingerprint.

Cite this