Skip to main navigation Skip to search Skip to main content

A Group Feature Ranking and Selection Method Based on Dimension Reduction Technique in High-Dimensional Data

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

Group feature selection methods select the important group features by removing the irrelevant group features for reducing the complexity of the model. To the best of our knowledge, there are few group feature selection methods that provide the relative importance of each feature group. For this purpose, we developed a sparse group feature ranking method based on the dimension reduction technique for high dimensional data. Firstly, we applied relief to each group to remove irrelevant individual features. Secondly, we extract the new feature that represents each feature group. To this end, we reduce the multiple dimension of the group feature into a single dimension by applying Fisher linear discriminant analysis (FDA) for each feature group. At last, we estimate the relative importance of the extracted feature by applying random forest and selecting important features that have larger importance scores compared with other ones. In the end, machine-learning algorithms can be used to train and test the models. For the experiment, we compared the proposed with the supervised group lasso (SGL) method by using real-life high-dimensional datasets. Results show that the proposed method selects a few important group features just like the existing group feature selection method and provides the ranking and relative importance of all group features. SGL slightly performs better on logistic regression whereas the proposed method performs better on support vector machine, random forest, and gradient boosting in terms of classification performance metrics.

Original languageEnglish
Pages (from-to)125136-125147
Number of pages12
JournalIEEE Access
Volume10
DOIs
StatePublished - 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2013 IEEE.

Keywords

  • Dimension reduction
  • feature extraction
  • group feature ranking
  • group feature selection
  • high dimensional data

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'A Group Feature Ranking and Selection Method Based on Dimension Reduction Technique in High-Dimensional Data'. Together they form a unique fingerprint.

Cite this