A new approach to enhance the performance of decision tree for classifying gene expression data

Md Rafiul Hassan*, Ramamohanarao Kotagiri

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Background: Gene expression data classification is a challenging task due to the large dimensionality and very small number of samples. Decision tree is one of the popular machine learning approaches to address such classification problems. However, the existing decision tree algorithms use a single gene feature at each node to split the data into its child nodes and hence might suffer from poor performance specially when classifying gene expression dataset. Results: By using a new decision tree algorithm where, each node of the tree consists of more than one gene, we enhance the classification performance of traditional decision tree classifiers. Our method selects suitable genes that are combined using a linear function to form a derived composite feature. To determine the structure of the tree we use the area under the Receiver Operating Characteristics curve (AUC). Experimental analysis demonstrates higher classification accuracy using the new decision tree compared to the other existing decision trees in literature. Conclusion: We experimentally compare the effect of our scheme against other well known decision tree techniques. Experiments show that our algorithm can substantially boost the classification performance of the decision tree.

Original languageEnglish
Article numberS3
JournalBMC Proceedings
Volume7
DOIs
StatePublished - 20 Dec 2013
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2013 Hassan and Kotagiri; licensee BioMed Central Ltd.

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology

Fingerprint

Dive into the research topics of 'A new approach to enhance the performance of decision tree for classifying gene expression data'. Together they form a unique fingerprint.

Cite this