Abstract
Elucidating the patterns hidden in gene expression data offers an opportunity for identifying co-expressed genes and biologically relevant grouping of genes. However, the large number of genes and the complexity of biological networks greatly increase the challenges of comprehending and interpreting the microarray data. A first step toward addressing this challenge is the use of clustering techniques. Validation of results obtained from a clustering algorithm is an important part of the clustering process. In this paper, we propose a new cluster validity index (ARPoints index) for the purpose of cluster validation. A new approach to determine the compactness measure and distinctness measure of clusters is presented. We revisit commonly known indices and conduct a thorough comparison of these indices with the proposed index and provide a summary of performance evaluation of different indices. Experimental results show that the proposed index performs better than the commonly known cluster validity indices.
| Original language | English |
|---|---|
| Pages (from-to) | 66-84 |
| Number of pages | 19 |
| Journal | International Journal of Data Mining and Bioinformatics |
| Volume | 17 |
| Issue number | 1 |
| DOIs | |
| State | Published - 2017 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2017 Inderscience Enterprises Ltd.
Keywords
- Cluster validation
- Clustering
- Clustering gene data
- Compactness measure of clusters
- Distinctness measure of clusters
- Gene expression analysis
ASJC Scopus subject areas
- Information Systems
- General Biochemistry, Genetics and Molecular Biology
- Library and Information Sciences