A new cluster validity index using maximum cluster spread based compactness measure

  • M. Arif Wani*
  • , Romana Riyaz
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

33 Scopus citations

Abstract

Purpose: – The most commonly used approaches for cluster validation are based on indices but the majority of the existing cluster validity indices do not work well on data sets of different complexities. The purpose of this paper is to propose a new cluster validity index (ARSD index) that works well on all types of data sets. Design/methodology/approach: – The authors introduce a new compactness measure that depicts the typical behaviour of a cluster where more points are located around the centre and lesser points towards the outer edge of the cluster. A novel penalty function is proposed for determining the distinctness measure of clusters. Random linear search-algorithm is employed to evaluate and compare the performance of the five commonly known validity indices and the proposed validity index. The values of the six indices are computed for all nc ranging from (nc min, nc max) to obtain the optimal number of clusters present in a data set. The data sets used in the experiments include shaped, Gaussian-like and real data sets. Findings: – Through extensive experimental study, it is observed that the proposed validity index is found to be more consistent and reliable in indicating the correct number of clusters compared to other validity indices. This is experimentally demonstrated on 11 data sets where the proposed index has achieved better results. Originality/value: – The originality of the research paper includes proposing a novel cluster validity index which is used to determine the optimal number of clusters present in data sets of different complexities.

Original languageEnglish
Pages (from-to)179-204
Number of pages26
JournalInternational Journal of Intelligent Computing and Cybernetics
Volume9
Issue number2
DOIs
StatePublished - 13 Jun 2016
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2016, © Emerald Group Publishing Limited.

Keywords

  • Cluster analysis
  • Cluster validity
  • Clustering
  • Compactness measure
  • Distinctness measure
  • Optimal number

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'A new cluster validity index using maximum cluster spread based compactness measure'. Together they form a unique fingerprint.

Cite this