An interpretable neural network for robustly determining the location and number of cluster centers

Xuetao Xie, Yi Fei Pu*, Huaqing Zhang, Jacek Mańdziuk, El Sayed M. El-Alfy, Jian Wang*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

K-means is a clustering method with an interpretable mechanism. However, its clustering results depend strongly on the location of the initial cluster centers. More importantly, K-means and its improved versions find it extremely hard to determine the number of cluster centers adaptively. Ordinary neural networks, in contrast, have powerful information representation ability but lack interpretability. Moreover, to the best of our knowledge, interpretable neural networks have not yet been used to determine the number of cluster centers of K-means. This paper proposes K-meaNet, which combines the interpretable mechanism of K-means with the powerful information representation ability of neural networks. The inputs, weights, and mathematical expressions of each layer of the neural network in K-meaNet have clear meanings. During training, if a cluster center is critical, the value of its gate, a weight in the neural network that corresponds to this cluster center, increases. At the same time, the position of this cluster center moves close to the ideal cluster center. In addition, the location of the cluster center(s) and the value(s) of the corresponding gate(s) do not change significantly. Unlike K-means and its improved versions, K-meaNet can thus adaptively determine the location and number of cluster centers. Moreover, this adaptive ability is robust to the location of the initial cluster centers, the number of initial cluster centers, and the number of features. Numerical experiments on six synthetic datasets and three real datasets verify that K-meaNet can adaptively determine the number of cluster centers and that this ability is robust to all three factors.
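
The sensitivity of plain K-means to its initial cluster centers, which motivates the work, can be illustrated with a minimal sketch. The code below is not K-meaNet and is not taken from the paper; it is a basic Lloyd-style K-means written with NumPy, and the toy dataset, the function name `lloyd_kmeans`, and both initializations are assumptions chosen only for illustration.

```python
import numpy as np


def lloyd_kmeans(points, init_centers, max_iter=100):
    """Plain Lloyd-style K-means (illustrative only, not K-meaNet).

    Returns the final centers, the label of each point, and the inertia
    (sum of squared distances from each point to its assigned center).
    """
    centers = np.asarray(init_centers, dtype=float).copy()
    for _ in range(max_iter):
        # Squared distance from every point to every center, shape (n, k).
        d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        new_centers = centers.copy()
        for j in range(len(centers)):
            members = points[labels == j]
            if len(members) > 0:  # an empty cluster keeps its old center
                new_centers[j] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    # Recompute assignments and inertia for the final centers.
    d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    labels = d2.argmin(axis=1)
    inertia = float(d2.min(axis=1).sum())
    return centers, labels, inertia


# Hypothetical toy data: three unit squares of four points each,
# spaced 10 units apart along the x-axis.
square = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
points = np.vstack([square + [dx, 0] for dx in (0, 10, 20)])

# One initial center per true cluster.
good_init = [[0.5, 0.5], [10.5, 0.5], [20.5, 0.5]]
# Two initial centers inside the first cluster, one between the other two.
bad_init = [[0.0, 0.5], [1.0, 0.5], [15.5, 0.5]]

_, _, good_inertia = lloyd_kmeans(points, good_init)
_, _, bad_inertia = lloyd_kmeans(points, bad_init)
print("inertia with good initialization:", good_inertia)
print("inertia with bad initialization:", bad_inertia)
```

With the second initialization, the algorithm settles in a poor local optimum in which one center covers two true clusters, even though the number of centers is correct. The number of centers must also be supplied by hand in this sketch, which is the second limitation the abstract describes.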

Original language: English
Pages (from-to): 1473-1501
Number of pages: 29
Journal: International Journal of Machine Learning and Cybernetics
Volume: 15
Issue number: 4
State: Published - Apr 2024

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.

Keywords

  • Cluster center
  • Interpretable
  • K-means
  • Neural network
  • Robust

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence
