Abstract
This paper presents an effective heart disease prediction model through detecting the anomalies, also known as outliers, in healthcare data using the unsupervised K-means clustering algorithm. Most existing approaches for detecting anomalies are based on constructing profiles of normal instances. However, such techniques require an adequate number of normal profiles to justify those models. Our proposed model first evaluates an optimal value of K using Silhouette method. Next, it intends to locate anomalies that are far from a certain threshold distance with respect to their clusters. Finally, the five most popular classification techniques such as K-Nearest Neighbor (KNN), Random Forest (RF), Support Vector Machines (SVM), Naive Bayes (NB), and Logistic Regression (LR) are applied to build the resultant prediction model. The effectiveness of the proposed methodology is justified using a benchmark dataset of heart disease.
| Original language | English |
|---|---|
| Title of host publication | Hybrid Intelligent Systems - 20th International Conference on Hybrid Intelligent Systems, HIS 2020 |
| Editors | Ajith Abraham, Thomas Hanne, Oscar Castillo, Niketa Gandhi, Tatiane Nogueira Rios, Tzung-Pei Hong |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 280-288 |
| Number of pages | 9 |
| ISBN (Print) | 9783030730499 |
| DOIs | |
| State | Published - 2021 |
| Externally published | Yes |
Publication series
| Name | Advances in Intelligent Systems and Computing |
|---|---|
| Volume | 1375 AIST |
| ISSN (Print) | 2194-5357 |
| ISSN (Electronic) | 2194-5365 |
Bibliographical note
Publisher Copyright:© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords
- Anomaly detection
- Data analytics
- Healthcare
- Heart disease prediction
- K-means clustering
- Machine learning
ASJC Scopus subject areas
- Control and Systems Engineering
- General Computer Science