Skip to main navigation Skip to search Skip to main content

An Adaptive Oversampling Method for Imbalanced Datasets Based on Mean-Shift and SMOTE

  • Ahmed S. Ghorab*
  • , Wesam M. Ashour
  • , Shadi I. Abudalfa
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Class imbalance is a challenge in different actual datasets, where the majority class contains a large number of data points, and the minority class contains a small number of data points. Class imbalance affects the learning process negatively, resulting in classification algorithms’ ignorance of the minority class. To address this issue, various researchers developed different algorithms to tackle the problem; however, the majority of these algorithms are complex and generate noise. This paper provides a simple and effective oversampling technique based on the mean-shift clustering algorithm and using the synthetic minority oversampling technique (SMOTE) of selected clusters. We conducted several experiments to compare the performance of our technique with different algorithms mentioned in the literature on three common datasets. Experimental results indicate that our technique performs better in synthesizing new samples and improves support vector machine (SVM) classification performance on imbalanced datasets.

Original languageEnglish
Title of host publicationExplore Business, Technology Opportunities and Challenges ‎After the Covid-19 Pandemic
EditorsBahaaeddin Alareeni, Allam Hamdan
PublisherSpringer Science and Business Media Deutschland GmbH
Pages13-23
Number of pages11
ISBN (Print)9783031089534
DOIs
StatePublished - 2023
Externally publishedYes
EventInternational Conference on Business and Technology , ICBT 2021 - Istanbul, Turkey
Duration: 6 Nov 20217 Nov 2021

Publication series

NameLecture Notes in Networks and Systems
Volume495 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceInternational Conference on Business and Technology , ICBT 2021
Country/TerritoryTurkey
CityIstanbul
Period6/11/217/11/21

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Keywords

  • Imbalanced datasets
  • Mean-shift clustering
  • Support Vector Machine (SVM)
  • Synthetic Minority Oversampling Technique (SMOTE)

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'An Adaptive Oversampling Method for Imbalanced Datasets Based on Mean-Shift and SMOTE'. Together they form a unique fingerprint.

Cite this