Multi-split optimized bagging ensemble model selection for multi-class educational data mining

Mohammad Noor Injadat*, Abdallah Moubayed, Ali Bou Nassif, Abdallah Shami

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

71 Scopus citations

Abstract

Predicting students’ academic performance has been a research area of interest in recent years, with many institutions focusing on improving the students’ performance and the education quality. The analysis and prediction of students’ performance can be achieved using various data mining techniques. Moreover, such techniques allow instructors to determine possible factors that may affect the students’ final marks. To that end, this work analyzes two different undergraduate datasets at two different universities. Furthermore, this work aims to predict the students’ performance at two stages of course delivery (20% and 50% respectively). This analysis allows for properly choosing the appropriate machine learning algorithms to use as well as optimize the algorithms’ parameters. Furthermore, this work adopts a systematic multi-split approach based on Gini index and p-value. This is done by optimizing a suitable bagging ensemble learner that is built from any combination of six potential base machine learning algorithms. It is shown through experimental results that the posited bagging ensemble models achieve high accuracy for the target group for both datasets.

Original languageEnglish
Pages (from-to)4506-4528
Number of pages23
JournalApplied Intelligence
Volume50
Issue number12
DOIs
StatePublished - Dec 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.

Keywords

  • e-Learning
  • Gini Index
  • Optimized Bagging Ensemble Learning Model Selection
  • Student Performance Prediction

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Multi-split optimized bagging ensemble model selection for multi-class educational data mining'. Together they form a unique fingerprint.

Cite this