Abstract
Software defect prediction is a critical aspect of software quality assurance, aiming to identify faulty modules early in the development lifecycle to mitigate potential risks and reduce maintenance costs. This paper presents a comprehensive stacking ensemble framework that amalgamates multiple machine learning techniques to enhance defect prediction accuracy. The methodology encompasses meticulous data preprocessing, a two-stage feature selection process involving Minimum Redundancy Maximum Relevance (mRMR) and polynomial feature expansion, followed by dimensionality reduction using Principal Component Analysis (PCA). Four diverse base learners - Extreme Learning Machine (ELM), Support Vector Machine (SVM), Random Forest, and XGBoost - are trained on the transformed feature set, and their outputs are integrated through a Logistic Regression meta-learner. Empirical evaluations conducted on five benchmark NASA datasets - MC1, CM1, KC2, KC3, and PC1 - demonstrate the robustness of the proposed framework, achieving accuracies up to 93.80%. These results underscore the efficacy of leveraging ensemble learning and sophisticated feature engineering in capturing intricate data patterns for improved software defect detection.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 29th International Conference on Evaluation and Assessment in Software Engineering , EASE, 2025 edition, EASE Companion 2025 |
| Editors | Muhammad Ali Babar, Ayse Tosun, Stefan Wagner, Viktoria Stray |
| Publisher | Association for Computing Machinery, Inc |
| Pages | 28-34 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798400718328 |
| DOIs | |
| State | Published - 23 Dec 2025 |
| Event | 29th International Conference on Evaluation and Assessment of Software Engineering, EASE 2025 - Istanbul, Turkey Duration: 17 Jun 2025 → 20 Jun 2025 |
Publication series
| Name | Proceedings of the 29th International Conference on Evaluation and Assessment in Software Engineering , EASE, 2025 edition, EASE Companion 2025 |
|---|
Conference
| Conference | 29th International Conference on Evaluation and Assessment of Software Engineering, EASE 2025 |
|---|---|
| Country/Territory | Turkey |
| City | Istanbul |
| Period | 17/06/25 → 20/06/25 |
Bibliographical note
Publisher Copyright:© 2025 Copyright held by the owner/author(s).
Keywords
- Extreme Learning Machine
- NASA datasets
- PCA
- Software defect prediction
- mRMR
- stacking ensemble
ASJC Scopus subject areas
- Software