Empirical study of homogeneous and heterogeneous ensemble models for software development effort estimation

Mahmoud O. Elish*, Tarek Helmy, Muhammad Imtiaz Hussain

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

51 Scopus citations


Accurate estimation of software development effort is essential for effective management and control of software development projects. Many software effort estimation methods have been proposed in the literature including computational intelligence models. However, none of the existing models proved to be suitable under all circumstances; that is, their performance varies from one dataset to another. The goal of an ensemble model is to manage each of its individual models' strengths and weaknesses automatically, leading to the best possible decision being taken overall. In this paper, we have developed different homogeneous and heterogeneous ensembles of optimized hybrid computational intelligence models for software development effort estimation. Different linear and nonlinear combiners have been used to combine the base hybrid learners. We have conducted an empirical study to evaluate and compare the performance of these ensembles using five popular datasets. The results confirm that individual models are not reliable as their performance is inconsistent and unstable across different datasets. Although none of the ensemble models was consistently the best, many of them were frequently among the best models for each dataset. The homogeneous ensemble of support vector regression (SVR), with the nonlinear combiner adaptive neurofuzzy inference systems-subtractive clustering (ANFIS-SC), was the best model when considering the average rank of each model across the five datasets.

Original languageEnglish
Article number312067
JournalMathematical Problems in Engineering
StatePublished - 2013

ASJC Scopus subject areas

  • General Mathematics
  • General Engineering


Dive into the research topics of 'Empirical study of homogeneous and heterogeneous ensemble models for software development effort estimation'. Together they form a unique fingerprint.

Cite this