Skip to main navigation Skip to search Skip to main content

DFT-PBE band gap correction using machine learning with a reduced set of features

  • Ibnu Jihad
  • , Miftah Hadi S. Anfa
  • , Saad M. Alqahtani
  • , Fahhad H. Alharbi*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

In the density functional theory (DFT), it is well-known that the generalized gradient approximation (GGA) underestimates the energy gap. One of the commonly used GGA functionals is the Perdew–Burke–Ernzerhof (PBE) functional. This paper presents a machine learning approach to correct the energy gap estimated by PBE, using a minimal set of five features, to the accuracy level of the many-body perturbation theory G0W0 band gap, which is known for its precision but high computational cost. These features are identified by seeking those which can be obtained directly from PBE-DFT calculations or from the standard atomic tables and directly connected to Coulombic interaction, and by minimizing feature correlations. The utilized features are the PBE band gap and the average atomic distance (obtained directly from PBE-DFT calculations) and the average oxidation states, electronegativity, and the minimum difference of electronegativity between constituents (obtained from the standard atomic tables). These features do not require any further atomistic calculations. The developed models are trained and validated using 265 inorganic semiconductors and insulators with energy gaps ranging from 0.75 to 14.55 eV. Several regression techniques have been applied, all of which have produced highly accurate and comparable results. This suggests that the five features employed in the analysis effectively capture the underlying physics. The most accurate model is based on Gaussian process regression (GPR), achieving a validation accuracy of 0.252 eV root-mean-square error (RMSE) and an R2 value of 0.9932. The best bootstrapped model achieves an RMSE of 0.232 eV. The developed GPR model is then applied successfully to completely independent data sets. The developed model should allow calculating gaps with G0W0 accuracy using the relatively much faster DFT-PBE. This is especially useful in materials discovery where computational efficiency is crucial. Furthermore, the model should allow empirical investigation of the influences of different compositional and structural characteristics on the gap.

Original languageEnglish
Article number113153
JournalComputational Materials Science
Volume244
DOIs
StatePublished - Sep 2024

Bibliographical note

Publisher Copyright:
© 2024

Keywords

  • DFT-PBE band gap correction
  • Physics-guided machine learning
  • Reduced set of features

ASJC Scopus subject areas

  • General Computer Science
  • General Chemistry
  • General Materials Science
  • Mechanics of Materials
  • General Physics and Astronomy
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'DFT-PBE band gap correction using machine learning with a reduced set of features'. Together they form a unique fingerprint.

Cite this