Comparative study of machine learning models for evaluating groundwater vulnerability to nitrate contamination

  • Hussam Eldin Elzain
  • , Sang Yong Chung*
  • , Venkatramanan Senapathi
  • , Selvam Sekar
  • , Seung Yeop Lee
  • , Priyadarsi D. Roy
  • , Amjed Hassan
  • , Chidambaram Sabarathinam
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

96 Scopus citations

Abstract

The accurate evaluation of groundwater contamination vulnerability is essential for the management and prevention of groundwater contamination in the watershed. In this study, advanced multiple machine learning (ML) models of Radial Basis Neural Networks (RBNN), Support Vector Regression (SVR), and ensemble Random Forest Regression (RFR) were applied to determine the most accurate performance for the evaluation of groundwater contamination vulnerability. Eight vulnerability factors of DRASTIC-L were rated based on the modified DRASTIC model (MDM) and were used as input data. The adjusted vulnerability index (AVI) with nitrate values was used as output data for the modeling process. The performance of three models was verified using the statistical performance criteria of MAE, RMSE, r2, and ROC/AUC values. The ensemble RFR model showed the highest performance in comparison with standalone SVR and RBNN models. Specifically, ensemble RFR kept all promising solutions during the model performance due to its flexibility and robustness, and the vulnerability map obtained by the RFR model was more accurate for predicting the most vulnerable areas to contamination. It was concluded that ensemble RFR was a robust tool to enhance the evaluation of groundwater contamination vulnerability, and that it could contribute to environmental safety against groundwater contamination.

Original languageEnglish
Article number113061
JournalEcotoxicology and Environmental Safety
Volume229
DOIs
StatePublished - 1 Jan 2022

Bibliographical note

Publisher Copyright:
© 2021

Keywords

  • Adjusted vulnerability index
  • Ensemble random forest regression
  • Machine learning
  • Modified DRASTIC-L model

ASJC Scopus subject areas

  • Pollution
  • Public Health, Environmental and Occupational Health
  • Health, Toxicology and Mutagenesis

Fingerprint

Dive into the research topics of 'Comparative study of machine learning models for evaluating groundwater vulnerability to nitrate contamination'. Together they form a unique fingerprint.

Cite this