Evaluating Water-Induced Soil Erosion Using Machine Learning: XGBoost as the Most Effective Model

Ihtisham Khan, Kashif Khan, Kazimierz Bęcek*, Muhammad Fahad Bilal

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Soil erosion is a significant environmental concern that threatens agricultural activities, reduces soil fertility, and eventually impacts productivity. Assessing soil erosion is essential for effective planning and conservation initiatives in a basin or watershed. This study evaluates water-induced soil erosion susceptibility using machine learning models, with a focus on the comparative performance of Random Forest (RF), k-Nearest Neighbors (kNN), and Extreme Gradient Boosting (XGBoost). Unlike conventional approaches, this study emphasizes the effectiveness of ML-based predictive modeling, rather than re-identifying well-established erosion-controlling factors. A comprehensive dataset comprising topographic, climatic, and land use parameters was used to train and validate the models (80% training, 20% testing). The models were assessed based on multiple performance metrics, including sensitivity, specificity, Kappa coefficient, and area under the curve (AUC). Among the tested models, XGBoost demonstrated the highest predictive performance with an AUC of 0.91, sensitivity of 0.91, specificity of 0.89, and a Kappa index of 0.80. RF and kNN also performed well, with AUC values of 0.87 and 0.89, and Kappa values of 0.80 and 0.73, respectively. Field validation showed that XGBoost correctly predicted 78.7% of high-risk erosion sites. The final susceptibility map classified 21.3% of the area as high-risk, mainly concentrated in steep, sparsely vegetated uplands. These findings confirm the effectiveness of machine learning—particularly XGBoost—for accurate erosion risk mapping in data-scarce, topographically diverse regions. The findings contribute to sustainable land management strategies, offering a scalable and adaptable approach for erosion risk assessment in diverse environmental settings.

Original languageEnglish
JournalLand Degradation and Development
DOIs
StateAccepted/In press - 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2025 John Wiley & Sons Ltd.

Keywords

  • RF
  • XGBoost
  • kNN
  • soil erosion
  • soil fertility
  • susceptibility

ASJC Scopus subject areas

  • Environmental Chemistry
  • Development
  • General Environmental Science
  • Soil Science

Fingerprint

Dive into the research topics of 'Evaluating Water-Induced Soil Erosion Using Machine Learning: XGBoost as the Most Effective Model'. Together they form a unique fingerprint.

Cite this