Meticulous estimation of maize actual evapotranspiration: A comprehensive explainable CatBoost algorithm reinforced with Jackknife uncertainty paradigm

Mina Rahimi, Masoud Karbasi*, Mehdi Jamei, Vahid Rezaverdinejad, Anurag Malik, Aitazaz A. Farooque, Zaher Mundher Yaseen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Accurately estimating daily actual evapotranspiration (AET) is essential for managing water resources in irrigated regions. The current study employed a new machine learning technique (CatBoost) to predict maize AET using meteorological and soil-related data. Four benchmark machine learning techniques (Random Forest, Extra Tree, multi-layer perceptron neural network, and K-nearest neighbor) were used for comparison. The lysimeter data of maize AET from Bushland (Texas) in the US were selected to evaluate the performance of the models. The data contained different soil and meteorological parameters. Four different scenarios (comb1: All of the data, comb2: Based on Lasso regression feature selection, comb3: Based on Boruta feature selection algorithm, and comb4: Common meteorological data) were used to predict AET. Various statistical metrics were employed to assess the models’ performance, including the determination coefficient (R2) and root mean square error (RMSE). Comparison between different scenarios showed that the Boruta technique improves precision and decreases computation time by reducing the dimension of the input data. The CatBoost model had the best accuracy in all scenarios. The current study showed that the CatBoost algorithm (comb3 scenario) can predict AET with higher accuracy (R2 = 9.625 × 10−1 and RMSE = 5.594 × 10−1 mm/d). Combining the comb3 scenario with extra tree (R2 = 9.514 × 10−1 and RMSE = 6.716 × 10−1 mm/d) and random forest (R2 = 9.444 × 10−1 and RMSE = 7.084 × 10−1 mm/d) models ranked second and third best accuracy. Also, the SHAP analysis was performed to interpret the black-box model outputs. The SHAP analysis showed that net radiation and air temperature are the most important input parameters for AET prediction.

Original languageEnglish
Article number110599
JournalComputers and Electronics in Agriculture
Volume237
DOIs
StatePublished - Oct 2025

Bibliographical note

Publisher Copyright:
© 2025 The Author(s)

Keywords

  • Actual Evapotranspiration
  • Feature selection
  • Machine learning
  • SHAP
  • Uncertainty analysis

ASJC Scopus subject areas

  • Forestry
  • Agronomy and Crop Science
  • Computer Science Applications
  • Horticulture

Fingerprint

Dive into the research topics of 'Meticulous estimation of maize actual evapotranspiration: A comprehensive explainable CatBoost algorithm reinforced with Jackknife uncertainty paradigm'. Together they form a unique fingerprint.

Cite this