Long-term multi-step ahead forecasting of root zone soil moisture in different climates: Novel ensemble-based complementary data-intelligent paradigms

Mehdi Jamei*, Masoud Karbasi, Anurag Malik, Mozhdeh Jamei, Ozgur Kisi, Zaher Mundher Yaseen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

29 Scopus citations

Abstract

The root zone soil moisture (RZSM) is essential for monitoring and forecasting agricultural, hydrological, and meteorological systems. Accordingly, researchers are determined to improve robust machine learning (ML) models to increase the accuracy of the RZSM predictions. This paper designed new complementary forecasting paradigms hybridizing Empirical Wavelet Transform (EWT) and two modern ensemble-based ML models, namely, extreme gradient boosting (XGBoost) and categorical boosting (CatBoost), to forecast long-term multi-step ahead daily RZSM in very cold and very warm semi-arid regions of Iran. For this purpose, the required datasets consisting of soil properties and meteorological information were extracted from the satellite datasets during 2005–2020 for Ardabil and Minab sites. Afterward, the significant lags of RZSM time series and optimal influence candidate inputs were sought based on the partial autocorrelation function (PACF) and mutual information techniques, respectively. Selected lagged components of RZSM time series were decomposed using EWT into different sub-sequences and consequently concatenated with candidate inputs to feed the ensemble ML models to forecast one-, three-, and seven-day-ahead RZSM at each case study. The performance of EWT-CatBoost and EWT-XGBoost and their counterpart standalone approaches was firstly evaluated in forecasting one-, three-, and seven-day-ahead RZSM using satellite data in this study and their accuracy were compared with a standalone kernel ridge regression (KRR) and complementary EWT-KRR models based on several statistical metrics (e.g., correlation coefficient (R), root mean square error (RMSE), Nash–Sutcliffe model efficiency coefficient (NSE)) and diagnostic analysis. The outcomes of testing phase in Ardabil site ascertained that the EWT-CatBoost (for RZSM(t + 1), R= 0.9979, RMSE= 0.0019, and NSE= 0.9985; for RZSM(t + 3), R= 0.9934, RMSE= 0.0035, and NSE= 0.9885; for RZSM(t + 7), R= 0.9489, RMSE= 0.0109, and NSE= 0.8634) outperformed the other models. On the other hand, the EWT-XGBoost model according to its best results (for RZSM(t + 1), R= 0.9911, RMSE= 0.0064, and NSE= 0.9805; for RZSM(t + 3), R= 0.9807, RMSE= 0.0092, and NSE= 0.9589; for RZSM(t + 7), R= 0.9680, RMSE= 0.0120, and NSE= 0.9309) yielded the most promising accuracy in forecasting multi-step ahead daily RZSM followed by the EWT-CatBoost, and EWT-KRR, respectively.

Original languageEnglish
Article number107679
JournalAgricultural Water Management
Volume269
DOIs
StatePublished - 1 Jul 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 Elsevier B.V.

Keywords

  • Categorical boosting
  • Empirical wavelet
  • Extreme gradient boosting
  • Forecasting
  • Microwave
  • Root zone soil moisture

ASJC Scopus subject areas

  • Agronomy and Crop Science
  • Water Science and Technology
  • Soil Science
  • Earth-Surface Processes

Fingerprint

Dive into the research topics of 'Long-term multi-step ahead forecasting of root zone soil moisture in different climates: Novel ensemble-based complementary data-intelligent paradigms'. Together they form a unique fingerprint.

Cite this