TY - JOUR
T1 - Boruta extra tree-bidirectional long short-term memory model development for Pan evaporation forecasting
T2 - Investigation of arid climate condition
AU - Karbasi, Masoud
AU - Ali, Mumtaz
AU - Bateni, Sayed M.
AU - Jun, Changhyun
AU - Jamei, Mehdi
AU - Yaseen, Zaher Mundher
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2024/1
Y1 - 2024/1
N2 - In this study, two deep learning approaches, bidirectional long short-term memory (BiLSTM) and long short-term memory (LSTM), were used along with adaptive boosting and general regression neural network to forecast multi-step-ahead pan evaporation in two arid climate stations in Iran (Ahvaz and Yazd). Lagged time series of meteorological data and pan evaporation data were input to the machine learning models. Two feature selection methods, i.e., the Boruta extra tree and XGBoost, were used to select significant inputs to reduce the number of inputs and model complexity. Different statistical metrics were used to investigate the model performance. The results demonstrated that Boruta-extra-tree-based models were more accurate than XGBoost-based models. Compared with the machine learning techniques, the combination of Boruta extra tree and BiLSTM enabled more accurate one-day-ahead forecasting of pan evaporation for both sites (Root Mean Square Error (RMSE) = 1.6857, for the Ahvaz station, and RMSE = 1.3996 for the Yazd station). The proposed model was used to forecast up to 30 days ahead of pan evaporation in both stations. The results showed that the Boruta-BiLSTM model could accurately forecast the pan evaporation for up to 30 days in both stations.
AB - In this study, two deep learning approaches, bidirectional long short-term memory (BiLSTM) and long short-term memory (LSTM), were used along with adaptive boosting and general regression neural network to forecast multi-step-ahead pan evaporation in two arid climate stations in Iran (Ahvaz and Yazd). Lagged time series of meteorological data and pan evaporation data were input to the machine learning models. Two feature selection methods, i.e., the Boruta extra tree and XGBoost, were used to select significant inputs to reduce the number of inputs and model complexity. Different statistical metrics were used to investigate the model performance. The results demonstrated that Boruta-extra-tree-based models were more accurate than XGBoost-based models. Compared with the machine learning techniques, the combination of Boruta extra tree and BiLSTM enabled more accurate one-day-ahead forecasting of pan evaporation for both sites (Root Mean Square Error (RMSE) = 1.6857, for the Ahvaz station, and RMSE = 1.3996 for the Yazd station). The proposed model was used to forecast up to 30 days ahead of pan evaporation in both stations. The results showed that the Boruta-BiLSTM model could accurately forecast the pan evaporation for up to 30 days in both stations.
KW - Boruta Feature Selection
KW - Deep Learning
KW - Forecasting
KW - Machine Learning
KW - Pan Evaporation
UR - https://www.scopus.com/pages/publications/85178996271
U2 - 10.1016/j.aej.2023.11.061
DO - 10.1016/j.aej.2023.11.061
M3 - Article
AN - SCOPUS:85178996271
SN - 1110-0168
VL - 86
SP - 425
EP - 442
JO - Alexandria Engineering Journal
JF - Alexandria Engineering Journal
ER -