Bayesian-optimized machine learning boosts actual evapotranspiration prediction in water-stressed agricultural regions of China

Gespeichert in:
Bibliographische Detailangaben
Titel: Bayesian-optimized machine learning boosts actual evapotranspiration prediction in water-stressed agricultural regions of China
Autoren: Ahmed Elbeltagi, Aman Srivastava, Xinchun Cao, Vinay Kumar Gautam, Bilel Zerouali, Muhammad Rizwan Aslam, Ali Salem, Hojjat Emami, Elsayed Ahmed Elsadek
Quelle: Scientific Reports, Vol 15, Iss 1, Pp 1-18 (2025)
Verlagsinformationen: Nature Portfolio, 2025.
Publikationsjahr: 2025
Bestand: LCC:Medicine
LCC:Science
Schlagwörter: Data-driven evapotranspiration, Bayesian optimization, Agricultural water management, Agricultural hydrology, Climate data modeling, Irrigation scheduling, Medicine, Science
Beschreibung: Abstract The accurate estimation of actual evapotranspiration (AET) is crucial for sustainable water resource management, especially in water-scarce and agriculturally intensive regions like Beijing and Tianjin, China. Traditional methods for AET estimation, whether empirical or physically based, often face limitations due to high data requirements, limited scalability, and sensitivity to input uncertainties. This creates a critical research gap in providing reliable AET predictions under data-limited conditions. To address this, we evaluated the efficacy of integrating four advanced machine learning (ML) models: Support Vector Machine (SVM), Gaussian Process Regression (GPR), Ensemble Tree, and Neural Network, with Bayesian hyperparameter optimization for AET modeling using the high-resolution TerraClimate dataset spanning 1958–2022. Key meteorological variables, including maximum and minimum temperature (Tmax and Tmin), solar radiation (SR), wind speed (WS), vapor pressure deficit (VPD), and precipitation (PPT), were selected through rigorous correlation and multicollinearity analyses. Model performance was assessed using the coefficient of determination (R2), mean squared error (MSE), root mean squared error (RMSE), and mean absolute error (MAE) on a 75:25 train-test split. Results demonstrate that the optimizable GPR model achieved the highest predictive accuracy (RMSE = 5.54, R2 = 0.98 on test data), outperforming other ML approaches and traditional empirical models. PPT, Tmin, and Tmax emerged as the most influential predictors for AET. Our findings reveal that ML models, particularly when optimized via Bayesian techniques, yield a robust, scalable, and data-efficient alternative for AET estimation in regions with limited meteorological records. This study establishes a new benchmark for AET modeling, with significant implications for irrigation scheduling, drought monitoring, and integrated water management in the North China Plain and comparable agro-ecological regions.
Publikationsart: article
Dateibeschreibung: electronic resource
Sprache: English
ISSN: 2045-2322
Relation: https://doaj.org/toc/2045-2322
DOI: 10.1038/s41598-025-22130-y
Zugangs-URL: https://doaj.org/article/b5b13716ea4c462ea76510e8baa4f41f
Dokumentencode: edsdoj.b5b13716ea4c462ea76510e8baa4f41f
Datenbank: Directory of Open Access Journals
Beschreibung
Abstract:Abstract The accurate estimation of actual evapotranspiration (AET) is crucial for sustainable water resource management, especially in water-scarce and agriculturally intensive regions like Beijing and Tianjin, China. Traditional methods for AET estimation, whether empirical or physically based, often face limitations due to high data requirements, limited scalability, and sensitivity to input uncertainties. This creates a critical research gap in providing reliable AET predictions under data-limited conditions. To address this, we evaluated the efficacy of integrating four advanced machine learning (ML) models: Support Vector Machine (SVM), Gaussian Process Regression (GPR), Ensemble Tree, and Neural Network, with Bayesian hyperparameter optimization for AET modeling using the high-resolution TerraClimate dataset spanning 1958–2022. Key meteorological variables, including maximum and minimum temperature (Tmax and Tmin), solar radiation (SR), wind speed (WS), vapor pressure deficit (VPD), and precipitation (PPT), were selected through rigorous correlation and multicollinearity analyses. Model performance was assessed using the coefficient of determination (R2), mean squared error (MSE), root mean squared error (RMSE), and mean absolute error (MAE) on a 75:25 train-test split. Results demonstrate that the optimizable GPR model achieved the highest predictive accuracy (RMSE = 5.54, R2 = 0.98 on test data), outperforming other ML approaches and traditional empirical models. PPT, Tmin, and Tmax emerged as the most influential predictors for AET. Our findings reveal that ML models, particularly when optimized via Bayesian techniques, yield a robust, scalable, and data-efficient alternative for AET estimation in regions with limited meteorological records. This study establishes a new benchmark for AET modeling, with significant implications for irrigation scheduling, drought monitoring, and integrated water management in the North China Plain and comparable agro-ecological regions.
ISSN:20452322
DOI:10.1038/s41598-025-22130-y