A Spatiotemporal Interrogation of Hydrologic Drought Model Performance for Machine Learning Model Interpretability

Gespeichert in:
Bibliographische Detailangaben
Titel: A Spatiotemporal Interrogation of Hydrologic Drought Model Performance for Machine Learning Model Interpretability
Autoren: Ali Dadkhah, Scott D. Hamshaw, Ryan van derHeijden, Donna M. Rizzo
Quelle: Water Resources Research, Vol 61, Iss 11, Pp n/a-n/a (2025)
Verlagsinformationen: Wiley, 2025.
Publikationsjahr: 2025
Bestand: LCC:Environmental sciences
Schlagwörter: interpretable machine learning, model performance, hydrologic model, drought forecasting, ungaged basins, clustering, Environmental sciences, GE1-350
Beschreibung: Abstract The predictive accuracy of regional hydrologic models often varies across both time and space. Interpreting relationships between watershed characteristics, hydrologic regimes, and model performance can reveal potential areas for model improvement. In this study, we use machine learning to assess model performance of a regional hydrologic model to forecast the occurrence of streamflow drought. We demonstrate our methodology using a regional long short‐term memory (LSTM) deep learning model developed by the U.S. Geological Survey (USGS) and data from 384 streamgages across the Colorado River Basin region. Performance was assessed by clustering catchments using: (a) physical and climatological catchment attributes, and (b) streamflow drought signatures time series. We examined the association of USGS LSTM model error measures with clusters generated by both approaches to interpret meaningful spatial and temporal information about LSTM model performance. Clustering static catchment attributes identified elevation, degree of streamflow regulation, baseflow contribution, catchment aridity, and drainage area as the most influential attributes to model performance. Clustering gages by their drought signatures revealed that catchments with significant seasonal peak runoff between January and June generally exhibited better model performance. Additionally, a Random Forest classifier was trained to successfully predict LSTM model performance (F1 score of 0.72) based on physical and climatological catchment attributes. Low degree of flow regulation was identified as a key indicator of better LSTM model performance. These findings point to the opportunities for improving the USGS LSTM model performance in future hydrologic drought prediction efforts across regional and CONUS scales.
Publikationsart: article
Dateibeschreibung: electronic resource
Sprache: English
ISSN: 1944-7973
0043-1397
Relation: https://doaj.org/toc/0043-1397; https://doaj.org/toc/1944-7973
DOI: 10.1029/2024WR039077
Zugangs-URL: https://doaj.org/article/fcfabe0d591e44bd986dd8c89ba837d1
Dokumentencode: edsdoj.fcfabe0d591e44bd986dd8c89ba837d1
Datenbank: Directory of Open Access Journals
Beschreibung
Abstract:Abstract The predictive accuracy of regional hydrologic models often varies across both time and space. Interpreting relationships between watershed characteristics, hydrologic regimes, and model performance can reveal potential areas for model improvement. In this study, we use machine learning to assess model performance of a regional hydrologic model to forecast the occurrence of streamflow drought. We demonstrate our methodology using a regional long short‐term memory (LSTM) deep learning model developed by the U.S. Geological Survey (USGS) and data from 384 streamgages across the Colorado River Basin region. Performance was assessed by clustering catchments using: (a) physical and climatological catchment attributes, and (b) streamflow drought signatures time series. We examined the association of USGS LSTM model error measures with clusters generated by both approaches to interpret meaningful spatial and temporal information about LSTM model performance. Clustering static catchment attributes identified elevation, degree of streamflow regulation, baseflow contribution, catchment aridity, and drainage area as the most influential attributes to model performance. Clustering gages by their drought signatures revealed that catchments with significant seasonal peak runoff between January and June generally exhibited better model performance. Additionally, a Random Forest classifier was trained to successfully predict LSTM model performance (F1 score of 0.72) based on physical and climatological catchment attributes. Low degree of flow regulation was identified as a key indicator of better LSTM model performance. These findings point to the opportunities for improving the USGS LSTM model performance in future hydrologic drought prediction efforts across regional and CONUS scales.
ISSN:19447973
00431397
DOI:10.1029/2024WR039077