A hybrid decomposition and Machine learning model for forecasting Chlorophyll-a and total nitrogen concentration in coastal waters

•A hybrid model named CVXS was proposed to forecast the ChlA and TN concentrations.•Machine learning was integrated with decomposition algorithms in the CVXS model.•The superiority of the CVXS model over the other models was examined.•The mechanism of the CVXS model for water quality forecasting was...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of hydrology (Amsterdam) Ročník 619; s. 129207
Hlavní autoři: Zhu, Xiaotong, Guo, Hongwei, Huang, Jinhui Jeanne, Tian, Shang, Zhang, Zijie
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 01.04.2023
Témata:
ISSN:0022-1694, 1879-2707
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:•A hybrid model named CVXS was proposed to forecast the ChlA and TN concentrations.•Machine learning was integrated with decomposition algorithms in the CVXS model.•The superiority of the CVXS model over the other models was examined.•The mechanism of the CVXS model for water quality forecasting was explained.•The Long-term forecasting ability of the CVXS model was verified. Information regarding Chlorophyll-a (ChlA) and total nitrogen (TN) is critical for early warning of algal blooms. However, reliable models for accurate forecasting of the ChlA and TN are still lacking due to the optical complexity of coastal waters. To address this issue, we proposed a novel hybrid model named the CEEMDAN-VMD-XGBOOST-SARIMA (CVXS) model to forecast ChlA and TN concentrations. The model performance was validated at three hydrological monitoring stations in Hong Kong, China. Four independent models including extreme gradient boosting (XGBoost), support vector regression (SVR), deep learning (DL), and Seasonal autoregressive integrated moving average (SARIMA), and three hybrid models including complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)-XGBoost, CEEMDAN-SVR, and CEEMDAN-DL were developed to compare their performance with the CVXS model. In addition, the physical mechanisms of the CVXS model were further explored through correlation analysis between the decomposed time series of water quality parameters. The result indicated that (1) the CVXS model had the best accuracy among all models for forecasting ChlA and TN, and all the NSEs remained above 0.97 at three hydrological monitoring stations. For forecasting ChlA, the performance of the eight models is ranked as CVXS > CEEMDAN-XGBoost > CEEMDAN-DL > CEEMDAN-SVR > XGBoost > DL > SARIMA > SVR. For forecasting TN, the performance of the eight models is ranked as CVXS > CEEMDAN-XGBoost > CEEMDAN-DL > CEEMDAN-SVR > XGBoost > SVR > SARIMA > DL; (2) the optimal forecasting time horizons of the CVXS model were one to two months; and (3) the variability of ChlA and TN concentrations induced by hydrologic factors has been inherently embedded in the decomposed time series data, thus providing the theoretical basis for the CVXS model forecasting water quality parameters. The results of this study are promising with respect to forecasting algal blooms and coastal water resource management.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0022-1694
1879-2707
DOI:10.1016/j.jhydrol.2023.129207