Combining Machine Learning and Numerical Simulation for High-Resolution PM2.5 Concentration Forecast

Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Environmental science & technology Jg. 56; H. 3; S. 1544
Hauptverfasser: Bi, Jianzhao, Knowland, K Emma, Keller, Christoph A, Liu, Yang
Format: Journal Article
Sprache:Englisch
Veröffentlicht: 01.02.2022
ISSN:1520-5851, 1520-5851
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models (CTMs) to forecast spatial distribution of PM2.5 with nontrivial uncertainty or statistical algorithms to forecast PM2.5 concentration time series at air monitoring locations without continuous spatial coverage. In this study, we developed a PM2.5 forecast framework by combining the robust Random Forest algorithm with a publicly accessible global CTM forecast product, NASA's Goddard Earth Observing System "Composition Forecasting" (GEOS-CF), providing spatiotemporally continuous PM2.5 concentration forecasts for the next 5 days at a 1 km spatial resolution. Our forecast experiment was conducted for a region in Central China including the populous and polluted Fenwei Plain. The forecast for the next 2 days had an overall validation R2 of 0.76 and 0.64, respectively; the R2 was around 0.5 for the following 3 forecast days. Spatial cross-validation showed similar validation metrics. Our forecast model, with a validation normalized mean bias close to 0, substantially reduced the large biases in GEOS-CF. The proposed framework requires minimal computational resources compared to running CTMs at urban scales, enabling near-real-time PM2.5 forecast in resource-restricted environments.Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models (CTMs) to forecast spatial distribution of PM2.5 with nontrivial uncertainty or statistical algorithms to forecast PM2.5 concentration time series at air monitoring locations without continuous spatial coverage. In this study, we developed a PM2.5 forecast framework by combining the robust Random Forest algorithm with a publicly accessible global CTM forecast product, NASA's Goddard Earth Observing System "Composition Forecasting" (GEOS-CF), providing spatiotemporally continuous PM2.5 concentration forecasts for the next 5 days at a 1 km spatial resolution. Our forecast experiment was conducted for a region in Central China including the populous and polluted Fenwei Plain. The forecast for the next 2 days had an overall validation R2 of 0.76 and 0.64, respectively; the R2 was around 0.5 for the following 3 forecast days. Spatial cross-validation showed similar validation metrics. Our forecast model, with a validation normalized mean bias close to 0, substantially reduced the large biases in GEOS-CF. The proposed framework requires minimal computational resources compared to running CTMs at urban scales, enabling near-real-time PM2.5 forecast in resource-restricted environments.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1520-5851
1520-5851
DOI:10.1021/acs.est.1c05578