Combining Machine Learning and Numerical Simulation for High-Resolution PM2.5 Concentration Forecast

Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Environmental science & technology Ročník 56; číslo 3; s. 1544
Hlavní autoři: Bi, Jianzhao, Knowland, K Emma, Keller, Christoph A, Liu, Yang
Médium: Journal Article
Jazyk:angličtina
Vydáno: 01.02.2022
ISSN:1520-5851, 1520-5851
On-line přístup:Zjistit podrobnosti o přístupu
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models (CTMs) to forecast spatial distribution of PM2.5 with nontrivial uncertainty or statistical algorithms to forecast PM2.5 concentration time series at air monitoring locations without continuous spatial coverage. In this study, we developed a PM2.5 forecast framework by combining the robust Random Forest algorithm with a publicly accessible global CTM forecast product, NASA's Goddard Earth Observing System "Composition Forecasting" (GEOS-CF), providing spatiotemporally continuous PM2.5 concentration forecasts for the next 5 days at a 1 km spatial resolution. Our forecast experiment was conducted for a region in Central China including the populous and polluted Fenwei Plain. The forecast for the next 2 days had an overall validation R2 of 0.76 and 0.64, respectively; the R2 was around 0.5 for the following 3 forecast days. Spatial cross-validation showed similar validation metrics. Our forecast model, with a validation normalized mean bias close to 0, substantially reduced the large biases in GEOS-CF. The proposed framework requires minimal computational resources compared to running CTMs at urban scales, enabling near-real-time PM2.5 forecast in resource-restricted environments.Forecasting ambient PM2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models (CTMs) to forecast spatial distribution of PM2.5 with nontrivial uncertainty or statistical algorithms to forecast PM2.5 concentration time series at air monitoring locations without continuous spatial coverage. In this study, we developed a PM2.5 forecast framework by combining the robust Random Forest algorithm with a publicly accessible global CTM forecast product, NASA's Goddard Earth Observing System "Composition Forecasting" (GEOS-CF), providing spatiotemporally continuous PM2.5 concentration forecasts for the next 5 days at a 1 km spatial resolution. Our forecast experiment was conducted for a region in Central China including the populous and polluted Fenwei Plain. The forecast for the next 2 days had an overall validation R2 of 0.76 and 0.64, respectively; the R2 was around 0.5 for the following 3 forecast days. Spatial cross-validation showed similar validation metrics. Our forecast model, with a validation normalized mean bias close to 0, substantially reduced the large biases in GEOS-CF. The proposed framework requires minimal computational resources compared to running CTMs at urban scales, enabling near-real-time PM2.5 forecast in resource-restricted environments.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1520-5851
1520-5851
DOI:10.1021/acs.est.1c05578