Evaluating machine learning approaches for aboveground biomass prediction in fragmented high-elevated forests using multi-sensor satellite data

Accurate aboveground biomass (AGB) estimations over large areas are essential for assessing carbon stocks and forest resources. This study evaluated machine learning approaches for AGB modeling in Pakistan's mountainous region of Diamir district using freely available Sentinel-1 and Sentinel-2...

Full description

Saved in:
Bibliographic Details
Published in:Remote sensing applications Vol. 36; p. 101291
Main Authors: Qadeer, Asim, Shakir, Muhammad, Wang, Li, Talha, Syed Muhammad
Format: Journal Article
Language:English
Published: Elsevier B.V 01.11.2024
Subjects:
ISSN:2352-9385, 2352-9385
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate aboveground biomass (AGB) estimations over large areas are essential for assessing carbon stocks and forest resources. This study evaluated machine learning approaches for AGB modeling in Pakistan's mountainous region of Diamir district using freely available Sentinel-1 and Sentinel-2 data and 171 field-measured AGB training points. Random Forest, Gradient Tree Boosting, CatBoost, LightGBM, and XGBoost algorithms were implemented and optimized. Models were developed using individual and combined datasets. Sentinel-2 optical data outperformed Sentinel-1 radar data, but the fusion of both sensors achieved the highest accuracy (R2 > 0.7, RMSE = 105.64 Mg/ha, MAE = 85.34 Mg/ha). Tree canopy height was the most informative predictor for this data, besides terrain variables and radar textures. The machine learning models significantly improved AGB estimates compared to traditional regression techniques, and gradient boosters outperformed Random Forest. This research demonstrates the potential of multi-sensor remote sensing data and advanced algorithms for forest biomass mapping in complex terrain, with modeling accuracies reaching root mean squared errors below 90 Mg/ha. The framework provides an effective solution for monitoring biomass using freely available satellite data. Further refinements include integrating higher-resolution optical data and additional field samples for better validation. This study contributes to remote sensing capabilities for assessing vegetation carbon stocks and dynamics. •Evaluated machine learning algorithms for aboveground biomass estimation•Gradient boosting algorithms significantly outperformed Random Forest•Sentinel-2 models achieved the highest accuracy for LightGBM and XGBoost models•Key predictors were tree canopy height, terrain variables, and radar data textures
ISSN:2352-9385
2352-9385
DOI:10.1016/j.rsase.2024.101291