Sentinel-2 Masking CNNs Trained on Physics-Supervised Labels

Gespeichert in:
Bibliographische Detailangaben
Titel: Sentinel-2 Masking CNNs Trained on Physics-Supervised Labels
Autoren: Efrain Padilla-Zepeda, Kevin Alonso, Raquel De Los Reyes, Deni Torres-Roman, Avi Putri Pertiwi, Tobias Storch
Quelle: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol 18, Pp 17247-17264 (2025)
Verlagsinformationen: Institute of Electrical and Electronics Engineers (IEEE), 2025.
Publikationsjahr: 2025
Schlagwörter: Classification algorithms, QC801-809, multispectral, sentinel-2, Geophysics. Cosmic physics, deep learning, Earth, Deep learning, Remote sensing, Classification, pixel-level, Ocean engineering, Atmospheric modeling, Optical sensors, Optical reflection, Training, Feature extraction, Convolutional neural networks, masking algorithm, TC1501-1800
Beschreibung: This article presents a method to improve pixel-level classification of Sentinel-2 imagery by integrating spectral index-based masking with deep learning approaches using 1-D, 2-D, and 3-D convolutional neural networks (CNN1D, CNN2D, and CNN3D). Rather than relying on manually labeled data, the proposed method selects high-quality training samples from Python-based atmospheric correction software (PACO), using pixel selection strategies to remove ambiguous or inconsistent labels. Three selection strategies are explored: full inclusion, uniqueness-based filtering, and physics-based rules. Unlike traditional masking algorithms based only on spectral indices, the CNN models leverage spatial correlations among neighboring pixels across all spectral bands, plus auxiliary features like elevation and illumination, enabling the extraction of more informative representations and improved classification accuracy, particularly in complex scenes. The model is trained using a large global training dataset from PACO, while a separate validation dataset from the same source is used to monitor performance during learning and prevent overfitting. Final evaluation is performed using two independent manually labeled testing datasets (TD1 and TD2) that span diverse land cover types and atmospheric conditions. Compared to PACO’s baseline classification, our CNN approaches achieve consistent improvements for normalized Matthews correlation coefficient, with maximum gains of +3.3 percentage points (pp) on TD1 (from 0.855 to 0.888) and +18.3pp on TD2 (from 0.665 to 0.848). The largest class-wise gains are observed for shadows and clear land-related classes, with up to +22.7pp improvement. These results confirm the effectiveness of the proposed training strategy and its potential for improving label quality in large-scale Earth observation pipelines.
Publikationsart: Article
ISSN: 2151-1535
1939-1404
DOI: 10.1109/jstars.2025.3581058
Zugangs-URL: https://doaj.org/article/7e3830700ac54723a1035ba79381c110
Rights: CC BY
Dokumentencode: edsair.doi.dedup.....954329e329301157073c19362f68588a
Datenbank: OpenAIRE
Beschreibung
Abstract:This article presents a method to improve pixel-level classification of Sentinel-2 imagery by integrating spectral index-based masking with deep learning approaches using 1-D, 2-D, and 3-D convolutional neural networks (CNN1D, CNN2D, and CNN3D). Rather than relying on manually labeled data, the proposed method selects high-quality training samples from Python-based atmospheric correction software (PACO), using pixel selection strategies to remove ambiguous or inconsistent labels. Three selection strategies are explored: full inclusion, uniqueness-based filtering, and physics-based rules. Unlike traditional masking algorithms based only on spectral indices, the CNN models leverage spatial correlations among neighboring pixels across all spectral bands, plus auxiliary features like elevation and illumination, enabling the extraction of more informative representations and improved classification accuracy, particularly in complex scenes. The model is trained using a large global training dataset from PACO, while a separate validation dataset from the same source is used to monitor performance during learning and prevent overfitting. Final evaluation is performed using two independent manually labeled testing datasets (TD1 and TD2) that span diverse land cover types and atmospheric conditions. Compared to PACO’s baseline classification, our CNN approaches achieve consistent improvements for normalized Matthews correlation coefficient, with maximum gains of +3.3 percentage points (pp) on TD1 (from 0.855 to 0.888) and +18.3pp on TD2 (from 0.665 to 0.848). The largest class-wise gains are observed for shadows and clear land-related classes, with up to +22.7pp improvement. These results confirm the effectiveness of the proposed training strategy and its potential for improving label quality in large-scale Earth observation pipelines.
ISSN:21511535
19391404
DOI:10.1109/jstars.2025.3581058