Solar Flare Prediction and Feature Selection Using a Light-Gradient-Boosting Machine Algorithm

Solar flares are among the most severe space-weather phenomena, and they have the capacity to generate radiation storms and radio disruptions on Earth. The accurate prediction of solar-flare events remains a significant challenge, requiring continuous monitoring and identification of specific featur...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Solar physics Jg. 298; H. 11; S. 137
Hauptverfasser: Vysakh, P. A., Mayank, Prateek
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Dordrecht Springer Netherlands 01.11.2023
Springer Nature B.V
Schlagworte:
ISSN:0038-0938, 1573-093X
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Solar flares are among the most severe space-weather phenomena, and they have the capacity to generate radiation storms and radio disruptions on Earth. The accurate prediction of solar-flare events remains a significant challenge, requiring continuous monitoring and identification of specific features that can aid in forecasting this phenomenon, particularly for different classes of solar flares. In this study, we aim to forecast C- and M-Class solar flares utilising a machine-learning algorithm, namely the Light Gradient Boosting Machine. We have utilised a dataset spanning nine years, obtained from the Space-weather Helioseismic and Magnetic Imager Active Region Patches (SHARP), with a temporal resolution of 1 h. A total of 37 flare features were considered in our analysis, comprising of 25 active-region parameters and 12 flare-history features. To address the issue of class imbalance in solar-flare data, we employed the Synthetic Minority Over-sampling Technique (SMOTE). We used two labelling approaches in our study: a fixed 24-h window label and a varying window that considers the changing nature of solar activity. Then, the developed machine-learning algorithm was trained and tested using forecast-verification metrics, with an emphasis on evaluating the true skill statistic (TSS). Furthermore, we implemented a feature-selection algorithm to determine the most significant features from the pool of 37 features that could distinguish between flaring and non-flaring active regions. We found that utilising a limited set of useful features resulted in improved prediction performance. For the 24-h prediction window, we achieved a TSS of 0.63 (0.69) and an accuracy of 0.90 (0.97) for ≥C- (≥M)-Class solar flares.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0038-0938
1573-093X
DOI:10.1007/s11207-023-02223-5