Imbalanced Classification Based on Minority Clustering Synthetic Minority Oversampling Technique With Wind Turbine Fault Detection Application

Synthetic minority oversampling technique (SMOTE) has been widely used in dealing with the imbalance classification problem in the machine learning field. However, classical SMOTE implements the oversampling by linear interpolation between adjacent minority class samples, which may fail to consider...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE transactions on industrial informatics Ročník 17; číslo 9; s. 5867 - 5875
Hlavní autoři: Yi, Huaikuan, Jiang, Qingchao, Yan, Xuefeng, Wang, Bei
Médium: Journal Article
Jazyk:angličtina
Vydáno: Piscataway IEEE 01.09.2021
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:
ISSN:1551-3203, 1941-0050
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Synthetic minority oversampling technique (SMOTE) has been widely used in dealing with the imbalance classification problem in the machine learning field. However, classical SMOTE implements the oversampling by linear interpolation between adjacent minority class samples, which may fail to consider the uneven distribution of the samples. This article proposes a minority clustering SMOTE (MC-SMOTE) method that involves the clustering of minority class samples to improve the imbalance classification performance. First, samples from the minority class are clustered into several clusters. Second, oversampling is performed by linear interpolation between adjacent clusters to create new samples from different clusters that contain additional information of the entire minority class. Then classical classification techniques can be employed to achieve efficient classification. The superiority of the MC-SMOTE is first verified by experiments on some benchmark datasets from various application domains. The proposed method is then applied to the real industrial SCADA data of wind turbine blade icing. Classification results indicate that the MC-SMOTE exhibits a better performance than that of the classical SMOTE.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1551-3203
1941-0050
DOI:10.1109/TII.2020.3046566