Multi-category Classification Problem Oriented Subsampling-Based Active Learning Method

Traditional active learning methods have achieved gratifying results in the classification tasks of less categories such as binary classification, the application research of active learning in the field of big data problems still faces enormous challenges. Since many active learning query strategie...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of physics. Conference series Jg. 1631; H. 1; S. 12003 - 12011
Hauptverfasser: Shi, Wei, Feng, Yanghe, Cheng, Guangquan, Liu, Shixuan, Liu, Zhong
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Bristol IOP Publishing 01.09.2020
Schlagworte:
ISSN:1742-6588, 1742-6596
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Traditional active learning methods have achieved gratifying results in the classification tasks of less categories such as binary classification, the application research of active learning in the field of big data problems still faces enormous challenges. Since many active learning query strategies need to perform matrix inversion, the amount of calculation increases exponentially with the increase of the scale of the problem, it is difficult to apply these active learning methods in large scale multi-category data classification task. In order to solve this problem, this paper designed a subsampling-based active learning model, and integrate unsupervised clustering algorithm with traditional active learning method, then conducted experiments on Binary Alphadigits and OMNIGLOT data sets. This paper compares the performance of five traditional active learning algorithms using this subsampling method, namely random sampling, uncertainty sampling, query-by-committee, density weighting and learning-based active learning. Through comparative experiments, the feasibility of active learning based on subsampling for solving the multi-category classification problem is verified, and it is found that the subsampling-based method can break the limitations of traditional active learning methods that cannot deal with large-scale data classification.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/1631/1/012003