FNNWV: farthest-nearest neighbor-based weighted voting for class-imbalanced crowdsourcing

In crowdsourcing scenarios, we can hire crowd workers to label crowdsourced tasks and then use label integration algorithms to infer the integrated label for each instance in the tasks. As more and more label integration algorithms are proposed, the performance of inference based only on the informa...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Science China. Information sciences Ročník 67; číslo 10; s. 202102
Hlavní autori: Zhang, Wenjun, Jiang, Liangxiao, Chen, Ziqi, Li, Chaoqun
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Beijing Science China Press 01.10.2024
Springer Nature B.V
Predmet:
ISSN:1674-733X, 1869-1919
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:In crowdsourcing scenarios, we can hire crowd workers to label crowdsourced tasks and then use label integration algorithms to infer the integrated label for each instance in the tasks. As more and more label integration algorithms are proposed, the performance of inference based only on the information of the inferred instance gradually converges. Recent algorithms attempt to exploit the information of the inferred instance’s nearest neighbors to infer and achieve good performance. However, when crowdsourced tasks are class-imbalanced, negative instances are more easily to occur in the nearest neighbors because negative instances are the majority, and thus recent algorithms are more easily biased toward the negative class. To this end, in this paper, we propose a novel label integration algorithm called farthest-nearest neighbor-based weighted voting (FNNWV) for class-imbalanced crowdsourcing. Specifically, FNNWV considers the nearest neighbors to be more similar to the inferred instance and thus uses them to vote ayes in weighted voting. Yet at the same time, FNNWV considers the farthest neighbors to be more different from the inferred instance and thus uses them to vote nays in weighted voting. Since negative instances are easier to occur in both the nearest neighbors and the farthest neighbors, FNNWV weakens the effect of negative instances by voting ayes and nays. The experimental results on 22 simulated and one real-world crowdsourced datasets show that FNNWV significantly outperforms all the other state-of-the-art competitors.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1674-733X
1869-1919
DOI:10.1007/s11432-023-3854-7