FNNWV: farthest-nearest neighbor-based weighted voting for class-imbalanced crowdsourcing
In crowdsourcing scenarios, we can hire crowd workers to label crowdsourced tasks and then use label integration algorithms to infer the integrated label for each instance in the tasks. As more and more label integration algorithms are proposed, the performance of inference based only on the informa...
Uloženo v:
| Vydáno v: | Science China. Information sciences Ročník 67; číslo 10; s. 202102 |
|---|---|
| Hlavní autoři: | , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Beijing
Science China Press
01.10.2024
Springer Nature B.V |
| Témata: | |
| ISSN: | 1674-733X, 1869-1919 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | In crowdsourcing scenarios, we can hire crowd workers to label crowdsourced tasks and then use label integration algorithms to infer the integrated label for each instance in the tasks. As more and more label integration algorithms are proposed, the performance of inference based only on the information of the inferred instance gradually converges. Recent algorithms attempt to exploit the information of the inferred instance’s nearest neighbors to infer and achieve good performance. However, when crowdsourced tasks are class-imbalanced, negative instances are more easily to occur in the nearest neighbors because negative instances are the majority, and thus recent algorithms are more easily biased toward the negative class. To this end, in this paper, we propose a novel label integration algorithm called farthest-nearest neighbor-based weighted voting (FNNWV) for class-imbalanced crowdsourcing. Specifically, FNNWV considers the nearest neighbors to be more similar to the inferred instance and thus uses them to vote ayes in weighted voting. Yet at the same time, FNNWV considers the farthest neighbors to be more different from the inferred instance and thus uses them to vote nays in weighted voting. Since negative instances are easier to occur in both the nearest neighbors and the farthest neighbors, FNNWV weakens the effect of negative instances by voting ayes and nays. The experimental results on 22 simulated and one real-world crowdsourced datasets show that FNNWV significantly outperforms all the other state-of-the-art competitors. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1674-733X 1869-1919 |
| DOI: | 10.1007/s11432-023-3854-7 |