DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text

This paper describes the development of a multilingual, manually annotated dataset for three under-resourced Dravidian languages generated from social media comments. The dataset was annotated for sentiment analysis and offensive language identification for a total of more than 60,000 YouTube commen...

Full description

Saved in:
Bibliographic Details
Published in:Language resources and evaluation Vol. 56; no. 3; pp. 765 - 806
Main Authors: Chakravarthi, Bharathi Raja, Priyadharshini, Ruba, Muralidaran, Vigneshwaran, Jose, Navya, Suryawanshi, Shardul, Sherly, Elizabeth, McCrae, John P.
Format: Journal Article
Language:English
Published: Dordrecht Springer Netherlands 01.09.2022
Springer Nature B.V
Subjects:
ISSN:1574-020X, 1574-0218
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first