CONTENT BASED TWEET CLASSIFICATION ON TWITTER

Uloženo v:
Podrobná bibliografie
Název: CONTENT BASED TWEET CLASSIFICATION ON TWITTER
Autoři: Lalitha, Prof. L A, P, Sumitha, Krishnan, P. Snavaja, S, Sushmita, R M, Vinaya
Zdroj: International Journal of Advanced Research in Computer Science; Vol. 11 (2020): VOLUME 11 SPECIAL ISSUE 1, MAY 2020; 341-345 ; 0976-5697 ; 10.26483/ijarcs.v11i0
Informace o vydavateli: International Journal of Advanced Research in Computer Science
Rok vydání: 2020
Témata: Content-Based Classification, social network, Feature Extraction, text processing, Random Forest
Popis: Today, Social Media Networks are more powerful and popular than any other forms of media that exist and due to this global nature of social media, the amount of information available and being shared online by the users is tremendous. This large data that is available can be used for different purposes like marketing, data analysis, community detection, fraud detection, sentiment analysis, etc. In this work, we present a model to classify tweets in Twitter and therefore offer a solution to process large amounts of data and derive meaningful conclusions from the same. Here, we first collect tweets from different communities on twitter and process this raw dataset. This processed data is then converted into a vector form so that the textual information is converted to a numeric form for the machine to implement and then a text classification algorithm is applied to this dataset. Finally, after training the machine using this dataset, the working of the model and its accuracy is evaluated by using a dataset of test tweets where the machine predicts the category to which the test tweet belongs. With this model, we have been able to classify tweets into different categories and have achieved satisfactory results.Â
Druh dokumentu: article in journal/newspaper
conference object
Popis souboru: application/pdf
Jazyk: English
Relation: http://www.ijarcs.info/index.php/Ijarcs/article/view/6626/5342; http://www.ijarcs.info/index.php/Ijarcs/article/view/6626
DOI: 10.26483/ijarcs.v11i0.6626
Dostupnost: http://www.ijarcs.info/index.php/Ijarcs/article/view/6626
https://doi.org/10.26483/ijarcs.v11i0.6626
Rights: Copyright (c) 2020 International Journal of Advanced Research in Computer Science
Přístupové číslo: edsbas.D240B74D
Databáze: BASE
Popis
Abstrakt:Today, Social Media Networks are more powerful and popular than any other forms of media that exist and due to this global nature of social media, the amount of information available and being shared online by the users is tremendous. This large data that is available can be used for different purposes like marketing, data analysis, community detection, fraud detection, sentiment analysis, etc. In this work, we present a model to classify tweets in Twitter and therefore offer a solution to process large amounts of data and derive meaningful conclusions from the same. Here, we first collect tweets from different communities on twitter and process this raw dataset. This processed data is then converted into a vector form so that the textual information is converted to a numeric form for the machine to implement and then a text classification algorithm is applied to this dataset. Finally, after training the machine using this dataset, the working of the model and its accuracy is evaluated by using a dataset of test tweets where the machine predicts the category to which the test tweet belongs. With this model, we have been able to classify tweets into different categories and have achieved satisfactory results.Â
DOI:10.26483/ijarcs.v11i0.6626