CONTENT BASED TWEET CLASSIFICATION ON TWITTER
Gespeichert in:
| Titel: | CONTENT BASED TWEET CLASSIFICATION ON TWITTER |
|---|---|
| Autoren: | Lalitha, Prof. L A, P, Sumitha, Krishnan, P. Snavaja, S, Sushmita, R M, Vinaya |
| Quelle: | International Journal of Advanced Research in Computer Science; Vol. 11 (2020): VOLUME 11 SPECIAL ISSUE 1, MAY 2020; 341-345 ; 0976-5697 ; 10.26483/ijarcs.v11i0 |
| Verlagsinformationen: | International Journal of Advanced Research in Computer Science |
| Publikationsjahr: | 2020 |
| Schlagwörter: | Content-Based Classification, social network, Feature Extraction, text processing, Random Forest |
| Beschreibung: | Today, Social Media Networks are more powerful and popular than any other forms of media that exist and due to this global nature of social media, the amount of information available and being shared online by the users is tremendous. This large data that is available can be used for different purposes like marketing, data analysis, community detection, fraud detection, sentiment analysis, etc. In this work, we present a model to classify tweets in Twitter and therefore offer a solution to process large amounts of data and derive meaningful conclusions from the same. Here, we first collect tweets from different communities on twitter and process this raw dataset. This processed data is then converted into a vector form so that the textual information is converted to a numeric form for the machine to implement and then a text classification algorithm is applied to this dataset. Finally, after training the machine using this dataset, the working of the model and its accuracy is evaluated by using a dataset of test tweets where the machine predicts the category to which the test tweet belongs. With this model, we have been able to classify tweets into different categories and have achieved satisfactory results. |
| Publikationsart: | article in journal/newspaper conference object |
| Dateibeschreibung: | application/pdf |
| Sprache: | English |
| Relation: | http://www.ijarcs.info/index.php/Ijarcs/article/view/6626/5342; http://www.ijarcs.info/index.php/Ijarcs/article/view/6626 |
| DOI: | 10.26483/ijarcs.v11i0.6626 |
| Verfügbarkeit: | http://www.ijarcs.info/index.php/Ijarcs/article/view/6626 https://doi.org/10.26483/ijarcs.v11i0.6626 |
| Rights: | Copyright (c) 2020 International Journal of Advanced Research in Computer Science |
| Dokumentencode: | edsbas.D240B74D |
| Datenbank: | BASE |
| Abstract: | Today, Social Media Networks are more powerful and popular than any other forms of media that exist and due to this global nature of social media, the amount of information available and being shared online by the users is tremendous. This large data that is available can be used for different purposes like marketing, data analysis, community detection, fraud detection, sentiment analysis, etc. In this work, we present a model to classify tweets in Twitter and therefore offer a solution to process large amounts of data and derive meaningful conclusions from the same. Here, we first collect tweets from different communities on twitter and process this raw dataset. This processed data is then converted into a vector form so that the textual information is converted to a numeric form for the machine to implement and then a text classification algorithm is applied to this dataset. Finally, after training the machine using this dataset, the working of the model and its accuracy is evaluated by using a dataset of test tweets where the machine predicts the category to which the test tweet belongs. With this model, we have been able to classify tweets into different categories and have achieved satisfactory results. |
|---|---|
| DOI: | 10.26483/ijarcs.v11i0.6626 |
Nájsť tento článok vo Web of Science