The Implementation of TF-IDF and Word2Vec on Booster Vaccine Sentiment Analysis Using Support Vector Machine Algorithm

As a sort of technological advancement, social media is a medium used to transmit ideas on certain subjects. Sentiment analysis can be used to analyze public opinion. Feature extraction stage of sentiment analysis is crucial for transforming unstructured text into categorizable structured data. Usin...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Procedia computer science Ročník 234; s. 156 - 163
Hlavní autoři: Nurhaliza Agustina, C.A, Novita, Rice, Mustakim, Rozanda, Nesdi Evrilyan
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 2024
Témata:
ISSN:1877-0509, 1877-0509
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:As a sort of technological advancement, social media is a medium used to transmit ideas on certain subjects. Sentiment analysis can be used to analyze public opinion. Feature extraction stage of sentiment analysis is crucial for transforming unstructured text into categorizable structured data. Using 13,297 records from Twitter and SVM algorithm, as well as the TF-IDF and Word2Vec feature extraction approaches, the combination of SVM + TF-IDF with 80:20 data split scenario and the RBF kernel produces the best results, with precision 85%, recall 86%, and f1-score 84%. In the 80:20 data split and RBF kernel, SVM+Word2Vec combination achieves the highest performance, with precision 83%, recall 82%, and f1-score 76%.
ISSN:1877-0509
1877-0509
DOI:10.1016/j.procs.2024.02.162