Unveiling Sentiments: A Deep Dive Into Sentiment Analysis for Low-Resource Languages - A Case Study on Hausa Texts

Uloženo v:
Podrobná bibliografie
Název: Unveiling Sentiments: A Deep Dive Into Sentiment Analysis for Low-Resource Languages - A Case Study on Hausa Texts
Autoři: Shehu, Harisu Abdullahi, Usman Majikumna, Kaloma, Bashir Suleiman, Aminu, Luka, Stephen, Sharif, Md Haidar, Ramadan, Rabie A., Kusetogullari, Hüseyin, 1981
Zdroj: IEEE Access. 12:98900-98916
Témata: Bag-of-words, deep learning, Hausa texts, lexicon dictionary, low-resource languages, sentiment analysis, Recurrent neural networks, Bag of words, Case-studies, Convolutional neural network, Deep dives, Hausa text, Low resource languages, Performance
Popis: Opinion mining has witnessed significant advancements in well-resourced languages. However, for low-resource languages, this landscape remains relatively unexplored. This paper addresses this gap by conducting a comprehensive investigation into sentiment analysis in the context of Hausa, one of the most widely spoken languages within the Afro-Asiatic family. To resolve the problem, three different models based on Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Hierarchical Attention Network (HAN), all tailored to the unique linguistic characteristics of Hausa have been proposed. Additionally, we have developed the first dedicated lexicon dictionary for Hausa sentiment analysis and a customized stemming method to enhance the accuracy of the bag of words approach. Our results indicate that CNN and HAN achieved significantly higher performance compared to other models such as RNN. While the experimental results demonstrate the effectiveness of the developed deep learning models in contrast to the bag of words approach, the proposed stemming method was found to significantly improve the performance of the bag of words approach. The findings from this study not only enrich the sentiment analysis domain for Hausa but also provide a foundation for future research endeavors in similarly underrepresented languages. © 2023 IEEE.
Popis souboru: electronic
Přístupová URL adresa: https://urn.kb.se/resolve?urn=urn:nbn:se:bth-26807
https://doi.org/10.1109/ACCESS.2024.3427416
Databáze: SwePub
Popis
Abstrakt:Opinion mining has witnessed significant advancements in well-resourced languages. However, for low-resource languages, this landscape remains relatively unexplored. This paper addresses this gap by conducting a comprehensive investigation into sentiment analysis in the context of Hausa, one of the most widely spoken languages within the Afro-Asiatic family. To resolve the problem, three different models based on Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Hierarchical Attention Network (HAN), all tailored to the unique linguistic characteristics of Hausa have been proposed. Additionally, we have developed the first dedicated lexicon dictionary for Hausa sentiment analysis and a customized stemming method to enhance the accuracy of the bag of words approach. Our results indicate that CNN and HAN achieved significantly higher performance compared to other models such as RNN. While the experimental results demonstrate the effectiveness of the developed deep learning models in contrast to the bag of words approach, the proposed stemming method was found to significantly improve the performance of the bag of words approach. The findings from this study not only enrich the sentiment analysis domain for Hausa but also provide a foundation for future research endeavors in similarly underrepresented languages. © 2023 IEEE.
ISSN:21693536
DOI:10.1109/ACCESS.2024.3427416