STUDY ON SENTIMENT ANALYSIS METHODS FOR TEXT DATA

Authors

DOI:

https://doi.org/10.35546/kntu2078-4481.2024.1.31

Keywords:

sentiment analysis, natural language processing, machine learning, Python, NLTK, spaCy, TextBlob, Gensim

Abstract

The relevance of the research topic is determined by the avalanche-like growth of unstructured text data on the Internet and the need for effective methods of tonality analysis. The purpose of the work is to systematically study the current state of the tonality analysis methodology, compare the leading approaches and outline further prospects. The article analyzes in detail popular Python libraries for natural language processing – NLTK, spaCy, TextBlob, Gensim. The comparison was made according to the criteria of computational efficiency, ease of use, flexibility of feature extraction, and customization options. The methodological core of the study is an experimental comparison of NLTK and TextBlob for tonality classification of Ukrainian-language texts. Estimates may vary depending on specific usage scenario and settings. NLTK, where it can be more accurate when configured correctly, but requires more effort to configure. TextBlob, on the other hand, is easier to use, but may be less accurate for specialized tasks. The results proved the superiority of TextBlob in speed and NLTK in accuracy. Tonality analysis has a huge potential for improving analytical capabilities in many areas – from optimizing business processes to countering the spread of fake news. Further research should focus on the development of specialized solutions for specific applied tasks. Prospects for improving the ethical principles of text analysis, taking into account the linguistic and cultural context, as well as the integration of the tonality analysis functionality into decision support systems have been determined.

References

Ivokhin E., Маkhno М., Rets V. Про один спосіб аналізу тональності текстів за допомогою штучних нейронних мереж. Системи управління, навігації та зв’язку. Збірник наукових праць. 2022. Т. 3, № 69. С. 71–74.

Orel A. Social media analyzing for evaluation opinions determination based on sentiment analysis. International scientific journal "Internauka". 2018. No. 10.

Deng Y. Research on sentiment analysis methods for text-oriented data. Frontiers in computing and intelligent systems. 2023. Vol. 3, no. 1. P. 42–47.

Mukasheva А. Tasks and methods of text sentiment analysis. Scientific journal of astana IT university. 2021. No. 7. P. 55–62.

Abonizio H. Q., Paraiso E. C., Barbon Junior S. Toward text data augmentation for sentiment analysis. IEEE transactions on artificial intelligence. 2021. P. 1.

Samigulin T. R., Djurabaev A. E. U. Sentiment analysis of text by machine learning methods. Research result. Information technologies. 2021. Vol. 6, no. 1.

Yao J. Automated sentiment analysis of text data with NLTK. Journal of physics: conference series. 2019. Vol. 1187, no. 5. P. 60–78.

A review of text sentiment analysis methods and applications / Y. Jin et al. Frontiers in business, economics and management. 2023. Vol. 10, no. 1. P. 58–64.

Poria S., Hussain A., Cambria E. Concept extraction from natural text for concept level text analysis. Multimodal sentiment analysis. Cham, 2018. P. 79–84.

Maran S. M., Esh P. S. Text analysis for product reviews for sentiment analysis using NLP methods. International journal of engineering trends and technology. 2017. Vol. 47, no. 8. P. 474–480.

Sarkar D. Sentiment analysis. Text analytics with python. Berkeley, CA, 2019. P. 567–629.

Text as data: text mining and sentiment analysis. Data mining and business analytics with R. Hoboken, NJ, USA, 2013. P. 258–271.

Published

2024-05-01