TEXT MINING: APPLICATIONS AND FREE SOFTWARE TOOLS

Authors

  • M.V. MOHYLNA
  • V.I. DUBROVIN

DOI:

https://doi.org/10.32782/mathematical-modelling/2022-5-2-5

Keywords:

classification, expert knowledge discovery, information extraction, patterns, text mining tools, text mining application, library management system

Abstract

The large amount of data generated every day is both an opportunity and a challenge for businesses. On the one hand, data helps companies gain insights into people's opinions about a product or service. These insights can be gained by analyzing emails, product reviews, social media posts, customer feedback, customer service calls, etc. A substantial amount of data is stored in the form of documents, which can be of different types: structured, partially structured, and unstructured. On the other hand, there is the problem of processing this data. Extracting useful information from a huge volume of documents is a difficult task. Text mining is an important area of research because it helps extract knowledge from unstructured text. This article discusses text mining technologies and their application in various areas of life. The article analyzes the use of text mining in a library management system and compares the features of popular text mining tools. The methods of the study are the analysis of scientific articles in which researchers used text mining tools and the comparison of open source software. The methods of text analysis are considered and the main application of intellectual analysis in the library management system is described. Popular free and open-source text analysis software such as RStudio, Python, Orange, RapidMiner, which is also used in machine learning and data science, was discussed. The article helps to increase the level of understanding of researchers in the field of text mining. Using the software in this article, beginners will be able to predict trends, topics, new research concepts, and find duplicate text documents in articles, news, and blogs. Librarians will be able to improve their services in the library management system: reference services, CAS, SDI.

References

Ramzan Talib, Muhammad Kashif Hanif, Shaeela Ayesha and Fakeeha Fatima. Text Mining: Techniques, Applications and Issues. International Journal of Advanced Computer Science and Applications, no.11(7), 2016. doi: 10.14569/IJACSA.2016.071153.

Liao S. H., Chu P. H., Hsiao P. Y. Data mining techniques and applications - a decade review from 2000 to 2011. Expert Systems with Applications, 2012, no. 39(12), pp. 11303-11311. doi: 10.1016/j.eswa.2012.02.063.

Zhong N., Li Y., Wu S.T. Effective pattern discovery for text mining. IEEE transactions on knowledge and data engineering, 2012, no. 1(24), pp. 30-44. doi: 10.1109/TKDE.2010.211.

Henriksson A., Moen H., Skeppstedt M., Daudaravicius V., Duneld M. Synonym extraction and abbreviation expansion with ensembles of semantic spaces. Journal of biomedical semantics, 2014, no. 1(5), p. 1, doi: 10.1186/2041-1480-5-6.

Laxman A., Sujatha D. Improved method for pattern discovery in text mining. International Journal of Research in Engineering and Technology, 2013, no. 1(2), pp. 2321-2328. doi: 10.15623/IJRET.2013.0210090.

Chen A.P., Zhang C.Y. Data-intensive applications, challenges, techniques and technologies: A survey on big data. Information Sciences, 2014, vol. 275, pp. 314-347, 2014. doi: 10.1016/j.ins.2014.01.015.

Rajendra R., Saransh V. A Novel Modified Apriori Approach for Web Document Clustering. International Journal of Computer Applications, 2013, vol. 33, pp. 159-171. doi: 10.1007/978-81-322-2202-6_14.

Sumathy K. L., Chidambaram M. Text mining: Concepts, applications, tools and issues – an overview. International Journal of Computer Applications, 2013, no. 4 (80), pp. 29-32. doi: 10.5120/13851-1685.

Joby P. J., Korra J. Accessing accurate documents by mining auxiliary document information. Advances in Computing and Communication Engineering (ICACCE), Second International Conference on. IEEE, 2015, pp. 634-638. doi: 10.1109/ICACCE.2015.37.

Wen Z., Yoshida T., Tang X. A study with multi-word feature with text classification. Proceedings of the 51st Annual Meeting of the ISSS-2007, Tokyo, Japan, 2007, vol. 51, p. 45.

Карпов І., Антоненко С. Огляд методів інтелектуального аналізу тексту. Актуальні проблеми автоматизації та інформаційних технологій. 2020. № 24. С. 40–46.

Sheela S. K., Bharathi T. Analyzing Different Approaches of Text Mining Techniques and Applications. International Journal of Computer Science Trends and Technology, 2018, no. 4(6).

Thakur K., Kumar V. An Overview of Text Mining: Application and Free Software Tools. Library Waves, 2020, no. 2(6), pp. 53-59.

Published

2023-06-09