METHODS OF VISUAL, TEXTUAL, AND WEB DATA ANALYSIS IN MODERN DATA MINING SYSTEMS
DOI:
https://doi.org/10.35546/kntu2078-4481.2025.4.2.11Keywords:
Data Mining, intelligent data analysis, text analysis, visual analytics, web mining, Internet data, machine learning, deep learning, integrated analytical systems, explainable artificial intelligence.Abstract
The article examines a comprehensive approach to data analysis in modern Data Mining systems, which integrates methods of textual, visual, and Internet-oriented information processing. It is shown that the heterogeneity of contemporary data, encompassing textual, behavioral, structured, and dynamic information streams, necessitates the use of multi-level analytical technologies capable of operating simultaneously with different types of representations and models. A review of scientific sources was conducted, highlighting the development trends of natural language processing models (including deep learning techniques), visual analytics tools, and web mining methods, as well as outlining their advantages and limitations in practical applications. The study substantiates the feasibility of integrating these three groups of methods into a unified analytical platform that ensures deep semantic processing of data, enhanced interpretability of results, and the ability to handle highly dynamic Internet data flows. To evaluate the effectiveness of the proposed approach, a prototype system was developed and tested on an online-catalog dataset that included user textual reviews, behavioral activity logs, and structured product attributes. The experimental results demonstrate that combining textual features with behavioral web indicators improves classification performance compared to relying solely on textual data. The integration of visual analytics, in turn, significantly reduces the time required for interpreting the results and enhances user experience, which is confirmed both by operational metrics and expert evaluations. The findings indicate that the integrated approach is promising for developing scalable, adaptive, and interpretable Data Mining systems designed for real-world big data processing conditions. Future research should focus on advancing universal models capable of combining textual, behavioral, graph-based, and spatiotemporal data, as well as improving explainable artificial intelligence tools for analyzing complex analytical processes.
References
Rautela R., Kumar S., Kumar A. та ін. Text mining: a comprehensive survey // International Journal of Recent Scientific Research. 2018. Vol. 9, Iss. 5(G). P. 26959–26962. DOI: 10.24327/IJRSR.2018.0905.2158.
Alqahtani A., Ghazali R., Hasan M. та ін. An efficient approach for textual data classification using deep learning // Frontiers in Computational Neuroscience. 2022. Vol. 16. Article 992296. DOI: 10.3389/fncom.2022.992296.
Li H., Ling X., Zhang B. та ін. Interactive machine learning by visualization: a small data solution // Proc. IEEE International Conference on Big Data. 2018. P. 3513–3521. DOI: 10.1109/BigData.2018.8621952.
Hohman F., Kahng M., Pienta R., Chau D.H. Visual analytics in deep learning: an interrogative survey for the next frontiers // IEEE Transactions on Visualization and Computer Graphics. 2019. Vol. 25, No. 8. P. 2674–2693. DOI: 10.1109/TVCG.2018.2843369.
Yanto I. T., Handayani O. P. Visualization of data inventory using visual data mining (VDM) and exploratory data analysis (EDA) methods // JOIV: International Journal on Informatics Visualization. 2025. Vol. 9, No. 5. P. 1808–1815. DOI: 10.62527/joiv.9.5.4075.
Begam A. M., Saravanan B. V. Web mining concepts, applications and tools: a survey // Journal of Emerging Technologies and Innovative Research. 2019. Vol. 6, No. 5. P. 285–288.
Darwiesh A., Alghamdi M. I., El‑Baz A. H., Elhoseny M. Social media big data analysis: towards enhancing competitiveness of firms in a post‑pandemic world // Journal of Healthcare Engineering. 2022. Article ID 6967158. DOI: 10.1155/2022/6967158.
Hamdi A., Chen Y.‑P., Mohamed M. B., Hmeidi I. Spatiotemporal data mining: a survey on challenges and openproblems // Artificial Intelligence Review. 2022. Vol. 55. P. 1441–1488. DOI: 10.1007/s10462-021-09994-y.
Li W., Aram J., Islam M. M. та ін. A comprehensive survey on machine learning‑based big data analytics for IoT‑enabled smart healthcare system // Mobile Networks and Applications. 2021. Vol. 26, No. 1. P. 234–252. DOI: 10.1007/s11036-020-01700-6.
Wang K., Ding Y., Han S. C. Graph neural networks for text classification: a survey // Artificial Intelligence Review. 2024. Vol. 57, Article 190. DOI: 10.1007/s10462-024-10808-0.







