APPROACH TO SOFTWARE ENGINEERING AND TESTING OF THE METHOD OF MULTI-LEVEL DETECTION OF CYBERBULLYING SUBJECTS BASED ON TRANSFORMERS IN A CLOUD ENVIRONMENT

Authors

DOI:

https://doi.org/10.32782/mathematical-modelling/2025-8-2-13

Keywords:

cyberbullying, role interpretation, transformative models, Google Colab

Abstract

The article presents an engineering-based approach to multi-level detection of cyberbullying subjects with a combination of primary transformer detection of aggressive statements and subsequent role-based interpretation of discourse according to the initiator speech act addressee scheme in the Google Colab environment. The implementation is based on a pre-trained classifier cardiffnlp twitter roberta base offensive in the text classification configuration of the transformers library, which provides reproducible inference without additional training and allows you to focus on software engineering and testing. The proposed design includes interface modules, processing control, classification and syntactic semantic analysis with fixing package versions, random seeds and tokenization parameters, configuration logging and artifact storage on Google Drive. The methodological contribution consists of a notebook-oriented organization of the life cycle, which includes data format control, replicated microruns, a thresholding protocol for binary decisions, borderline case labeling for manual review, and built-in checks for consistency of role interpretation. Empirical feasibility is confirmed on a small expert-verified subsample, which covers typical situations of direct insult, ironic devaluation, and neutral citation of potentially toxic vocabulary. The evaluation includes fixing the median inference time per example and the interquartile range as indicators of operational stability in variable conditions of the collab environment, as well as analyzing the distribution of decision confidence taking into account the operating threshold. Qualitative validation demonstrates correct activations on explicit invectives, false positives in cases of metalinguistic citation, and false negatives for sarcastic statements, which justifies the need for role interpretation to remove contextual ambiguity. The results obtained indicate that the combination of a pre-trained classifier with a role level and procedurally designed testing provides reproducibility of experiments, interpretability of output, and controlled determinism of solutions within the computational constraints of Colab, creating a basis for scaling to larger corpora and multilingual domains.

References

Wang S., Shibghatullah A.S., Iqbal T.J. et al.. A review of multimodal-based emotion recognition techniques for cyberbullying detection in online social media platforms. Neural Computing and Applications. 2024. № 36. P. 21923–21956. https://doi.org/10.1007/s00521-024-10371-3

Islam M.M., Uddin M.A., Islam L., Akter A., Sharmin S., Acharjee U.K. Cyberbullying detection on social networks using machine learning approaches. 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE), December 2020. International Journal of Computer Applications. January 2025. Vol. 186, No. 61. P. 1–6. IEEE. https://doi.org/10.5120/ijca2025924395

Молчанова М.О., Дідур В.О., Мазурець О.В., Тищенко О.О., Залуцька О.О. Інформаційна технологія використання хмарних обчислень для класифікації залишків зруйнованих будівель засобами нейронних мереж за візуальними даними з безпі- лотних літальних апаратів. Наука і техніка сьогодні. 2025. № 4 (45). С. 1259–1272. https://doi.org/10.52058/2786-6025-2025-4(45)-1259-1272

Молчанова М.О., Мазурець О.В., Собко О.В., Кліменко В.І., Андрощук В.І. Метод нейромережевого виявлення кібербулінгу з використанням хмарних сервісів та об'єктно-орієнтованої моделі. Вісник Хмельницького національного університету. Серія «Технічні науки». 2024. № 2(333). С. 200–206. https://doi.org/10.31891/2307-5732-2024-333-2

Chen M., et al. 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. The Speaker and Language Recognition Workshop (Odyssey 2024), 18–21 June 2024, Quebec City, Canada. ISCA, 2024. P. 1–6. https://doi.org/10.21437/odyssey.2024-37

Yin Y., Feng Y., Weng S., Gao X., Liu J., Zhao Z. Lightweight Probabilistic Coverage Metrics for Efficient Testing of Deep Neural Networks. Proceedings of the 16th International Conference on Internetware (Internetware '25), June 20–22, 2025. Association for Computing Machinery, New York, NY, USA. P. 474–486. https://doi.org/10.1145/3755881.3755915

Sihab-Us-Sakib S., Rahman M.R., Forhad M.S.A., Aziz M.A. Cyberbullying detection of resource-constrained language from social media using transformer-based approach. Natural Language Processing Journal. 2024. Vol. 9. P. 100104. https://doi.org/10.1016/j.nlp.2024.100104

Zampieri M., Rosenthal S., Nakov P., Dmonte A., Ranasinghe T. OffensEval 2023: Offensive language identification in the age of Large Language Models. Natural Language Engineering. 2023. Vol. 29, No. 6. P. 1416–1435. https://doi.org/10.1017/S1351324923000517

Aliyeva Ç.O., Yağanoğlu M. Deep learning approach to detect cyberbullying on Twitter. Multimedia Tools and Applications. 2024. Vol. 84. P. 20497–20520. https://doi.org/10.1007/s11042-024-19869-3

Pavlopoulos J., Sorensen J., Laugier L., Androutsopoulos I. SemEval-2021 Task 5: Toxic Spans Detection. Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), Online, August 2021. Association for Computational Linguistics. P. 59–69. https://doi.org/10.18653/v1/2021.semeval-1.6

Twitter-roBERTa-base for Offensive Language Identification: Huggingface URL: https://huggingface.co/cardiffnlp/twitter-roberta-base-offensive (дата звернення: 10.11.2025).

Krak I., Molchanova M., Didur V., Sobko O., Mazurets O., Barmak O. Method of semantic features estimation for political propaganda techniques detection using transformer neural networks. CEUR Workshop Proceedings, Vol. 3917. Kryvyi Rih, Ukraine, December 27, 2024. P. 286–297. URL: https://ceur-ws.org/Vol-3917/paper56.pdf (дата звернення: 10.11.2025).

Published

2025-12-30