ADAPTIVE LOAD DISTRIBUTION IN VOICE IDENTIFICATION SYSTEMS

M. E. BONDARENKO; H. S. IVASHCHENKO

doi:10.35546/kntu2078-4481.2025.4.3.3

Authors

M. E. BONDARENKO Kharkiv National University of Radio Electronics https://orcid.org/0000-0002-2500-7626
H. S. IVASHCHENKO Kharkiv National University of Radio Electronics https://orcid.org/0000-0003-1027-5262

DOI:

https://doi.org/10.35546/kntu2078-4481.2025.4.3.3

Keywords:

voice identification systems, MFCC, spectral subtraction, wavelet filtering, CPU, GPU, adaptive load balancing, load management, task distribution, dynamic resource balancing, real-time performance, multiprocessor systems

Abstract

This paper examines an approach to improving the efficiency of voice identification systems by distributing the computational load between the central processing unit (CPU) and the graphics processing unit (GPU). Existing implementations rely on fixed distribution schemes in which each processing stage is assigned in advance to either the CPU or the GPU. Such a static organisation of computations ignores the dynamic variability of speech signal parameters, the current state of computing resources, and the significant differences in computational complexity between individual operations. Without a mechanism for adaptive load redistribution, the CPU and GPU are used unevenly and overall performance decreases. The proposed approach uses an adaptive algorithm that controls load distribution based on a set of speech recording characteristics: frame length, degree of overlap, energy content, and spectral saturation of the signal. These parameters make it possible to estimate quantitatively the computational complexity of the current signal segment and to determine dynamically the ratio of operations performed on the CPU and on the GPU. This coordinates the interaction between processors, minimises idle time, and increases overall system performance. A comparative analysis showed that the adaptive algorithm significantly reduces the average processing time of speech fragments compared with approaches that rely solely on data volume. The proposed adaptive control and load distribution block improves the performance of voice identification systems, especially when processing large and structurally complex sets of signals, and can be integrated into modern multiprocessor architectures.
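The idea of turning segment characteristics into a CPU/GPU split can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the formula, so the cost model (an FFT-like N log2 N term per frame), the weights `w_energy` and `w_spectrum`, the `pivot` constant, and all function and field names are hypothetical choices made only for illustration. The sketch assumes energy and spectral saturation are already normalised to [0, 1] and that processor loads are reported as fractions in [0, 1].

```python
import math
from dataclasses import dataclass


@dataclass
class SegmentFeatures:
    """Characteristics of one speech segment (hypothetical field names)."""
    frame_length: int           # samples per frame, >= 2
    overlap: float              # fraction of overlap between frames, 0 <= overlap < 1
    energy: float               # normalised energy content, 0..1
    spectral_saturation: float  # normalised spectral saturation, 0..1
    num_samples: int            # total samples in the segment


def frame_count(f: SegmentFeatures) -> int:
    """Number of frames produced by the framing parameters."""
    hop = max(1, int(round(f.frame_length * (1.0 - f.overlap))))
    if f.num_samples <= f.frame_length:
        return 1
    return 1 + (f.num_samples - f.frame_length) // hop


def complexity_score(f: SegmentFeatures,
                     w_energy: float = 0.5,
                     w_spectrum: float = 0.5) -> float:
    """Illustrative complexity estimate; the weights are assumptions."""
    # Assumed per-frame cost of a spectral transform: N * log2(N).
    per_frame = f.frame_length * math.log2(f.frame_length)
    # Assumed scaling: richer signal content means more work per frame.
    content = 1.0 + w_energy * f.energy + w_spectrum * f.spectral_saturation
    return frame_count(f) * per_frame * content


def gpu_share(score: float, cpu_load: float, gpu_load: float,
              pivot: float = 1e6) -> float:
    """Fraction of operations sent to the GPU (0..1).

    `pivot` is a hypothetical score at which work is split evenly
    when both processors are idle.
    """
    base = score / (score + pivot)              # heavier segments favour the GPU
    gpu_weight = base * (1.0 - gpu_load)        # discount by current GPU load
    cpu_weight = (1.0 - base) * (1.0 - cpu_load)  # discount by current CPU load
    total = gpu_weight + cpu_weight
    return base if total == 0 else gpu_weight / total


if __name__ == "__main__":
    segment = SegmentFeatures(frame_length=1024, overlap=0.75, energy=0.9,
                              spectral_saturation=0.9, num_samples=160_000)
    share = gpu_share(complexity_score(segment), cpu_load=0.2, gpu_load=0.5)
    print(f"GPU share: {share:.2f}, CPU share: {1 - share:.2f}")
```

In this sketch the split depends on both the signal and the current processor state, which is the contrast the abstract draws with schemes that rely solely on data volume. A real system would calibrate the weights and the pivot against measured processing times.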

References

Samonte M. J. C., Callejo J. K., Lumbera D. C. N., Ocaya J. C. B. Mitigating Vishing in Digital Banking Through Caller Authentication and Verification Technologies. 2024 14th International Conference on Software Technology and Engineering ICSTE, Macau, China. 2024. Pp. 102–108. DOI: 10.1109/ICSTE63875.2024.00025

Kambampati P., Rane S., Shoeb A., Dhannawat R. PAYV Payment Voice A Platform using Voice Recognition to Enable Payment Transactions. 2024 Asia Pacific Conference on Innovation in Technology APCIT, Mysore, India. 2024. Pp. 1–6. DOI: 10.1109/APCIT62007.2024.10673442

Li Y., Gao X., Song Q., Wang Y., Lyu P., Zhang H. BoneAuth: A Bone-Conduction-Based Voice Liveness Authentication for Voice Assistants. IEEE Internet of Things Journal. 2025. Vol. 12, no. 6. Pp. 6997–7009. DOI: 10.1109/JIOT.2024.3494024

Bao L., Zuo Y. Speaker Identification based on MFSC Voice Feature Extraction using Transformer. 2023 IEEE International Conference on Data Mining Workshops ICDMW, Shanghai, China. 2023. Pp. 1–7. DOI: 10.1109/ICDMW60847.2023.00008

Chen Q., Gu Z., Lu L., Xu X., Ba Z., Lin F., Liu Z., Ren K. Conan’s Bow Tie: A Streaming Voice Conversion for Real-Time VTuber Livestreaming. Proceedings of the 29th International Conference on Intelligent User Interfaces. 2024. Pp. 35–50. DOI: 10.1145/3640543.3645146

Mykhailichenko I., Ivashchenko H., Barkovska O., Liashenko O. Application of Deep Neural Network for Real-Time Voice Command Recognition. IEEE 3rd KhPI Week on Advanced Technology KhPIWeek, Kharkiv, Ukraine. 2022. Pp. 1–4. DOI: 10.1109/KhPIWeek57572.2022.9916473

Bondarenko M. E., Ivashchenko H. S. Using a sequence of preprocessing methods in voice identification systems [in Ukrainian]. Control, Navigation and Communication Systems. Poltava: PNTU. 2025. No. 2 (80). Pp. 90–96. DOI: 10.26906/SUNZ.2025.2.090

Bondarenko M. E., Ivashchenko H. S. Organisation of parallel execution of voice signal processing methods on multi-core CPUs and GPUs [in Ukrainian]. Control, Navigation and Communication Systems. Poltava: PNTU. 2025. No. 4 (82). Pp. 39–44. DOI: 10.26906/SUNZ.2025.4.39-44

Gjermundsen A. CPU and GPU Co-processing for Sound. MA thesis. Norwegian University of Science and Technology. 2010. 173 p.

Momcilovic S., Ilic A., Roma N., Sousa L. Dynamic Load Balancing for Real-Time Video Encoding on Heterogeneous CPU+GPU Systems. IEEE Transactions on Multimedia. 2014. Vol. 16, no. 1. Pp. 108–121. DOI: 10.1109/TMM.2013.2284892

Kim J., Lane I. Accelerating Large Vocabulary Continuous Speech Recognition on Heterogeneous CPU-GPU Platforms. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP, Florence, Italy. 2014. Pp. 3291–3295. DOI: 10.1109/ICASSP.2014.6854209

Kossaifi J., Walecki R., Panagakis Y., Shen J., Schuller B., Pantic M. SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021. Vol. 43, no. 3. Pp. 1022–1040. DOI: 10.1109/TPAMI.2019.2944808
