ENHANCING DANGEROUS OBJECT DETECTION THROUGH INTERMEDIATE MULTIMODAL DATA FUSION
DOI:
https://doi.org/10.32782/mathematical-modelling/2025-8-1-27Keywords:
multimodal sensor data fusion, object detection, autonomous systems, LiDAR, attention mechanisms, feature extraction, risk assessmentAbstract
Reliable detection of dangerous objects is a critical component in numerous domains, including autonomous transportation, industrial safety, surveillance systems, and robotics. In real-world scenarios – such as poor lighting, adverse weather, occluded views, or noisy backgrounds – single-sensor detection systems often struggle to maintain accuracy and stability. To address these limitations, this paper proposes an approach based on multimodal data fusion, leveraging attention mechanisms and object criticality assessment.The proposed system integrates information from an RGB camera, LiDAR, thermal imager, and optionally, tactile sensors. Each sensor stream is processed by a dedicated feature encoder. Attention weights are dynamically calculated based on the environmental context to determine the relevance of each modality. The resulting fused feature vector is passed through a detection module that produces a base confidence score, while a parallel criticality module evaluates spatial distance, relative velocity, and orientation of the detected object.The final decision is formed by combining both the detection score and the criticality assessment using a weighted factor. If the resulting Sfinal value exceeds a predefined threshold, the system triggers a response – issuing a warning, initiating emergency braking, or notifying the operator. A hypothetical case study involving pedestrian detection in limited visibility illustrates the practical utility of the proposed architecture.The system exhibits high accuracy, robustness, and contextual adaptability. Its main advantages include preservation of spatial and semantic features, the ability to process inputs across multiple sensing domains, and risk-based prioritization of threats. This architecture presents a strong foundation for the development of reliable, safety-oriented autonomous solutions in transportation, industrial monitoring, and urban environments.
References
Arya C., Tripathi A., Singh P., Diwakar M., Sharma K., Pandey H. Object detection using deep learning: a review. Journal of Physics: Conference Series. 2021. Vol. 1854, № 1. Article 012012. DOI: 10.1088/1742-6596/1854/1/012012.
Zhao F., Zhang C., Geng B. Deep Multimodal Data Fusion. ACM Computing Surveys. 2024. Vol. 56, № 9. Article 216. DOI: 10.1145/3649447.
Fedorenko H., Fesenko H., Kharchenko V. Analysis of methods and development of a concept for guaranteed detection and recognition of explosive objects. Innovative Technologies and Scientific Solutions for Industries. 2022. № 4 (22). P. 20–31.
Щур Г. Інтелектуальна система керування роботом-сапером у віртуальному середовищі. Вісник Хмельницького національного університету. 2024. № 3 (1). P. 326–329. DOI: 10.31891/2307-5732-2024-335-3-43.
Dumyn I., Basystiuk O., Dumyn A. Graph-based approaches for multimodal medical data processing. 2023.
Thompson D.J. Maritime Object Detection, Tracking, and Classification Using LiDAR and Vision-Based Sensor Fusion. 2017. [Technical report].
Vadidar M., Kariminezhad A., Mayr C., Kloeker L., Eckstein L. Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection. Proceedings of the IEEE Intelligent Vehicles Symposium (IV). 2022. P. 367–374.
Bhown A. Improving Long-Range 3D Object Detection Methods for Autonomous Box Trucks Using Sensor Fusion. 2022. [Preprint].
Alsubaei F.S., Al-Wesabi F.N., Hilal A.M. Deep Learning-Based Small Object Detection and Classification Model for Garbage Waste Management in Smart Cities and IoT Environment. 2022. [Preprint].
Ceccarelli A., Montecchi L. Evaluating Object (mis)Detection from a Safety and Reliability Perspective: Discussion and Measures. IEEE Access. 2023. Vol. 11. P. 44952–44963.
Tabrik S., Behroozi M., Schlaffke L., Heba S., Lenz M., Lissek S., Güntürkün O., Dinse H.R., Tegenthoff M. Visual and Tactile Sensory Systems Share Common Features in Object Recognition. eNeuro. 2021. Vol. 8, № 5. DOI: 10.1523/ENEURO.0261-21.2021.
Rouhafzay G., Cretu A.-M. An Application of Deep Learning to Tactile Data for Object Recognition under Visual Guidance. Sensors. 2019. Vol. 19, № 7. Article 1534. DOI: 10.3390/ s19071534.
Ahmad J., Del Bue A. mmFUSION: Multimodal Fusion for 3D Objects Detection. arXiv preprint arXiv:2311.04058. 2023. URL: https://arxiv.org/abs/2311.04058.
Önal O., Dandıl E. Unsafe-Net: YOLO v4 and ConvLSTM Based Computer Vision System for Real-Time Detection of Unsafe Behaviours in Workplace. Multimedia Tools and Applications. 2024. P. 1–27.
Danso S.A., Shang L., Hu D., Odoom J., Liu Q., Nyarko B. Hidden Dangerous Object Recognition in Terahertz Images Using Deep Learning Methods. Applied Sciences. 2022. Vol. 12, № 15. Article 7354.






