Comparative analysis of YOLOv5 and MobileNetV3 models for real-time image recognition

Keywords: image recognition, computer vision, hybrid model, CNN, YOLOv5, MobileNetV3

Abstract

Relevance: The growing need for fast and accurate real-time object recognition, especially on mobile and embedded systems, raises the question of which AI model to choose. Comparing lightweight and high-accuracy architectures such as YOLOv5 and MobileNetV3 is important for developing efficient computer vision systems and for exploring the principles of hybrid model construction.

Purpose: To compare the YOLOv5 and MobileNetV3 architectures in terms of their efficiency for real-time object recognition, and to confirm that hybrid models can improve efficiency on these tasks.

Research methods: image preprocessing; deep neural network training; measurement of accuracy, processing speed, and resource usage; comparative analysis of the results to assess model effectiveness.
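
As an illustration of the processing-speed measurement listed above, the following is a minimal Python sketch and not the authors' benchmarking code. The function `benchmark`, the stand-in `dummy_model`, and the warm-up count are all hypothetical; a real comparison would pass the forward pass of each trained detector as `predict`.

```python
import statistics
import time


def benchmark(predict, inputs, warmup=3):
    """Time `predict` on each input; return mean latency (ms) and throughput (FPS)."""
    # Warm-up calls are excluded from timing (caches, lazy initialisation).
    for sample in inputs[:warmup]:
        predict(sample)

    latencies = []
    for sample in inputs:
        start = time.perf_counter()
        predict(sample)
        latencies.append(time.perf_counter() - start)

    mean_s = statistics.mean(latencies)
    return {
        "mean_latency_ms": mean_s * 1000.0,
        "fps": 1.0 / mean_s if mean_s > 0 else float("inf"),
        "runs": len(latencies),
    }


def dummy_model(image):
    # Hypothetical stand-in for a detector's forward pass;
    # it is neither YOLOv5 nor MobileNetV3.
    return sum(image) / len(image)


if __name__ == "__main__":
    frames = [[float(i)] * 1000 for i in range(50)]
    print(benchmark(dummy_model, frames))
```

Running the same `benchmark` call on each model with identical inputs and hardware keeps the latency figures comparable; accuracy and memory usage would need separate measurements.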

Results: The experimental study showed that YOLOv5 achieves higher overall accuracy on the COCO test set but requires more computing resources. MobileNetV3, by contrast, provides faster inference and runs efficiently on low-power devices at the cost of some accuracy. Both models are therefore suitable for real-world applications, and the choice between them depends on the required balance of speed, accuracy, and platform constraints. Combining the two models improves object recognition results, although it may increase model size and resource consumption.

Conclusions: The study compared YOLOv5, MobileNetV3, and a hybrid model on the object recognition task. The hybrid model demonstrated better accuracy and a better balance between processing speed and resource utilization than either individual model. This indicates that hybrid approaches are a feasible way to improve the efficiency of computer vision systems under real-world conditions, which makes the hybrid model a promising direction for further research and practical implementation.

Author Biographies

Yaroslav Yasinskyi, V.N. Karazin Kharkiv National University, Svobody Sq. 6, Kharkiv, Ukraine, 61022

Ph.D. student

Nina Bakumenko, V.N. Karazin Kharkiv National University, Svobody Sq. 6, Kharkiv, Ukraine, 61022

Candidate of Technical Sciences; Associate Professor of Computer Systems and Robotics Department, Education and Research Institute of Computer Sciences and Artificial Intelligence

Published
2025-06-30
How to Cite
Yasinskyi, Y., & Bakumenko, N. (2025). Comparative analysis of YOLOv5 and MobileNetV3 models for real-time image recognition. Bulletin of V.N. Karazin Kharkiv National University, Series «Mathematical Modeling. Information Technology. Automated Control Systems», 66, 90-98. https://doi.org/10.26565/2304-6201-2025-66-09
Section
Articles