This paper is accepted in Volume-10, Issue-4, 2024
Area
Object Detection
Author
R Krishnananda, B. Padmavathy, Pulkit Kumar Yadav, R Priyanka, Shubhalakshmi Dash
Org/Univ
Sri Venkateswara College of Engineering, Bangalore, India
Sub. Date
15 July, 2024
Paper ID
V10I4-1157
Publisher
Keywords
Real-Time Object Detection, Deep Learning, Mask R-CNN, SSD, YOLOv3

Abstract

Real-time object detection is crucial in computer vision, impacting domains like surveillance, autonomous vehicles, and augmented reality. Here, it integrates insights from seminal works—faster R-CNN, YOLOv3, Mask R-CNN, and SSD—to create a unified framework. Balancing speed and accuracy, we leverage Faster R-CNN's region proposal networks (RPNs) for precise localization. Inspired by YOLOv3's efficiency, our single-shot detection strategy ensures adaptability. Mask R-CNNs instance segmentation enhances scene comprehension, while SSDs streamlined architecture optimizes speed. This synthesis yields a framework redefining real-time object detection, pushing boundaries without compromising accuracy. In conclusion, this research underscores the transformative potential of real-time object detection, uniting cutting-edge models to innovate computer vision.