
Efficient Multi-Receptive Pooling for Object Detection on Drone

Abstract
Object detection, which determines the location and class of objects in an image, is a fundamental and important research topic in computer vision. It has been studied continuously over the past few years, and it has gradually improved with advances in hardware such as GPU computing power and cameras. However, GPUs are difficult to use on low-cost devices such as drones, so efficient deep learning techniques that can run on such devices are needed. In this paper, we propose a deep learning model that enables real-time object detection on a low-cost device. We modify the CSP Bottleneck and SPPF modules in the YOLOv5 backbone to reduce the amount of computation and improve speed. The model is trained on the MS COCO and VisDrone datasets, where it reaches mAP values of 0.364 and 0.19, about 0.07 and 0.04 higher than RefineDetLite and RefineDet, respectively. It runs at 23.010 frames per second in the CPU configuration, which is sufficient for real-time object detection.
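The abstract names the SPPF module of YOLOv5 as one of the modified parts but does not describe the modification. As background only, the sketch below illustrates the standard SPPF idea rather than the authors' modified version: three stride-1 max-pooling layers with a 5x5 window applied one after another cover the same receptive fields as parallel 5x5, 9x9, and 13x13 pools. The function names (`max_pool2d`, `spp_parallel`, `sppf_sequential`) and the random 16x16 single-channel feature map are assumptions made for this illustration, and plain Python lists stand in for tensors.

```python
import random

NEG_INF = float("-inf")


def max_pool2d(grid, k):
    """Stride-1 max pooling with a k x k window; positions outside the
    grid are ignored, so the output keeps the input size."""
    r = k // 2
    h, w = len(grid), len(grid[0])
    out = []
    for i in range(h):
        row = []
        for j in range(w):
            best = NEG_INF
            for y in range(max(0, i - r), min(h, i + r + 1)):
                for x in range(max(0, j - r), min(w, j + r + 1)):
                    best = max(best, grid[y][x])
            row.append(best)
        out.append(row)
    return out


def spp_parallel(grid):
    """SPP-style branches: independent pools with windows 5, 9 and 13."""
    return [grid, max_pool2d(grid, 5), max_pool2d(grid, 9), max_pool2d(grid, 13)]


def sppf_sequential(grid):
    """SPPF-style branches: the same 5x5 pool applied three times in a row."""
    p1 = max_pool2d(grid, 5)
    p2 = max_pool2d(p1, 5)
    p3 = max_pool2d(p2, 5)
    return [grid, p1, p2, p3]


# Hypothetical single-channel feature map used only for this comparison.
random.seed(0)
feature_map = [[random.random() for _ in range(16)] for _ in range(16)]

# Both variants yield the same four maps, which a detector would concatenate.
same = spp_parallel(feature_map) == sppf_sequential(feature_map)
print("sequential pooling matches parallel pooling:", same)
```

Away from the borders, the sequential form reads 3 x 25 = 75 window elements per output position, while the parallel form reads 25 + 81 + 169 = 275. This is why the sequential layout is the cheaper way to obtain several receptive fields.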
Author(s)
Jinsu An; Muhamad Dwisnanto Putro; Adri Priadana; Kang-Hyun Jo
Issued Date
2023
Type
Article
Keyword
Object Detection; Drone Vision; Convolutional Neural
DOI
10.1007/978-981-99-4914-4_2
URI
https://oak.ulsan.ac.kr/handle/2021.oak/17244
Publisher
Communications in Computer and Information Science
Language
English
ISSN
1865-0929
Citation Volume
1857
Citation Number
1
Citation Start Page
14
Citation End Page
25
Appears in Collections:
Engineering > IT Convergence
Access and License
  • Access type: Open
File List
  • No related files exist.
