KLI

Three-Attention Mechanisms for One-Stage 3-D Object Detection Based on LiDAR and Camera

Metadata Downloads
Abstract
This article studies one-stage 3-D object detection based on light detection and ranging (LiDAR) point clouds and red-green-blue (RGB) images that aims to boost 3-D object detection accuracy based on three attention mechanisms. Currently, most of the previous works converted LiDAR point clouds into bird's-eye-view (BEV) images, achieving a significant performance. However, they still have a problem due to partial height information (z-axis value) loss during the conversion. To eliminate this problem, the height information of the LiDAR point clouds is projected onto an RGB image and embedded into the original RGB image to generate a new image, named RGBD. This is the first attention mechanism to improve 3-D detection accuracy. Moreover, two other attention mechanisms extract more discriminative global and local features, respectively. Specifically, the global attention network is appended to a feature encoder, and the local attention network is used for the view-specific region of interest fusion. Massive experiments evaluated on the KITTI benchmark suite show that the proposed approach outperforms state-of-the-art LiDAR-Camera-based methods on the car class (easy, moderate, hard): 2-D (90.35%, 88.47%, 86.98%), 3-D (85.12%, 76.23%, 74.46%), and BEV (89.64%, 86.23%, 85.60%).
Author(s)
문리화조강현
Issued Date
2021
Type
Article
Keyword
3-D object detectionCameraCamerasFeature extractionLaser radarLiDARObject detectionone-stageProposalsthree attention mechanismsThree-dimensional displaysTwo dimensional displays
DOI
10.1109/TII.2020.3048719
URI
https://oak.ulsan.ac.kr/handle/2021.oak/9153
https://ulsan-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=TN_cdi_ieee_primary_9312480&context=PC&vid=ULSAN&lang=ko_KR&search_scope=default_scope&adaptor=primo_central_multiple_fe&tab=default_tab&query=any,contains,Three-Attention%20Mechanisms%20for%20One-Stage%203-D%20Object%20Detection%20Based%20on%20LiDAR%20and%20Camera&offset=0&pcAvailability=true
Publisher
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Location
미국
Language
영어
ISSN
1551-3203
Citation Volume
17
Citation Number
10
Citation Start Page
6655
Citation End Page
6663
Appears in Collections:
Engineering > IT Convergence
Authorize & License
  • Authorize공개
Files in This Item:
  • There are no files associated with this item.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.