Three-Attention Mechanisms for One-Stage 3-D Object Detection Based on LiDAR and Camera
- Abstract
- This article studies one-stage 3-D object detection based on light detection and ranging (LiDAR) point clouds and red-green-blue (RGB) images that aims to boost 3-D object detection accuracy based on three attention mechanisms. Currently, most of the previous works converted LiDAR point clouds into bird's-eye-view (BEV) images, achieving a significant performance. However, they still have a problem due to partial height information (z-axis value) loss during the conversion. To eliminate this problem, the height information of the LiDAR point clouds is projected onto an RGB image and embedded into the original RGB image to generate a new image, named RGBD. This is the first attention mechanism to improve 3-D detection accuracy. Moreover, two other attention mechanisms extract more discriminative global and local features, respectively. Specifically, the global attention network is appended to a feature encoder, and the local attention network is used for the view-specific region of interest fusion. Massive experiments evaluated on the KITTI benchmark suite show that the proposed approach outperforms state-of-the-art LiDAR-Camera-based methods on the car class (easy, moderate, hard): 2-D (90.35%, 88.47%, 86.98%), 3-D (85.12%, 76.23%, 74.46%), and BEV (89.64%, 86.23%, 85.60%).
- Author(s)
- 문리화; 조강현
- Issued Date
- 2021
- Type
- Article
- Keyword
- 3-D object detection; Camera; Cameras; Feature extraction; Laser radar; LiDAR; Object detection; one-stage; Proposals; three attention mechanisms; Three-dimensional displays; Two dimensional displays
- DOI
- 10.1109/TII.2020.3048719
- URI
- https://oak.ulsan.ac.kr/handle/2021.oak/9153
https://ulsan-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=TN_cdi_ieee_primary_9312480&context=PC&vid=ULSAN&lang=ko_KR&search_scope=default_scope&adaptor=primo_central_multiple_fe&tab=default_tab&query=any,contains,Three-Attention%20Mechanisms%20for%20One-Stage%203-D%20Object%20Detection%20Based%20on%20LiDAR%20and%20Camera&offset=0&pcAvailability=true
- Publisher
- IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
- Location
- 미국
- Language
- 영어
- ISSN
- 1551-3203
- Citation Volume
- 17
- Citation Number
- 10
- Citation Start Page
- 6655
- Citation End Page
- 6663
-
Appears in Collections:
- Engineering > IT Convergence
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.