SCIR-Net: Structured Color Image Representation Based 3D Object Detection Network from Point Clouds

Qingdong He; Hao Zeng; Yi Zeng; Yijun Liu

doi:10.1609/aaai.v36i4.20371

Authors

Qingdong He University of Electronic Science and Technology of China Alibaba Group
Hao Zeng University of Electronic Science and Technology of China
Yi Zeng University of Electronic Science and Technology of China
Yijun Liu University of Electronic Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v36i4.20371

Keywords:

Domain(s) Of Application (APP), Computer Vision (CV)

Abstract

3D object detection from point cloud data has become an indispensable part of autonomous driving. Previous approaches to processing point clouds rely on either projection or voxelization. However, projection-based methods suffer from information loss, while voxelization-based methods incur a heavy computational cost. In this paper, we propose to encode point clouds into a structured color image representation (SCIR) and use a 2D CNN to perform the 3D detection task. Specifically, we use the structured color image encoding module to convert the irregular 3D point clouds into a square 2D tensor image, where each element corresponds to a point in 3D space. Furthermore, to fit the Euclidean structure, we apply feature normalization to parameterize the 2D tensor image onto a regular dense color image. Then, we perform repeated multi-scale fusion across different levels to augment the initial features and learn scale-aware feature representations for box prediction. Extensive experiments on the KITTI benchmark, the Waymo Open Dataset, and the more challenging nuScenes dataset show that our proposed method yields decent results and demonstrate the effectiveness of such representations for point clouds.
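The encoding idea in the abstract (pack an irregular point set into a square 2D tensor, then normalize it into a dense color image) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `encode_points_as_image`, the input-order placement of points, the zero padding, and the per-axis min-max scaling to 0..255 are all assumptions made for illustration, since the abstract does not specify these details.

```python
import math


def encode_points_as_image(points):
    """Pack N (x, y, z) points into a side x side grid of RGB-like triples.

    Hypothetical sketch: points are placed in input order and unused cells
    are zero-padded. The ordering used by SCIR-Net is not given in the abstract.
    """
    if not points:
        raise ValueError("need at least one point")

    # Smallest square that holds all points: side = ceil(sqrt(N)).
    side = math.isqrt(len(points) - 1) + 1

    # Per-axis extrema for min-max normalization (an assumed normalization).
    lows = [min(p[c] for p in points) for c in range(3)]
    highs = [max(p[c] for p in points) for c in range(3)]

    def to_color(p):
        # Scale each coordinate to an integer in [0, 255]; a constant axis maps to 0.
        return tuple(
            round(255 * (p[c] - lows[c]) / (highs[c] - lows[c]))
            if highs[c] > lows[c] else 0
            for c in range(3)
        )

    cells = [to_color(p) for p in points]
    cells += [(0, 0, 0)] * (side * side - len(cells))  # pad to a full square
    return [cells[r * side:(r + 1) * side] for r in range(side)]


if __name__ == "__main__":
    demo = [(0, 0, 0), (1, 2, 4), (2, 4, 8), (0.5, 1, 2), (1, 1, 1)]
    image = encode_points_as_image(demo)
    print(len(image), "x", len(image[0]))
```

The resulting grid has a regular layout that a standard 2D CNN can consume, which is the property the abstract relies on. A real pipeline would also need a point ordering that preserves spatial locality, which this sketch does not attempt.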
