Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Qiangeng Xu; Yiqi Zhong; Ulrich Neumann

doi:10.1609/aaai.v36i3.20194

Authors

Qiangeng Xu University of Southern California
Yiqi Zhong University of Southern California
Ulrich Neumann University of Southern California

DOI:

https://doi.org/10.1609/aaai.v36i3.20194

Keywords:

Computer Vision (CV), Cognitive Modeling & Cognitive Systems (CMS), Machine Learning (ML), Domain(s) Of Application (APP)

Abstract

Advances in LiDAR sensors provide rich 3D data that supports 3D scene understanding. However, due to occlusion and signal miss, LiDAR point clouds are in practice 2.5D as they cover only partial underlying shapes, which poses a fundamental challenge to 3D perception. To tackle the challenge, we present a novel LiDAR-based 3D object detection model, dubbed Behind the Curtain Detector (BtcDet), which learns the object shape priors and estimates the complete object shapes that are partially occluded (curtained) in point clouds. BtcDet first identifies the regions that are affected by occlusion and signal miss. In these regions, our model predicts the probability of occupancy that indicates if a region contains object shapes and integrates this probability map with detection features and generates high-quality 3D proposals. Finally, the occupancy estimation is integrated into the proposal refinement module to generate accurate bounding boxes. Extensive experiments on the KITTI Dataset and the Waymo Open Dataset demonstrate the effectiveness of BtcDet. Particularly for the 3D detection of both cars and cyclists on the KITTI benchmark, BtcDet surpasses all of the published state-of-the-art methods by remarkable margins. Code is released.

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription