Semantic Consistency Networks for 3D Object Detection

Wenwen Wei; Ping Wei; Nanning Zheng

doi:10.1609/aaai.v35i4.16392

Authors

Wenwen Wei Xi'an Jiaotong University
Ping Wei Xi'an Jiaotong University
Nanning Zheng Xi'an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v35i4.16392

Keywords:

3D Computer Vision, Object Detection & Categorization, Scene Analysis & Understanding, (Deep) Neural Network Algorithms

Abstract

Detecting 3D objects from point clouds is a significant yet challenging issue in many applications. While most existing approaches seek to leverage geometric information of point clouds, few studies accommodate the inherent semantic characteristics of each point and the consistency between the geometric and semantic cues. In this work, we propose a novel semantic consistency network (SCNet) driven by a natural principle: the class of a predicted 3D bounding box should be consistent with the classes of all the points inside this box. Specifically, our SCNet consists of a feature extraction structure, a detection decision structure, and a semantic segmentation structure. In inference, the feature extraction and the detection decision structures are used to detect 3D objects. In training, the semantic segmentation structure is jointly trained with the other two structures to produce more robust and applicative model parameters. A novel semantic consistency loss is proposed to regulate the output 3D object boxes and the segmented points to boost the performance. Our model is evaluated on two challenging datasets and achieves comparable results to the state-of-the-art methods.

Semantic Consistency Networks for 3D Object Detection

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information