CasFusionNet: A Cascaded Network for Point Cloud Semantic Scene Completion by Dense Feature Fusion

Authors

  • Jinfeng Xu Huazhong University of Science and Technology
  • Xianzhi Li Huazhong University of Science and Technology
  • Yuan Tang Huazhong University of Science and Technology
  • Qiao Yu Huazhong University of Science and Technology
  • Yixue Hao Huazhong University of Science and Technology
  • Long Hu Huazhong University of Science and Technology
  • Min Chen Huazhong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i3.25405

Keywords:

CV: 3D Computer Vision, CV: Applications, CV: Scene Analysis & Understanding, CV: Segmentation, CV: Vision for Robotics & Autonomous Driving, ML: Deep Neural Architectures, ML: Deep Neural Network Algorithms

Abstract

Semantic scene completion (SSC) aims to complete a partial 3D scene and predict its semantics simultaneously. Most existing works adopt the voxel representations, thus suffering from the growth of memory and computation cost as the voxel resolution increases. Though a few works attempt to solve SSC from the perspective of 3D point clouds, they have not fully exploited the correlation and complementarity between the two tasks of scene completion and semantic segmentation. In our work, we present CasFusionNet, a novel cascaded network for point cloud semantic scene completion by dense feature fusion. Specifically, we design (i) a global completion module (GCM) to produce an upsampled and completed but coarse point set, (ii) a semantic segmentation module (SSM) to predict the per-point semantic labels of the completed points generated by GCM, and (iii) a local refinement module (LRM) to further refine the coarse completed points and the associated labels from a local perspective. We organize the above three modules via dense feature fusion in each level, and cascade a total of four levels, where we also employ feature fusion between each level for sufficient information usage. Both quantitative and qualitative results on our compiled two point-based datasets validate the effectiveness and superiority of our CasFusionNet compared to state-of-the-art methods in terms of both scene completion and semantic segmentation. The codes and datasets are available at: https://github.com/JinfengX/CasFusionNet.

Downloads

Published

2023-06-26

How to Cite

Xu, J., Li, X., Tang, Y., Yu, Q., Hao, Y., Hu, L., & Chen, M. (2023). CasFusionNet: A Cascaded Network for Point Cloud Semantic Scene Completion by Dense Feature Fusion. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3018-3026. https://doi.org/10.1609/aaai.v37i3.25405

Issue

Section

AAAI Technical Track on Computer Vision III