Diffusion-Based Contextual Reconstruction for Point Cloud Segmentation with Limited Annotations

Authors

  • Jiawei Lian Nanjing University of Science and Technology
  • Zhengxue Wang Nanjing University of Science and Technology
  • Wentao Qu Nanjing University of Science and Technology
  • Haobo Jiang Nanyang Technological University
  • Le Hui Northwest Polytechnical University Xi'an
  • Jian Yang Nanjing University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v40i8.37610

Abstract

Point cloud semantic segmentation is fundamental to 3D scene understanding, but dense annotation requirements limit scalability. Although recent label propagation and contrastive learning methods enhance local consistency, the incomplete object coverage caused by sparse annotations hinders global context modeling, ultimately limiting overall performance. To this end, we propose a diffusion-based contextual reconstruction framework for point cloud semantic segmentation with limited annotations. At its core, our framework guides denoising with semantic predictions, using better context reconstruction to enhance the conditional model for better segmentation. Specifically, our contributions include: (1) Diffusion-based segmentation framework: reconstructs contextual semantics from noise under conditional guidance, sharing the decoder with the segmentation module for robust contextual semantic learning. (2) Dynamically aggregates local context from segmentation features and guides denoising with global spatial structure, significantly enhancing denoising quality and contextual awareness. Notably, we pioneer diffusion models for 3D semantic segmentation with limited annotations, enabling efficient single-step inference. Experiments show robustness across varying annotation ratios and state-of-the-art performance on benchmarks.

Downloads

Published

2026-03-14

How to Cite

Lian, J., Wang, Z., Qu, W., Jiang, H., Hui, L., & Yang, J. (2026). Diffusion-Based Contextual Reconstruction for Point Cloud Segmentation with Limited Annotations. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6780–6788. https://doi.org/10.1609/aaai.v40i8.37610

Issue

Section

AAAI Technical Track on Computer Vision V