BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion

Authors

  • Dequan Kong Nanjing University of Aeronautics and Astronautics
  • Honghua Chen Lingnan University
  • Zhe Zhu Nanjing University of Aeronautics and Astronautics
  • Mingqiang Wei Nanjing University of Aeronautics and Astronautics

DOI:

https://doi.org/10.1609/aaai.v40i7.37493

Abstract

Existing diffusion-based 3D shape completion methods typically use a conditional paradigm, injecting incomplete shape information into the denoising network via deep feature interactions (e.g., concatenation, cross-attention) to guide sampling toward complete shapes, often represented by voxel-based distance functions. However, these approaches fail to explicitly model the optimal global transport path, leading to suboptimal completions. Moreover, performing diffusion directly in voxel space imposes resolution constraints, limiting the generation of fine-grained geometric details. To address these challenges, we propose BridgeShape, a novel framework for 3D shape completion via latent diffusion Schrödinger bridge. The key innovations lie in two aspects: (i) BridgeShape formulates shape completion as an optimal transport problem, explicitly modeling the transition between incomplete and complete shapes to ensure a globally coherent transformation. (ii) We introduce a Depth-Enhanced Vector Quantized Variational Autoencoder (VQ-VAE) to encode 3D shapes into a compact latent space, leveraging self-projected multi-view depth information enriched with strong DINOv2 features to enhance geometric structural perception. By operating in a compact yet structurally informative latent space, BridgeShape effectively mitigates resolution constraints and enables more efficient and high-fidelity 3D shape completion. BridgeShape achieves state-of-the-art performance on 3D shape completion benchmarks, demonstrating superior fidelity at higher resolutions and for unseen object classes.

Downloads

Published

2026-03-14

How to Cite

Kong, D., Chen, H., Zhu, Z., & Wei, M. (2026). BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion. Proceedings of the AAAI Conference on Artificial Intelligence, 40(7), 5726–5734. https://doi.org/10.1609/aaai.v40i7.37493

Issue

Section

AAAI Technical Track on Computer Vision IV