VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface Reconstruction

Authors

  • Peng Xiang Tsinghua University
  • Liang Han Tsinghua University
  • Hui Zhang Tsinghua University
  • Yu-Shen Liu Tsinghua University
  • Zhizhong Han Wayne State University

DOI:

https://doi.org/10.1609/aaai.v40i13.38074

Abstract

Reconstructing a faithful geometric surface from sparse images remains a fundamental challenge in 3D computer vision. While recent methods have achieved remarkable progress, they still struggle to recover reliable geometry due to the lack of multi-view geometric cues, particularly in non-overlapping regions. To address this issue, we introduce VGGS, a Gaussian Splatting (GS) method that exploits multi-view geometric priors from VGGT for efficient and high-fidelity sparse-view surface reconstruction. Our primary contribution is an anchor-calibrated depth estimation scheme, which yields accurate depth maps. The insight is to align the VGGT depth prior to the underlying surface with a sparse set of multi-view consistent anchors, then infer depth for unreliable regions by relative depth estimation. Furthermore, to mitigate misalignment in complex scenes, we propose a relative depth consistency loss that penalizes the rendered depth if its relative depth relationship in local regions is inconsistent to the multi-view prior. Extensive experiments on widely-used benchmarks show that VGGS surpasses state-of-the-art methods in both accuracy and efficiency, delivering 4–7× faster optimization while reducing memory consumption compared to previous GS-based approaches.

Downloads

Published

2026-03-14

How to Cite

Xiang, P., Han, L., Zhang, H., Liu, Y.-S., & Han, Z. (2026). VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface Reconstruction. Proceedings of the AAAI Conference on Artificial Intelligence, 40(13), 10969-10977. https://doi.org/10.1609/aaai.v40i13.38074

Issue

Section

AAAI Technical Track on Computer Vision X