Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images

Authors

  • Yihui Li State Key Laboratory of Complex and Critical Software Environment, Beijing, China School of Computer Science and Engineering, Beihang University, China
  • Chengxin Lv State Key Laboratory of Complex and Critical Software Environment, Beijing, China School of Computer Science and Engineering, Beihang University, China
  • Hongyu Yang School of Artificial Intelligence, Beihang University, China Shanghai Artificial Intelligence Laboratory, Shanghai, China
  • Di Huang State Key Laboratory of Complex and Critical Software Environment, Beijing, China School of Computer Science and Engineering, Beihang University, China

DOI:

https://doi.org/10.1609/aaai.v39i5.32536

Abstract

3D reconstruction from unconstrained image collections presents substantial challenges due to varying appearances and transient occlusions. In this paper, we introduce Micro-macro Wavelet-based Gaussian Splatting (MW-GS), a novel approach designed to enhance 3D reconstruction by disentangling scene representations into global, refined, and intrinsic components. The proposed method features two key innovations: Micro-macro Projection, which allows Gaussian points to capture details from feature maps across multiple scales with enhanced diversity; and Wavelet-based Sampling, which leverages frequency domain information to refine feature representations and significantly improve the modeling of scene appearances. Additionally, we incorporate a Hierarchical Residual Fusion Network to seamlessly integrate these features. Extensive experiments demonstrate that MW-GS delivers state-of-the-art rendering performance, surpassing existing methods.

Downloads

Published

2025-04-11

How to Cite

Li, Y., Lv, C., Yang, H., & Huang, D. (2025). Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images. Proceedings of the AAAI Conference on Artificial Intelligence, 39(5), 5057–5065. https://doi.org/10.1609/aaai.v39i5.32536

Issue

Section

AAAI Technical Track on Computer Vision IV