HDGS: Hierarchical Dynamic Gaussian Splatting for Urban Driving Scenes

Authors

  • Fudong Ge State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA School of Artificial Intelligence, University of Chinese Academy of Sciences Beijing Key Laboratory of Super Intelligent Security of Multi-Modal Information
  • Jin Gao State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA School of Artificial Intelligence, University of Chinese Academy of Sciences Beijing Key Laboratory of Super Intelligent Security of Multi-Modal Information
  • Hanshi Wang State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA School of Artificial Intelligence, University of Chinese Academy of Sciences Beijing Key Laboratory of Super Intelligent Security of Multi-Modal Information
  • Yiwei Zhang State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA School of Artificial Intelligence, University of Chinese Academy of Sciences Beijing Key Laboratory of Super Intelligent Security of Multi-Modal Information
  • Ke Wang KargoBot
  • Weiming Hu State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), CASIA School of Artificial Intelligence, University of Chinese Academy of Sciences Beijing Key Laboratory of Super Intelligent Security of Multi-Modal Information School of Information Science and Technology, ShanghaiTech University
  • Zhipeng Zhang AutoLab, School of Artificial Intelligence, Shanghai Jiao Tong University Anyverse Robotics

DOI:

https://doi.org/10.1609/aaai.v40i6.42419

Abstract

This paper tackles the challenging task of achieving storage-efficient yet high-fidelity motion representation in large-scale dynamic 3D Gaussian Splatting. Our motivation stems from the truth that existing urban-scale methods, which rely on massive and unstructured individual Gaussians for scene modeling, face a critical scalability bottleneck. Inspired by recent advances in the 3DGS-based compression beyond autonomous driving, we address this challenge by leveraging the compression capability of anchor-driven methods. However, this is non-trivial as our exploratory experiments reveal that the direct application of this paradigm to dynamic, large-scale urban scenes results in performance degradation. We attribute this phenomenon to the hierarchical anchor design that severely loses dynamic information. To this end, we propose Hierarchical Dynamic Gaussian Splatting (HDGS), a novel framework designed to adapt the anchor-based Gaussian paradigm to 4D urban environments. We first establish a local support network to reinforce inter-anchor consistency, mitigating geometric and appearance fractures caused by supervision attenuation in deep hierarchies. Then, we handle heterogeneous object motion via coarse-to-fine decomposition, where high-level anchors model coarse dynamics and low-level anchors refine them with residual deformations. Third, we introduce a hybrid supervision scheme that fuses global geometric constraints and local pixel-level cues to alleviate geometrically inconsistent reconstruction under sparse LiDAR. Extensive experiments show that HDGS reduces storage by 69.0% while maintaining or even improving rendering fidelity compared to state-of-the-art methods.

Published

2026-03-14

How to Cite

Ge, F., Gao, J., Wang, H., Zhang, Y., Wang, K., Hu, W., & Zhang, Z. (2026). HDGS: Hierarchical Dynamic Gaussian Splatting for Urban Driving Scenes. Proceedings of the AAAI Conference on Artificial Intelligence, 40(6), 4230–4238. https://doi.org/10.1609/aaai.v40i6.42419

Issue

Section

AAAI Technical Track on Computer Vision III