LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images

Authors

  • Guichen Huang Beijing Institute of Technology Shenzhen MSU-BIT University
  • Ruoyu Wang Transcengram
  • Xiangjun Gao Hong Kong University of Science and Technology
  • Che Sun Shenzhen MSU-BIT University
  • Yuwei Wu Beijing Institute of Technology Shenzhen MSU-BIT University
  • Shenghua Gao University of Hong Kong Transcengram
  • Yunde Jia Shenzhen MSU-BIT University Beijing Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v40i6.42504

Abstract

3D Gaussian Splatting (3DGS) achieves high-fidelity novel view synthesis, but its application in online long-sequence scenarios is still restricted. Existing methods either rely on slow per-scene optimization or lack efficient frame-wise 3DGS updates, making them unsuitable for online long-sequence videos. In this paper, we propose LongSplat, an online real-time 3D Gaussian reconstruction framework designed for long-sequence image input. The core idea of LongSplat is to maintain a global 3DGS set and design a streaming 3DGS update mechanism that selectively compressing redundant historical Gaussians and introducing new Gaussians by comparing the current observations with the historical Gaussian. To achieve this goal, we design a Gaussian-Image Representation (GIR), which encodes 3D Gaussian parameters into a structured, image-like 2D format. GIR simultaneously enables identity-aware redundancy compression as well as the fusion of current view and historical Gaussians, which are used for online reconstruction and adapt the model to long sequences without overwhelming memory or computational costs. Extensive experiments demonstrate that LongSplat achieves state-of-the-art efficiency-quality trade-offs in real-time novel view synthesis, delivering real-time reconstruction while reducing Gaussian counts by 44% compared to per-pixel prediction paradigms.

Published

2026-03-14

How to Cite

Huang, G., Wang, R., Gao, X., Sun, C., Wu, Y., Gao, S., & Jia, Y. (2026). LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images. Proceedings of the AAAI Conference on Artificial Intelligence, 40(6), 4994–5002. https://doi.org/10.1609/aaai.v40i6.42504

Issue

Section

AAAI Technical Track on Computer Vision III