Large Images Are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting

Authors

  • Lingting Zhu The University of Hong Kong
  • Guying Lin Carnegie Mellon University
  • Jinnan Chen National University of Singapore
  • Xinjie Zhang The Hong Kong University of Science and Technology
  • Zhenchao Jin The University of Hong Kong
  • Zhao Wang The Chinese University of Hong Kong
  • Lequan Yu The University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v39i10.33193

Abstract

While Implicit Neural Representations (INRs) have demonstrated significant success in image representation, they are often hindered by large training memory and slow decoding speed. Recently, Gaussian Splatting (GS) has emerged as a promising solution in 3D reconstruction due to its highquality novel view synthesis and rapid rendering capabilities, positioning it as a valuable tool for a broad spectrum of applications. In particular, a GS-based representation, 2DGS, has shown potential for image fitting. In our work, we present Large Images are Gaussians (LIG), which delves deeper into the application of 2DGS for image representations, addressing the challenge of fitting large images with 2DGS in the situation of numerous Gaussian points, through two distinct modifications: 1) we adopt a variant of representation and optimization strategy, facilitating the fitting of a large number of Gaussian points; 2) we propose a Level-of-Gaussian approach for reconstructing both coarse low-frequency initialization and fine high-frequency details. Consequently, we successfully represent large images as Gaussian points and achieve high-quality large image representation, demonstrating its efficacy across various types of large images.

Downloads

Published

2025-04-11

How to Cite

Zhu, L., Lin, G., Chen, J., Zhang, X., Jin, Z., Wang, Z., & Yu, L. (2025). Large Images Are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting. Proceedings of the AAAI Conference on Artificial Intelligence, 39(10), 10977–10985. https://doi.org/10.1609/aaai.v39i10.33193

Issue

Section

AAAI Technical Track on Computer Vision IX