Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation

Authors

  • An Yang NERC-SLIP, University of Science and Technology of China
  • Chenyu Liu iFLYTEK Research
  • Jun Du NERC-SLIP, University of Science and Technology of China
  • Jianqing Gao iFLYTEK Research
  • Jia Pan iFLYTEK Research
  • Jinshui Hu iFLYTEK Research
  • Baocai Yin iFLYTEK Research
  • Bing Yin iFLYTEK Research
  • Cong Liu iFLYTEK Research

DOI:

https://doi.org/10.1609/aaai.v40i14.38142

Abstract

3D Gaussian Splatting (3D-GS) has emerged as an efficient 3D representation and a promising foundation for semantic tasks like segmentation. However, existing 3D-GS-based segmentation methods typically rely on high-dimensional category features, which introduce substantial memory overhead. Moreover, fine-grained segmentation remains challenging due to label space congestion and the lack of stable multi-granularity control mechanisms. To address these limitations, we propose a coarse-to-fine binary encoding scheme for per-Gaussian category representation, which compresses each feature into a single integer via the binary-to-decimal mapping, drastically reducing memory usage. We further design a progressive training strategy that decomposes panoptic segmentation into a series of independent sub-tasks, reducing inter-class conflicts and thereby enhancing fine-grained segmentation capability. Additionally, we fine-tune opacity during segmentation training to address the incompatibility between photometric rendering and semantic segmentation, which often leads to foreground-background confusion. Extensive experiments on multiple benchmarks demonstrate that our method achieves state-of-the-art segmentation performance while significantly reducing memory consumption and accelerating inference.

Downloads

Published

2026-03-14

How to Cite

Yang, A., Liu, C., Du, J., Gao, J., Pan, J., Hu, J., Yin, B., Yin, B., & Liu, C. (2026). Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11586-11594. https://doi.org/10.1609/aaai.v40i14.38142

Issue

Section

AAAI Technical Track on Computer Vision XI