Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation
DOI:
https://doi.org/10.1609/aaai.v40i14.38142Abstract
3D Gaussian Splatting (3D-GS) has emerged as an efficient 3D representation and a promising foundation for semantic tasks like segmentation. However, existing 3D-GS-based segmentation methods typically rely on high-dimensional category features, which introduce substantial memory overhead. Moreover, fine-grained segmentation remains challenging due to label space congestion and the lack of stable multi-granularity control mechanisms. To address these limitations, we propose a coarse-to-fine binary encoding scheme for per-Gaussian category representation, which compresses each feature into a single integer via the binary-to-decimal mapping, drastically reducing memory usage. We further design a progressive training strategy that decomposes panoptic segmentation into a series of independent sub-tasks, reducing inter-class conflicts and thereby enhancing fine-grained segmentation capability. Additionally, we fine-tune opacity during segmentation training to address the incompatibility between photometric rendering and semantic segmentation, which often leads to foreground-background confusion. Extensive experiments on multiple benchmarks demonstrate that our method achieves state-of-the-art segmentation performance while significantly reducing memory consumption and accelerating inference.Published
2026-03-14
How to Cite
Yang, A., Liu, C., Du, J., Gao, J., Pan, J., Hu, J., Yin, B., Yin, B., & Liu, C. (2026). Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11586-11594. https://doi.org/10.1609/aaai.v40i14.38142
Issue
Section
AAAI Technical Track on Computer Vision XI