Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception

Authors

  • Ruihang Li Zhejiang University
  • Tao Li Zhejiang University
  • Shanding Ye Zhejiang University
  • Kaikai Xiao Zhejiang University
  • Zheng Huangnan Zhejiang University
  • Zhe Yin Zhejiang University
  • Zhijie Pan Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v39i5.32518

Abstract

3D occupancy perception accurately estimates the volumetric status and semantic labels of a scene, attracting significant attention in the field of autonomous driving. However, enhancing the model's ability to generalize across different driving scenarios or sensing systems, often requires redesigning the model or extra-expensive annotations. To this end, following a comprehensive analysis of the occupancy model architecture, we proposed the UGOCC method that utilizes domain adaptation to efficiently harness unlabeled autonomous driving data, thereby enhancing the model's generalizability. Specifically, we design the depth fusion module by employing self-supervised depth estimation, and propose a strategy based on semantic attention and domain adversarial learning to improve the generalizability of the learnable fusion module. Additionally, we propose an OCC-specific pseudo-label selection tailored for semi-supervised learning, which optimizes the overall network's generalizability. Our experiment results on two challenging datasets nuScenes and Waymo, demonstrate that our method not only achieves state-of-the-art generalizability but also enhances the model's perceptual capabilities within the source domain by utilizing unlabeled data.

Published

2025-04-11

How to Cite

Li, R., Li, T., Ye, S., Xiao, K., Huangnan, Z., Yin, Z., & Pan, Z. (2025). Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception. Proceedings of the AAAI Conference on Artificial Intelligence, 39(5), 4896–4904. https://doi.org/10.1609/aaai.v39i5.32518

Issue

Section

AAAI Technical Track on Computer Vision IV