(1)
Zhang, T.; He, S.; Dai, T.; Wang, Z.; Chen, B.; Xia, S.-T. Vision-Language Pre-Training With Object Contrastive Learning for 3D Scene Understanding. AAAI 2024, 38, 7296-7304.