ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects

Qihang Cao; Huangxun Chen

doi:10.1609/aaai.v39i2.32190

ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects

Authors

Qihang Cao Shanghai Jiao Tong University Hong Kong University of Science and Technology (Guangzhou)
Huangxun Chen Hong Kong University of Science and Technology (Guangzhou)

DOI:

https://doi.org/10.1609/aaai.v39i2.32190

Abstract

3D scene understanding is an important task, and there has been a recent surge of research interest in aligning 3D representations of point clouds with text to empower embodied AI. However, due to the lack of comprehensive 3D benchmarks, the capabilities of 3D models in real-world scenes, particularly those that are challenging with subtly distinguished objects, remain insufficiently investigated. To facilitate a more thorough evaluation of 3D models' capabilities, we propose a scheme, ObjVariantEnsemble, to systematically introduce more scenes with specified object classes, colors, shapes, quantities, and spatial relationships to meet model evaluation needs. More importantly, we intentionally construct scenes with similar objects to a certain degree and design an LLM-VLM-cooperated annotator to capture key distinctions as annotations. The resultant benchmark can better challenge 3D models, reveal their shortcomings in understanding, and potentially aid in the further development of 3D models.

AAAI-25 / IAAI-25 / EAAI-25 Proceedings Cover

Downloads

PDF
Poster

Published

2025-04-11

How to Cite

Cao, Q., & Chen, H. (2025). ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects. Proceedings of the AAAI Conference on Artificial Intelligence, 39(2), 1944–1952. https://doi.org/10.1609/aaai.v39i2.32190

Download Citation

Issue

Vol. 39 No. 2: AAAI-25 Technical Tracks 2

Section

AAAI Technical Track on Computer Vision I

ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information