[1] Chen, Y. et al. 2025. Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 2 (Apr. 2025), 2320–2328. DOI:https://doi.org/10.1609/aaai.v39i2.32232.