Chen, Yitong, Wenhao Yao, Lingchen Meng, Sihong Wu, Zuxuan Wu, and Yu-Gang Jiang. “Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection.” Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 2 (April 11, 2025): 2320–2328. Accessed May 31, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32232.