Chen, Yitong, et al. “Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 2, Apr. 2025, pp. 2320-28, doi:10.1609/aaai.v39i2.32232.