[1]
Y. Chen, W. Yao, L. Meng, S. Wu, Z. Wu, and Y.-G. Jiang, “Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection,” AAAI, vol. 39, no. 2, pp. 2320–2328, Apr. 2025.