Xue, Wangyu, et al. “ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 9, Apr. 2025, pp. 9050-8, doi:10.1609/aaai.v39i9.32979.