(1) Xue, W.; Qian, C.; Wu, J.; Zhou, Y.; Liu, W.; Ren, J.; Fan, S.; Zhang, Y. ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries. AAAI 2025, 39, 9050-9058.