(1)
Jiang, W.; Cheng, Y.; Liu, L.; Fang, Y.; Peng, Y.; Liu, Y. Comprehensive Visual Grounding for Video Description. AAAI 2024, 38, 2552-2560.