(1)
Zhong, C.; Hou, Q.; Zhou, Z.; Zhang, Y.; Hao, S.; Lu, H.; Tang, H.; Bai, X. OwlCap: Harmonizing Motion-Detail for Video Captioning via HMD-270K and Caption Set Equivalence Reward. AAAI 2026, 40, 13503-13511.