ZHONG, Chunlin; HOU, Qiuxia; ZHOU, Zhangjun; ZHANG, Yanhao; HAO, Shuang; LU, Haonan; TANG, He; BAI, Xiang. OwlCap: Harmonizing Motion-Detail for Video Captioning via HMD-270K and Caption Set Equivalence Reward. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 16, p. 13503–13511, 2026. DOI: 10.1609/aaai.v40i16.38355. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/38355. Accessed on: 25 May 2026.