[1]
Lee, J.S. et al. 2025. VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 4 (Apr. 2025), 4499–4507. DOI:https://doi.org/10.1609/aaai.v39i4.32474.