Lee, J. S., Kim, J., Na, J., Park, J., & Kim, H. J. (2025). VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(4), 4499–4507. https://doi.org/10.1609/aaai.v39i4.32474