[1]
Z. Xie, Y. Yang, Y. Yu, J. Wang, Y. Jiang, and X. Wu, “Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-Learning”, AAAI, vol. 39, no. 8, pp. 8771–8779, Apr. 2025.