Zhong, X., Li, Z., Chen, S., Jiang, K., Chen, C. and Ye, M. (2023) “Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), pp. 3724-3732. doi: 10.1609/aaai.v37i3.25484.