Yang, B., Zou, Y., Liu, F., & Zhang, C. (2021). Non-Autoregressive Coarse-to-Fine Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(4), 3119–3127. https://doi.org/10.1609/aaai.v35i4.16421