Suin, M., & Rajagopalan, A. N. (2020). An Efficient Framework for Dense Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 12039–12046. https://doi.org/10.1609/aaai.v34i07.6881