Li, X., Jiang, S., & Han, J. (2019). Learning Object Context for Dense Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 8650-8657. https://doi.org/10.1609/aaai.v33i01.33018650