(1)
Song, L.; Liu, J.; Qian, B.; Chen, Y. Connecting Language to Images: A Progressive Attention-Guided Network for Simultaneous Image Captioning and Language Grounding. AAAI 2019, 33, 8885-8892.