(1)
Hu, X.; Yin, X.; Lin, K.; Zhang, L.; Gao, J.; Wang, L.; Liu, Z. VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning. AAAI 2021, 35, 1575-1583.