(1)
Wang, Z.; Bao, R.; Wu, Q.; Liu, S. Confidence-Aware Non-Repetitive Multimodal Transformers for TextCaps. AAAI 2021, 35, 2835-2843.