[1]
Z. Wang, R. Bao, Q. Wu, and S. Liu, “Confidence-aware Non-repetitive Multimodal Transformers for TextCaps,” AAAI, vol. 35, no. 4, pp. 2835–2843, May 2021.