(1)
Robbins, W. Towards Multimodal Vision-Language Models Generating Non-Generic Text. AAAI 2022, 36, 13138-13139.