Robbins, Wes. 2022. “Towards Multimodal Vision-Language Models Generating Non-Generic Text”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (11):13138-39. https://doi.org/10.1609/aaai.v36i11.21705.