Scarlini, B., Pasini, T., & Navigli, R. (2022). Visual Definition Modeling: Challenging Vision & Language Models to Define Words and Objects. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 11267–11275. https://doi.org/10.1609/aaai.v36i10.21377