Kim, Donghyun, Kuniaki Saito, Kate Saenko, Stan Sclaroff, and Bryan Plummer. “MULE: Multimodal Universal Language Embedding”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11254–11261. Accessed May 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/6785.