Chen, Junyi, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, and Dongyu Zhang. 2024. “EVE: Efficient Vision-Language Pre-Training With Masked Prediction and Modality-Aware MoE”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (2):1110-19. https://doi.org/10.1609/aaai.v38i2.27872.