(1)
Chen, J.; Guo, L.; Sun, J.; Shao, S.; Yuan, Z.; Lin, L.; Zhang, D. EVE: Efficient Vision-Language Pre-Training With Masked Prediction and Modality-Aware MoE. AAAI 2024, 38, 1110-1119.