[1]
Chen, J. et al. 2024. EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 2 (Mar. 2024), 1110–1119. DOI:https://doi.org/10.1609/aaai.v38i2.27872.