[1] W. Fang, T. Zhang, and A. Chan, “To Align or Not to Align: Strategic Multimodal Representation Alignment for Optimal Performance,” AAAI, vol. 40, no. 25, pp. 21056–21064, Mar. 2026.