[1] Y. Wu, Y. Lu, Y. Peng, X. Wang, R. Song, and S. Watanabe, “Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization,” AAAI, vol. 39, no. 24, pp. 25516–25524, Apr. 2025.