Wu, Yihan, Yichen Lu, Yifan Peng, Xihua Wang, Ruihua Song, and Shinji Watanabe. “Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization.” Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 24 (April 11, 2025): 25516–25524. Accessed May 27, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/34741.