Yang, C.-C., W.-C. Fan, C.-F. Yang, and Y.-C. F. Wang. “Cross-Modal Mutual Learning for Audio-Visual Speech Recognition and Manipulation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 3, June 2022, pp. 3036-44, doi:10.1609/aaai.v36i3.20210.