Kim, M., Yeo, J. H., & Ro, Y. M. (2022). Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 1174-1182. https://doi.org/10.1609/aaai.v36i1.20003