Kim, Minsu, Jeong Hun Yeo, and Yong Man Ro. 2022. “Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading.” Proceedings of the AAAI Conference on Artificial Intelligence 36 (1): 1174–82. https://doi.org/10.1609/aaai.v36i1.20003.