Liang, Yifan, et al. “SLD-L2S: Hierarchical Subspace Latent Diffusion for High-Fidelity Lip to Speech Synthesis”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 38, Mar. 2026, pp. 31943-51, doi:10.1609/aaai.v40i38.40464.