ZHANG, Tian-Hao; ZHANG, Jiawei; WANG, Jun; QIAN, Xinyuan; YIN, Xu-Cheng. FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 39, n. 24, p. 25922–25930, 2025. DOI: 10.1609/aaai.v39i24.34786. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/34786. Acesso em: 13 may. 2026.