Zhang, T.-H., Zhang, J., Wang, J., Qian, X., & Yin, X.-C. (2025). FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles. Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), 25922–25930. https://doi.org/10.1609/aaai.v39i24.34786