Zhou, S., Y. Zhou, Y. He, X. Zhou, J. Wang, W. Deng, and J. Shu. “IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 41, Mar. 2026, pp. 35139-48, doi:10.1609/aaai.v40i41.40820.