Xia, Kangxiang, Xinfa Zhu, Jixun Yao, Wenjie Tian, Wenhao Li, and Lei Xie. “KALL-E: Autoregressive Speech Synthesis With Next-Distribution Prediction”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 40 (March 14, 2026): 34016–34024. Accessed May 9, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40695.