[1] K. Xia, X. Zhu, J. Yao, W. Tian, W. Li, and L. Xie, “KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction”, AAAI, vol. 40, no. 40, pp. 34016–34024, Mar. 2026.