Song, Yakun, Zhuo Chen, Xiaofei Wang, Ziyang Ma, and Xie Chen. “ELLA-V: Stable Neural Codec Language Modeling With Alignment-Guided Sequence Reordering”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 24 (April 11, 2025): 25174–25182. Accessed May 13, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/34703.