Du, Chenpeng, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, and Kai Yu. 2024. “UniCATS: A Unified Context-Aware Text-to-Speech Framework With Contextual VQ-Diffusion and Vocoding”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (16):17924-32. https://doi.org/10.1609/aaai.v38i16.29747.