Li, Yinghao Aaron, Xilin Jiang, Fei Tao, Cheng Niu, Kaifeng Xu, Juntong Song, and Nima Mesgarani. 2026. “DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (38):31814-22. https://doi.org/10.1609/aaai.v40i38.40450.