Chen, Liqun, Ke Bai, Chenyang Tao, Yizhe Zhang, Guoyin Wang, Wenlin Wang, Ricardo Henao, and Lawrence Carin. “Sequence Generation with Optimal-Transport-Enhanced Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 7512–7520. Accessed May 30, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/6249.