Chen, Wenqing, Jidong Tian, Caoyun Fan, Yitian Li, Hao He, and Yaohui Jin. “Preference-Controlled Multi-Objective Reinforcement Learning for Conditional Text Generation”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 11 (June 26, 2023): 12662–12672. Accessed May 15, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/26490.