Chen, W., Tian, J., Fan, C., Li, Y., He, H., & Jin, Y. (2023). Preference-Controlled Multi-Objective Reinforcement Learning for Conditional Text Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 12662–12672. https://doi.org/10.1609/aaai.v37i11.26490