Wang, K., Zou, Z., Deng, Q., Tao, J., Wu, R., Fan, C., Chen, L., & Cui, P. (2021). Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 35(5), 4427-4435. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16569