Wang, K., Zou, Z., Deng, Q., Tao, J., Wu, R., Fan, C., Chen, L., & Cui, P. (2021). Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 35(5), 4427-4435. https://doi.org/10.1609/aaai.v35i5.16569