Wang, K., Z. Zou, Q. Deng, J. Tao, R. Wu, C. Fan, L. Chen, and P. Cui. “Reinforcement Learning With a Disentangled Universal Value Function for Item Recommendation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 5, May 2021, pp. 4427-35, https://ojs.aaai.org/index.php/AAAI/article/view/16569.