(1) Wang, K.; Zou, Z.; Deng, Q.; Tao, J.; Wu, R.; Fan, C.; Chen, L.; Cui, P. Reinforcement Learning With a Disentangled Universal Value Function for Item Recommendation. AAAI 2021, 35, 4427-4435.