Chen, H., X. Dai, H. Cai, W. Zhang, X. Wang, R. Tang, Y. Zhang, and Y. Yu. “Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 3312-20, doi:10.1609/aaai.v33i01.33013312.