Das Gupta, U., Talvitie, E., & Bowling, M. (2015). Policy Tree: Adaptive Representation for Policy Gradient. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9613