Wright, R., Qiao, X., Loscalzo, S., & Yu, L. (2015). Improving Approximate Value Iteration with Complex Returns by Bounding. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9568