Wright, R., X. Qiao, S. Loscalzo, and L. Yu. “Improving Approximate Value Iteration With Complex Returns by Bounding”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29, no. 1, Feb. 2015, doi:10.1609/aaai.v29i1.9568.