Pazis, J., & Parr, R. (2016). Efficient PAC-Optimal Exploration in Concurrent, Continuous State MDPs with Delayed Updates. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10307