Cao, Z., H. Guo, J. Zhang, F. Oliehoek, and U. Fastenrath. “Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31, no. 1, Feb. 2017, doi:10.1609/aaai.v31i1.11170.