Cao, Z., Guo, H., Zhang, J., Oliehoek, F., & Fastenrath, U. (2017). Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.11170