Cao, Zhiguang, et al. “Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31, no. 1, Feb. 2017, doi:10.1609/aaai.v31i1.11170.