Cao, Z. (2017) “Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method”, Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). doi: 10.1609/aaai.v31i1.11170.