[1]

Cao, Z. et al. 2017. Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method. Proceedings of the AAAI Conference on Artificial Intelligence. 31, 1 (Feb. 2017). DOI:https://doi.org/10.1609/aaai.v31i1.11170.