Zhang, Hongbo, Guang Wang, Xu Wang, Zhengyang Zhou, Chen Zhang, Zheng Dong, and Yang Wang. 2024. “NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (1): 401–9. https://doi.org/10.1609/aaai.v38i1.27794.