Xu Z, Gavran I, Ahmad Y, Majumdar R, Neider D, Topcu U, Wu B. Joint Inference of Reward Machines and Policies for Reinforcement Learning. ICAPS [Internet]. 2020Jun.1 [cited 2024Oct.6];30(1):590-8. Available from: https://ojs.aaai.org/index.php/ICAPS/article/view/6756