1.
Xu Z, Gavran I, Ahmad Y, Majumdar R, Neider D, Topcu U, et al. Joint Inference of Reward Machines and Policies for Reinforcement Learning. ICAPS [Internet]. 2020 Jun. 1 [cited 2026 May 28];30(1):590-8. Available from: https://ojs.aaai.org/index.php/ICAPS/article/view/6756