[1]

Z. Xu, “Joint Inference of Reward Machines and Policies for Reinforcement Learning”, ICAPS, vol. 30, no. 1, pp. 590-598, Jun. 2020.