Lewis, Alan, and Tim Miller. “Deceptive Reinforcement Learning in Model-Free Domains.” Proceedings of the International Conference on Automated Planning and Scheduling 33, no. 1 (July 1, 2023): 587–595. Accessed May 25, 2026. https://ojs.aaai.org/index.php/ICAPS/article/view/27240.