Jiang, Yuqian, Suda Bharadwaj, Bo Wu, Rishi Shah, Ufuk Topcu, and Peter Stone. “Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks”. Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 9 (May 18, 2021): 7995–8003. Accessed May 29, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/16975.