[1] Jiang, Y. et al. 2021. Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 9 (May 2021), 7995–8003. DOI:https://doi.org/10.1609/aaai.v35i9.16975.