[1]
Jiang, Y., Bharadwaj, S., Wu, B., Shah, R., Topcu, U. and Stone, P. 2021. Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 9 (May 2021), 7995-8003. DOI:https://doi.org/10.1609/aaai.v35i9.16975.