(1)
Jiang, Y.; Bharadwaj, S.; Wu, B.; Shah, R.; Topcu, U.; Stone, P. Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks. AAAI 2021, 35, 7995-8003.