1.
Yang S, Gao Y, An B, Wang H, Chen X. Efficient Average Reward Reinforcement Learning Using Constant Shifting Values. AAAI [Internet]. 2016Mar.2 [cited 2024Apr.28];30(1). Available from: https://ojs.aaai.org/index.php/AAAI/article/view/10285