Yang, Shangdong, Yang Gao, Bo An, Hao Wang, and Xingguo Chen. 2016. “Efficient Average Reward Reinforcement Learning Using Constant Shifting Values”. Proceedings of the AAAI Conference on Artificial Intelligence 30 (1). https://doi.org/10.1609/aaai.v30i1.10285.