Singh, A. J., A. Kumar, and H. C. Lau. “Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL”. Proceedings of the International Conference on Automated Planning and Scheduling, vol. 31, no. 1, May 2021, pp. 588-96, doi:10.1609/icaps.v31i1.16007.