Mguni, David, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Wenbin Song, Feifei Tong, Matthew Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, and Yaodong Yang. “Learning to Shape Rewards Using a Game of Two Partners”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 10 (June 26, 2023): 11604-11612. Accessed May 22, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/26371.