1.
Mguni D, Jafferjee T, Wang J, Perez-Nieves N, Song W, Tong F, Taylor M, Yang T, Dai Z, Chen H, Zhu J, Shao K, Wang J, Yang Y. Learning to Shape Rewards Using a Game of Two Partners. AAAI [Internet]. 2023 Jun 26 [cited 2024 May 21];37(10):11604-12. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26371