Derstroff, Cedric, Mattia Cerrato, Jannis Brugger, Jan Peters, and Stefan Kramer. “Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 10 (March 24, 2024): 11766–11774. Accessed May 18, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/29061.