Flageat, Manon, Bryan Lim, and Antoine Cully. 2024. “Beyond Expected Return: Accounting for Policy Reproducibility When Evaluating Reinforcement Learning Algorithms.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (11): 12024–32. https://doi.org/10.1609/aaai.v38i11.29090.