Metcalf, K., Sarabia, M., Fedzechkina, M., & Theobald, B.-J. (2024). Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10128–10136. https://doi.org/10.1609/aaai.v38i9.28877