Metcalf, K. (2024) “Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), pp. 10128–10136. doi: 10.1609/aaai.v38i9.28877.