Srinivasan, Padmanaba, and William Knottenbelt. “Behaviour Preference Regression for Offline Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 19, Apr. 2025, pp. 20575-83, doi:10.1609/aaai.v39i19.34267.