(1) Srinivasan, P.; Knottenbelt, W. Behaviour Preference Regression for Offline Reinforcement Learning. AAAI 2025, 39, 20575-20583.