[1]
W. Li, W. Mustafa, M. Monteiro, P. Wang, M. Kloft, and S. Fellenz, “TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning”, AAAI, vol. 40, no. 44, pp. 37609–37617, Mar. 2026.