Li, W., Mustafa, W., Monteiro, M., Wang, P., Kloft, M., & Fellenz, S. (2026). TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 37609–37617. https://doi.org/10.1609/aaai.v40i44.41095