(1)
Li, W.; Mustafa, W.; Monteiro, M.; Wang, P.; Kloft, M.; Fellenz, S. TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning. AAAI 2026, 40, 37609-37617.