(1)
Cheng, J.; Lai, S.; Yao, S.; Xue, B. Offline Multi-Objective Bandits: From Logged Data to Pareto-Optimal Policies. AAAI 2026, 40, 36636-36644.