Cheng, Ji, et al. “Offline Multi-Objective Bandits: From Logged Data to Pareto-Optimal Policies”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 43, Mar. 2026, pp. 36636-44, doi:10.1609/aaai.v40i43.40987.