CHENG, Ji; LAI, Song; YAO, Shunyu; XUE, Bo. Offline Multi-Objective Bandits: From Logged Data to Pareto-Optimal Policies. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 43, p. 36636–36644, 2026. DOI: 10.1609/aaai.v40i43.40987. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/40987. Accessed on: 15 May 2026.