[1] L. Cao, M. Shi, and N. B. Shroff, “Provably Efficient Multi-Objective Bandit Algorithms Under Preference-Centric Customization,” AAAI, vol. 40, no. 24, pp. 19889–19897, Mar. 2026.