(1)
Wang, S.; Sun, H.; Li, K. Preference Is More Than Comparisons: Rethinking Dueling Bandits With Augmented Human Feedback. AAAI 2026, 40, 26453-26461.