[1]
S. Wang, H. Sun, and K. Li, “Preference Is More than Comparisons: Rethinking Dueling Bandits with Augmented Human Feedback”, AAAI, vol. 40, no. 31, pp. 26453–26461, Mar. 2026.