Wang, Shengbo, et al. “Preference Is More Than Comparisons: Rethinking Dueling Bandits With Augmented Human Feedback”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 31, Mar. 2026, pp. 26453-61, doi:10.1609/aaai.v40i31.39852.