[1]
L. Xu, J. Honda, and M. Sugiyama, “Dueling Bandits with Qualitative Feedback”, AAAI, vol. 33, no. 01, pp. 5549-5556, Jul. 2019.