[1]

J. Yang and S. Ren, “Robust Bandit Learning with Imperfect Context”, AAAI, vol. 35, no. 12, pp. 10594-10602, May 2021.