[1] A. Sharaf and H. Daumé III, “Meta-Learning Effective Exploration Strategies for Contextual Bandits,” AAAI, vol. 35, no. 11, pp. 9541–9548, May 2021.