[1]
A. Roy Chaudhuri and S. Kalyanakrishnan, “Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory”, AAAI, vol. 34, no. 06, pp. 10085-10092, Apr. 2020.