(1) Roy Chaudhuri, A.; Kalyanakrishnan, S. Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory. AAAI 2020, 34, 10085-10092.