1.
Roy Chaudhuri A, Kalyanakrishnan S. Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory. AAAI [Internet]. 2020Apr.3 [cited 2026Apr.25];34(06):10085-92. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/6566