(1) Wang, S.; Wang, H.; Huang, L. Adaptive Algorithms for Multi-Armed Bandit With Composite and Anonymous Feedback. AAAI 2021, 35, 10210-10217.