Wang, S., H. Wang, and L. Huang. “Adaptive Algorithms for Multi-Armed Bandit With Composite and Anonymous Feedback”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 11, May 2021, pp. 10210-7, https://ojs.aaai.org/index.php/AAAI/article/view/17224.