Wang, S., H. Wang, and L. Huang. “Adaptive Algorithms for Multi-Armed Bandit With Composite and Anonymous Feedback”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 11, May 2021, pp. 10210-7, doi:10.1609/aaai.v35i11.17224.