[1] S. Zhao, W. Cui, B. Jiang, L. Kong, and X. Yan, “Responsible Bandit Learning via Privacy-Protected Mean-Volatility Utility,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 19, pp. 21815-21822, Mar. 2024.