[1]

Zhang, S., Liu, B. and Whiteson, S. 2021. Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 12 (May 2021), 10905-10913. DOI:https://doi.org/10.1609/aaai.v35i12.17302.