Zhang, S., Liu, B., & Whiteson, S. (2021). Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 10905-10913. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17302