Zhang, S., B. Liu, and S. Whiteson. “Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, May 2021, pp. 10905-13, doi:10.1609/aaai.v35i12.17302.