Yang, Qisong, et al. “WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, May 2021, pp. 10639-46, doi:10.1609/aaai.v35i12.17272.