Yang, Qisong, Thiago D. Simão, Simon H Tindemans, and Matthijs T. J. Spaan. 2021. “WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 35 (12):10639-46. https://doi.org/10.1609/aaai.v35i12.17272.