Yang, Q., T. D. Simão, S. H. Tindemans, and M. . T. J. Spaan. “WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, May 2021, pp. 10639-46, doi:10.1609/aaai.v35i12.17272.