Yang, Q., Simão, T. D., Tindemans, S. H., & Spaan, M. . T. J. (2021). WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 10639-10646. https://doi.org/10.1609/aaai.v35i12.17272