(1)
Yang, Q.; Simão, T. D.; Tindemans, S. H.; Spaan, M. T. J. WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning. AAAI 2021, 35, 10639-10646.