[1]
Q. Yang, T. D. Simão, S. H. Tindemans, and M. T. J. Spaan, “WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning”, AAAI, vol. 35, no. 12, pp. 10639-10646, May 2021.