[1]
Yang, Q., Simão, T.D., Tindemans, S.H. and Spaan, M. T.J. 2021. WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 12 (May 2021), 10639-10646. DOI:https://doi.org/10.1609/aaai.v35i12.17272.