(1)
Cheng, R.; Orosz, G.; Murray, R. M.; Burdick, J. W. End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks. AAAI 2019, 33, 3387-3395.