Gong, Ze, Akshat Kumar, and Pradeep Varakantham. 2025. “Offline Safe Reinforcement Learning Using Trajectory Classification”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (16):16880-87. https://doi.org/10.1609/aaai.v39i16.33855.