(1) Gong, Z.; Kumar, A.; Varakantham, P. Offline Safe Reinforcement Learning Using Trajectory Classification. AAAI 2025, 39, 16880-16887.