Robust Adaptive Multi-Step Predictive Shielding (Student Abstract)

Authors

  • Tanmay Ambadkar The Pennsylvania State University, University Park, PA
  • Darshan Chudiwal The Pennsylvania State University, University Park, PA
  • Greg Anderson Reed College, Portland, OR
  • Abhinav Verma The Pennsylvania State University, University Park, PA

DOI:

https://doi.org/10.1609/aaai.v40i48.42184

Abstract

Ensuring safety in deep reinforcement learning is challenging, as formal methods that provide strong guarantees often fail to scale to complex, high-dimensional systems. We introduce RAMPS, a scalable shielding framework that pairs a general-purpose, learned linear dynamics model with a robust, multi-step Control Barrier Function (CBF) for real-time safety interventions. Experiments show RAMPS significantly reduces safety violations in high-dimensional environments compared to state-of-the-art methods, without sacrificing task performance.

Published

2026-03-14

How to Cite

Ambadkar, T., Chudiwal, D., Anderson, G., & Verma, A. (2026). Robust Adaptive Multi-Step Predictive Shielding (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41121–41123. https://doi.org/10.1609/aaai.v40i48.42184