Online Robust Planning Under Model Uncertainty: A Sample-Based Approach

Tamir Shazman; Idan Lev-Yehudi; Ron Benchetrit; Vadim Indelman

doi:10.1609/aaai.v40i43.40959

Authors

Tamir Shazman Faculty of Data and Decisions Sciences, Technion – Israel Institute of Technology
Idan Lev-Yehudi Technion Autonomous Systems Program (TASP), Technion - Israel Institute of Technology
Ron Benchetrit Computer Science Department, Technion - Israel Institute of Technology
Vadim Indelman Stephen B. Klein Faculty of Aerospace Engineering, Technion - Israel Institute of Technology Faculty of Data and Decision Sciences, Technion - Israel Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v40i43.40959

Abstract

Online planning in Markov Decision Processes (MDPs) enables agents to make sequential decisions by simulating future trajectories from the current state, making it well-suited for large-scale or dynamic environments. Sample-based methods such as Sparse Sampling and Monte Carlo Tree Search (MCTS) are widely adopted for their ability to approximate optimal actions using a generative model. However, in practical settings, the generative model is often learned from limited data, introducing approximation errors that can degrade performance or lead to unsafe behaviors. To address these challenges, Robust MDPs (RMDPs) offer a principled framework for planning under model uncertainty, yet existing approaches are typically computationally intensive and not suited for real-time use. In this work, we introduce Robust Sparse Sampling (RSS), the first online planning algorithm for RMDPs with finite-sample theoretical performance guarantees. Unlike Sparse Sampling, which estimates the nominal value function, RSS computes a robust value function by leveraging the efficiency and theoretical properties of Sample Average Approximation (SAA), enabling tractable robust policy computation in online settings. RSS is applicable to infinite or continuous state spaces, and its sample and computational complexities are independent of the state space size. We provide theoretical performance guarantees and empirically show that RSS outperforms standard Sparse Sampling in environments with uncertain dynamics.

Online Robust Planning Under Model Uncertainty: A Sample-Based Approach

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information