Real-Time Symbolic Dynamic Programming

Luis Vianna; Leliane de Barros; Scott Sanner

doi:10.1609/aaai.v29i1.9651

Authors

Luis Vianna University of São Paulo
Leliane de Barros University of São Paulo
Scott Sanner NICTA and Australian National University

DOI:

https://doi.org/10.1609/aaai.v29i1.9651

Keywords:

Hybrid MDPs, Continuous Planning, Symbolic Dynamic Programming

Abstract

Recent advances in Symbolic Dynamic Programming (SDP) combined withthe extended algebraic decision diagram (XADD) have provided exactsolutions for expressive subclasses of finite-horizon Hybrid MarkovDecision Processes (HMDPs) with mixed continuous and discrete stateand action parameters. Unfortunately, SDP suffers from two majordrawbacks: (1) it solves for all states and can be intractable formany problems that inherently have large optimal XADD value functionrepresentations; and (2) it cannot maintain compact (pruned) XADDrepresentations for domains with nonlinear dynamics and reward due tothe need for nonlinear constraint checking. In this work, wesimultaneously address both of these problems by introducing real-timeSDP (RTSDP). RTSDP addresses (1) by focusing the solution and valuerepresentation only on regions reachable from a set of initial statesand RTSDP addresses (2) by using visited states as witnesses ofreachable regions to assist in pruning irrelevant or unreachable(nonlinear) regions of the value function. To this end, RTSDP enjoysprovable convergence over the set of initial states and substantialspace and time savings over SDP as we demonstrate in a variety of hybrid domains ranging from inventory to reservoir to traffic control.

Real-Time Symbolic Dynamic Programming

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information