Efficient Bounds in Heuristic Search Algorithms for Stochastic Shortest Path Problems

Eric Hansen; Ibrahim Abdoulahi

doi:10.1609/aaai.v29i1.9643

Efficient Bounds in Heuristic Search Algorithms for Stochastic Shortest Path Problems

Authors

Eric Hansen Mississippi State Univeristy
Ibrahim Abdoulahi Mississippi State University

DOI:

https://doi.org/10.1609/aaai.v29i1.9643

Abstract

Fully observable decision-theoretic planning problems are commonly modeled as stochastic shortest path (SSP) problems. For this class of planning problems, heuristic search algorithms (including LAO*, RTDP, and related algorithms), as well as the value iteration algorithm on which they are based, lack an efficient test for convergence to an ε-optimal policy (except in the special case of discounting). We introduce a simple and efficient test for convergence that applies to SSP problems with positive action costs. The test can detect whether a policy is proper, that is, whether it achieves the goal state with probability 1. If proper, it gives error bounds that can be used to detect convergence to an ε-optimal solution. The convergence test incurs no extra overhead besides computing the Bellman residual, and the performance guarantee it provides substantially improves the utility of this class of planning algorithms.

Downloads

Published

2015-03-04

How to Cite

Hansen, E., & Abdoulahi, I. (2015). Efficient Bounds in Heuristic Search Algorithms for Stochastic Shortest Path Problems. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9643

Download Citation

Issue

Vol. 29 No. 1 (2015): Twenty-Ninth AAAI Conference on Artificial Intelligence

Section

Main Track: Planning and Scheduling

Efficient Bounds in Heuristic Search Algorithms for Stochastic Shortest Path Problems

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription