High-Quality Policies for the Canadian Traveler's Problem

Authors

  • Patrick Eyerich Albert-Ludwigs-Universität Freiburg
  • Thomas Keller Albert-Ludwigs-Universität Freiburg
  • Malte Helmert Albert-Ludwigs-Universität Freiburg

DOI:

https://doi.org/10.1609/aaai.v24i1.7542

Keywords:

Canadian Traveler's Problem, UCT, hindsight optimization

Abstract

We consider the stochastic variant of the Canadian Traveler's Problem, a path planning problem where adverse weather can cause some roads to be untraversable. The agent does not initially know which roads can be used. However, it knows a probability distribution for the weather, and it can observe the status of roads incident to its location. The objective is to find a policy with low expected travel cost.

We introduce and compare several algorithms for the stochastic CTP. Unlike the optimistic approach most commonly considered in the literature, the new approaches we propose take uncertainty into account explicitly. We show that this property enables them to generate policies of much higher quality than the optimistic one, both theoretically and experimentally.

Downloads

Published

2010-07-03

How to Cite

Eyerich, P., Keller, T., & Helmert, M. (2010). High-Quality Policies for the Canadian Traveler’s Problem. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 51-58. https://doi.org/10.1609/aaai.v24i1.7542

Issue

Section

Constraints, Satisfiability, and Search