To Max or Not to Max: Online Learning for Speeding Up Optimal Planning

Authors

  • Carmel Domshlak Technion
  • Erez Karpas Technion
  • Shaul Markovitch Technion

DOI:

https://doi.org/10.1609/aaai.v24i1.7741

Keywords:

search, optimal search, learning, planning

Abstract

It is well known that there cannot be a single "best" heuristic for optimal planning in general. One way of overcoming this is by combining admissible heuristics (e.g. by using their maximum), which requires computing numerous heuristic estimates at each state. However, there is a tradeoff between the time spent on computing these heuristic estimates for each state, and the time saved by reducing the number of expanded states. We present a novel method that reduces the cost of combining admissible heuristics for optimal search, while maintaining its benefits. Based on an idealized search space model, we formulate a decision rule for choosing the best heuristic to compute at each state. We then present an active online learning approach for that decision rule, and employ the learned model to decide which heuristic to compute at each state. We evaluate this technique empirically, and show that it substantially outperforms each of the individual heuristics that were used, as well as their regular maximum.

Downloads

Published

2010-07-04

How to Cite

Domshlak, C., Karpas, E., & Markovitch, S. (2010). To Max or Not to Max: Online Learning for Speeding Up Optimal Planning. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 1071-1076. https://doi.org/10.1609/aaai.v24i1.7741

Issue

Section

Reasoning about Plans, Processes and Actions