PLEASE: Palm Leaf Search for POMDPs with Large Observation Spaces

Authors

  • Zongzhang Zhang Soochow University
  • David Hsu National University of Singapore
  • Wee Sun Lee National University of Singapore
  • Zhan Wei Lim National University of Singapore
  • Aijun Bai University of Science and Technology of China

DOI:

https://doi.org/10.1609/socs.v6i1.18339

Keywords:

POMDPs, Large Observation Space, Point-based Value Iteration, Heuristics, Efficiency, Palm Leaf Search, Observation Selection

Abstract

This paper provides a novel POMDP planning method, called Palm LEAf SEarch (PLEASE), which allows the selection of more than one outcome when their potential impacts are close to the highest one during its forward exploration. Compared with existing trial-based algorithms, PLEASE can save considerable time to propagate the bound improvements of beliefs in deep levels of the search tree to the root belief because of fewer backup operations. Experiments showed that PLEASE scales up SARSOP, one of the fastest algorithms, by orders of magnitude on some POMDP tasks with large observation spaces.

Downloads

Published

2021-09-01