Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces

Zachary Sunberg; Mykel Kochenderfer

doi:10.1609/icaps.v28i1.13882

Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces

Authors

Zachary Sunberg Stanford University
Mykel Kochenderfer Stanford University

DOI:

https://doi.org/10.1609/icaps.v28i1.13882

Keywords:

POMDPs, Monte Carlo Tree Search, Particle Filters

Abstract

Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representations in the search tree collapse to a single particle causing the algorithm to converge to a policy that is suboptimal regardless of the computation time. This paper proposes and evaluates two new algorithms, POMCPOW and PFT-DPW, that overcome this deficiency by using weighted particle filtering. Simulation results show that these modifications allow the algorithms to be successful where previous approaches fail.

Downloads

Published

2018-06-15

How to Cite

Sunberg, Z., & Kochenderfer, M. (2018). Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces. Proceedings of the International Conference on Automated Planning and Scheduling, 28(1), 259-263. https://doi.org/10.1609/icaps.v28i1.13882

Download Citation

Issue

Vol. 28 (2018): Twenty-Eighth International Conference on Automated Planning and Scheduling

Section

Main Track

Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information