Solving Risk-Sensitive POMDPs With and Without Cost Observations

Ping Hou; William Yeoh; Pradeep Varakantham

doi:10.1609/aaai.v30i1.10402

Solving Risk-Sensitive POMDPs With and Without Cost Observations

Authors

Ping Hou New Mexico State University
William Yeoh New Mexico State University
Pradeep Varakantham Singapore Management University

DOI:

https://doi.org/10.1609/aaai.v30i1.10402

Keywords:

POMDP, Risk-Sensitive, Utility Theory

Abstract

Partially Observable Markov Decision Processes (POMDPs) are often used to model planning problems under uncertainty. The goal in Risk-Sensitive POMDPs (RS-POMDPs) is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, unlike existing POMDP literature, we distinguish between the two cases of whether costs can or cannot be observed and show the empirical impact of cost observations. We also introduce a new search-based algorithm to solve RS-POMDPs and show that it is faster and more scalable than existing approaches in two synthetic domains and a taxi domain generated with real-world data.

Downloads

Published

2016-03-05

How to Cite

Hou, P., Yeoh, W., & Varakantham, P. (2016). Solving Risk-Sensitive POMDPs With and Without Cost Observations. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10402

Download Citation

Issue

Vol. 30 No. 1 (2016): Thirtieth AAAI Conference on Artificial Intelligence

Section

Technical Papers: Planning and Scheduling

Solving Risk-Sensitive POMDPs With and Without Cost Observations

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription