Revisiting Risk-Sensitive MDPs: New Algorithms and Results

Ping Hou; William Yeoh; Pradeep Varakantham

doi:10.1609/icaps.v24i1.13615

Revisiting Risk-Sensitive MDPs: New Algorithms and Results

Authors

Ping Hou New Mexico State University
William Yeoh New Mexico State University
Pradeep Varakantham Singapore Management University

DOI:

https://doi.org/10.1609/icaps.v24i1.13615

Abstract

While Markov Decision Processes (MDPs) have been shown to be effective models for planning under uncertainty, the objective to minimize the expected cumulative cost is inappropriate for high-stake planning problems. As such, Yu, Lin, and Yan (1998) introduced the Risk-Sensitive MDP (RS-MDP) model, where the objective is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, we revisit this problem and introduce new algorithms that are based on classical techniques, such as depth-first search and dynamic programming, and a recently introduced technique called Topological Value Iteration (TVI). We demonstrate the applicability of our approach on randomly generated MDPs as well as domains from the ICAPS 2011 International Probabilistic Planning Competition (IPPC).

Downloads

Published

2014-05-10

How to Cite

Hou, P., Yeoh, W., & Varakantham, P. (2014). Revisiting Risk-Sensitive MDPs: New Algorithms and Results. Proceedings of the International Conference on Automated Planning and Scheduling, 24(1), 136–144. https://doi.org/10.1609/icaps.v24i1.13615

Download Citation

Issue

Vol. 24 (2014): Twenty-Fourth International Conference on Automated Planning and Scheduling

Section

Full Technical Papers

Revisiting Risk-Sensitive MDPs: New Algorithms and Results

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information