Constrained Risk-Averse Markov Decision Processes

Mohamadreza Ahmadi; Ugo Rosolia; Michel D. Ingham; Richard M. Murray; Aaron D. Ames

doi:10.1609/aaai.v35i13.17393

Authors

Mohamadreza Ahmadi California Institute of Technology, 1200 E. California Blvd., Pasadena, CA 91125
Ugo Rosolia California Institute of Technology, 1200 E. California Blvd., Pasadena, CA 91125
Michel D. Ingham NASA Jet Propulsion Laboratory, 4800 Oak Grove Dr, Pasadena, CA 91109
Richard M. Murray California Institute of Technology, 1200 E. California Blvd., Pasadena, CA 91125
Aaron D. Ames California Institute of Technology, 1200 E. California Blvd., Pasadena, CA 91125

DOI:

https://doi.org/10.1609/aaai.v35i13.17393

Keywords:

Planning with Markov Models (MDPs, POMDPs)

Abstract

We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic coherent risk objectives and constraints. We begin by formulating the problem in a Lagrangian framework. Under the assumption that the risk objectives and constraints can be represented by a Markov risk transition mapping, we propose an optimization-based method to synthesize Markovian policies that lower-bound the constrained risk-averse problem. We demonstrate that the formulated optimization problems are in the form of difference convex programs (DCPs) and can be solved by the disciplined convex-concave programming (DCCP) framework. We show that these results generalize linear programs for constrained MDPs with total discounted expected costs and constraints. Finally, we illustrate the effectiveness of the proposed method with numerical experiments on a rover navigation problem involving conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) coherent risk measures.

Constrained Risk-Averse Markov Decision Processes

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription