Augmenting Markov Decision Processes with Advising

Loïs Vanhée; Laurent Jeanpierre; Abdel-Illah Mouaddib

doi:10.1609/aaai.v33i01.33012531

Authors

Loïs Vanhée Université de Caen
Laurent Jeanpierre Université de Caen
Abdel-Illah Mouaddib University of Caen Normandy

DOI:

https://doi.org/10.1609/aaai.v33i01.33012531

Abstract

This paper introduces Advice-MDPs, an expansion of Markov Decision Processes for generating policies that take into consideration advising on the desirability, undesirability, and prohibition of certain states and actions. AdviceMDPs enable the design of designing semi-autonomous systems (systems that require operator support for at least handling certain situations) that can efficiently handle unexpected complex environments. Operators, through advising, can augment the planning model for covering unexpected real-world irregularities. This advising can swiftly augment the degree of autonomy of the system, so it can work without subsequent human intervention.

This paper details the Advice-MDP formalism, a fast AdviceMDP resolution algorithm, and its applicability for real-world tasks, via the design of a professional-class semi-autonomous robot system ready to be deployed in a wide range of unexpected environments and capable of efficiently integrating operator advising.

Augmenting Markov Decision Processes with Advising

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information