Representation Discovery for MDPs Using Bisimulation Metrics

Sherry Ruan; Gheorghe Comanici; Prakash Panangaden; Doina Precup

doi:10.1609/aaai.v29i1.9701

Representation Discovery for MDPs Using Bisimulation Metrics

Authors

Sherry Ruan McGill University
Gheorghe Comanici McGill University
Prakash Panangaden McGill University
Doina Precup McGill University

DOI:

https://doi.org/10.1609/aaai.v29i1.9701

Abstract

We provide a novel, flexible, iterative refinement algorithm to automatically construct an approximate statespace representation for Markov Decision Processes (MDPs). Our approach leverages bisimulation metrics, which have been used in prior work to generate features to represent the state space of MDPs. We address a drawback of this approach, which is the expensive computation of the bisimulation metrics. We propose an algorithm to generate an iteratively improving sequence of state space partitions. Partial metric computations guide the representation search and provide much lower space and computational complexity, while maintaining strong convergence properties. We provide theoretical results guaranteeing convergence as well as experimental illustrations of the accuracy and savings (in time and memory usage) of the new algorithm, compared to traditional bisimulation metric computation.

Downloads

Published

2015-03-04

How to Cite

Ruan, S., Comanici, G., Panangaden, P., & Precup, D. (2015). Representation Discovery for MDPs Using Bisimulation Metrics. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9701

Download Citation

Issue

Vol. 29 No. 1 (2015): Twenty-Ninth AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Reasoning under Uncertainty

Representation Discovery for MDPs Using Bisimulation Metrics

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information