Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics

Gheorghe Comanici; Doina Precup

doi:10.1609/aaai.v25i1.7918

Authors

Gheorghe Comanici, McGill University
Doina Precup, McGill University

DOI: https://doi.org/10.1609/aaai.v25i1.7918

Abstract

We study the problem of automatically generating features for function approximation in reinforcement learning. We build on the work of Mahadevan and his colleagues, who pioneered the use of spectral clustering methods for basis function construction. Their methods work on top of a graph that captures state adjacency. Instead, we use bisimulation metrics in order to provide state distances for spectral clustering. The advantage of these metrics is that they incorporate reward information in a natural way, in addition to the state transition information. We provide theoretical bounds on the quality of the obtained approximation, which justify the importance of incorporating reward information. We also demonstrate empirically that the approximation quality improves when bisimulation metrics are used instead of the state adjacency graph in the basis function construction process.

Published

2011-08-04

How to Cite

Comanici, G., & Precup, D. (2011). Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 325-330. https://doi.org/10.1609/aaai.v25i1.7918

Issue

Vol. 25 No. 1 (2011): Twenty-Fifth AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Machine Learning
