Non-Parametric Approximate Linear Programming for MDPs

Jason Pazis; Ronald Parr

doi:10.1609/aaai.v25i1.7930

Non-Parametric Approximate Linear Programming for MDPs

Authors

Jason Pazis Duke University
Ronald Parr Duke University

DOI:

https://doi.org/10.1609/aaai.v25i1.7930

Abstract

The Approximate Linear Programming (ALP) approach to value function approximation for MDPs is a parametric value function approximation method, in that it represents the value function as a linear combination of features which are chosen a priori. Choosing these features can be a difficult challenge in itself. One recent effort, Regularized Approximate Linear Programming (RALP), uses L1 regularization to address this issue by combining a large initial set of features with a regularization penalty that favors a smooth value function with few non-zero weights. Rather than using smoothness as a backhanded way of addressing the feature selection problem, this paper starts with smoothness and develops a non-parametric approach to ALP that is consistent with the smoothness assumption. We show that this new approach has some favorable practical and analytical properties in comparison to (R)ALP.

Downloads

Published

2011-08-04

How to Cite

Pazis, J., & Parr, R. (2011). Non-Parametric Approximate Linear Programming for MDPs. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 459-464. https://doi.org/10.1609/aaai.v25i1.7930

Download Citation

Issue

Vol. 25 No. 1 (2011): Twenty-Fifth AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Machine Learning

Non-Parametric Approximate Linear Programming for MDPs

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription