Linear Fitted-Q Iteration with Multiple Reward Functions

Daniel Lizotte; Michael Bowling; Susan Murphy

doi:10.1609/icaps.v23i1.13579

Linear Fitted-Q Iteration with Multiple Reward Functions

Authors

Daniel Lizotte University of Waterloo
Michael Bowling University of Alberta
Susan Murphy University of Michigan

DOI:

https://doi.org/10.1609/icaps.v23i1.13579

Keywords:

reinforcement learning, dynamic programming, decision making, linear regression, preference elicitation

Abstract

We present a general and detailed development of an algorithm for finite-horizon fitted-Q iteration with an arbitrary number of reward signals and linear value function approximation using an arbitrary number of state features. This includes a detailed treatment of the 3-reward function case using triangulation primitives from computational geometry and a method for identifying globally dominated actions. We also present an example of how our methods can be used to construct a real-world decision aid by considering symptom reduction, weight gain, and quality of life in sequential treatments for schizophrenia. Finally, we discuss future directions in which to take this work that will further enable our methods to make a positive impact on the field of evidence-based clinical decision support.

Downloads

Published

2013-06-02

How to Cite

Lizotte, D., Bowling, M., & Murphy, S. (2013). Linear Fitted-Q Iteration with Multiple Reward Functions. Proceedings of the International Conference on Automated Planning and Scheduling, 23(1), 474-475. https://doi.org/10.1609/icaps.v23i1.13579

Download Citation

Issue

Vol. 23 (2013): Twenty-Third International Conference on Automated Planning and Scheduling

Section

Journal Presentation Track

Linear Fitted-Q Iteration with Multiple Reward Functions

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information