Trial-Based Dynamic Programming for Multi-Agent Planning

Feng Wu; Shlomo Zilberstein; Xiaoping Chen

doi:10.1609/aaai.v24i1.7616

Trial-Based Dynamic Programming for Multi-Agent Planning

Authors

Feng Wu University of Science and Technology of China
Shlomo Zilberstein University of Massachusetts Amherst
Xiaoping Chen University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v24i1.7616

Keywords:

Multi-Agent Planning, Cooperation and Coordination, Decentralized POMDPs

Abstract

Trial-based approaches offer an efficient way to solve single-agent MDPs and POMDPs. These approaches allow agents to focus their computations on regions of the environment they encounter during the trials, leading to significant computational savings. We present a novel trial-based dynamic programming (TBDP) algorithm for DEC-POMDPs that extends these benefits to multi-agent settings. The algorithm uses trial-based methods for both belief generation and policy evaluation. Policy improvement is implemented efficiently using linear programming and a sub-policy reuse technique that helps bound the amount of memory. The results show that TBDP can produce significant value improvements and is much faster than the best existing planning algorithms.

Downloads

Published

2010-07-04

How to Cite

Wu, F., Zilberstein, S., & Chen, X. (2010). Trial-Based Dynamic Programming for Multi-Agent Planning. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 908-914. https://doi.org/10.1609/aaai.v24i1.7616

Download Citation

Issue

Vol. 24 No. 1 (2010): Twenty-Fourth AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Multiagent Systems

Trial-Based Dynamic Programming for Multi-Agent Planning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription