ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search

Shangtong Zhang; Hengshuai Yao

doi:10.1609/aaai.v33i01.33015789

Authors

Shangtong Zhang, University of Alberta
Hengshuai Yao, Huawei Technologies

DOI:

https://doi.org/10.1609/aaai.v33i01.33015789

Abstract

In this paper, we propose an actor ensemble algorithm, named ACE, for continuous control with a deterministic policy in reinforcement learning. In ACE, we use an actor ensemble (i.e., multiple actors) to search for the global maxima of the critic. Besides the ensemble perspective, we also formulate ACE in the option framework by extending the option-critic architecture with deterministic intra-option policies, revealing a relationship between ensembles and options. Furthermore, we perform a look-ahead tree search with those actors and a learned value prediction model, yielding a refined value estimate. We demonstrate a significant performance boost of ACE over DDPG and its variants in challenging physical robot simulators.
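
The ensemble idea in the abstract can be illustrated with a minimal sketch: each actor proposes one action for the current state, and the critic's value decides which proposal is used. This is only an illustration under stated assumptions, not the authors' implementation. The names select_action, toy_critic, and toy_actors are hypothetical, the critic is a hand-written function with two local maxima rather than a learned network, and the tree search and value prediction model are left out.

```python
from typing import Callable, List, Sequence, Tuple

State = Sequence[float]
Action = List[float]
Actor = Callable[[State], Action]          # deterministic policy: state -> action
Critic = Callable[[State, Action], float]  # action-value estimate Q(s, a)


def select_action(state: State, actors: Sequence[Actor], critic: Critic) -> Tuple[Action, int]:
    """Return the actor proposal with the highest critic value, and that actor's index."""
    proposals = [actor(state) for actor in actors]
    values = [critic(state, action) for action in proposals]
    best = max(range(len(proposals)), key=values.__getitem__)
    return proposals[best], best


def toy_critic(state: State, action: Action) -> float:
    """Hypothetical critic over a 1-D action with two local maxima.

    One peak sits at a = -1 (height 1), the other at a = 2 (height 2).
    """
    a = action[0]
    return max(1.0 - (a + 1.0) ** 2, 2.0 - (a - 2.0) ** 2)


# Three hypothetical actors that have settled near different regions of the action space.
toy_actors: List[Actor] = [
    lambda s: [-0.9],  # near the lower peak
    lambda s: [1.8],   # near the higher peak
    lambda s: [0.0],   # between the peaks
]

action, index = select_action([0.0], toy_actors, toy_critic)
print(action, index)
```

A single actor that converged near a = -1 would remain at the lower peak, whereas comparing several proposals under the critic lets the agent pick the one near the higher peak. That is the sense in which multiple actors help search for the critic's global maxima.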

Published

2019-07-17

How to Cite

Zhang, S., & Yao, H. (2019). ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 5789–5796. https://doi.org/10.1609/aaai.v33i01.33015789

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

AAAI Technical Track: Machine Learning
