Learning to Prune Dominated Action Sequences in Online Black-Box Planning

Authors

  • Yuu Jinnai The University of Tokyo
  • Alex Fukunaga The University of Tokyo

DOI:

https://doi.org/10.1609/aaai.v31i1.10663

Keywords:

Black-box Planning, Online Search, Arcade Learning Environment

Abstract

Black-box domains where the successor states generated by applying an action are generated by a completely opaque simulator pose a challenge for domain-independent planning. The main computational bottleneck in search-based planning for such domains is the number of calls to the black-box simulation. We propose a method for significantly reducing the number of calls to the simulator by the search algorithm by detecting and pruning sequences of actions which are dominated by others. We apply our pruning method to Iterated Width and breadth-first search in domain-independent black-box planning for Atari 2600 games in the Arcade Learning Environment (ALE), adding our pruning method significantly improves upon the baseline algorithms.

Downloads

Published

2017-02-12

How to Cite

Jinnai, Y., & Fukunaga, A. (2017). Learning to Prune Dominated Action Sequences in Online Black-Box Planning. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10663

Issue

Section

AAAI Technical Track: Heuristic Search and Optimization