What Can We Learn Even from the Weakest? Learning Sketches for Programmatic Strategies

Authors

  • Leandro C. Medeiros Universidade Federal de Viçosa
  • David S. Aleixo Universidade Federal de Viçosa
  • Levi H. S. Lelis University of Alberta

DOI:

https://doi.org/10.1609/aaai.v36i7.20744

Keywords:

Machine Learning (ML), Search And Optimization (SO)

Abstract

In this paper we show that behavioral cloning can be used to learn effective sketches of programmatic strategies. We show that even the sketches learned by cloning the behavior of weak players can help the synthesis of programmatic strategies. This is because even weak players can provide helpful information, e.g., that a player must choose an action in their turn of the game. If behavioral cloning is not employed, the synthesizer needs to learn even the most basic information by playing the game, which can be computationally expensive. We demonstrate empirically the advantages of our sketch-learning approach with simulated annealing and UCT synthesizers. We evaluate our synthesizers in the games of Can't Stop and MicroRTS. The sketch-based synthesizers are able to learn stronger programmatic strategies than their original counterparts. Our synthesizers generate strategies of Can't Stop that defeat a traditional programmatic strategy for the game. They also synthesize strategies that defeat the best performing method from the latest MicroRTS competition.

Downloads

Published

2022-06-28

How to Cite

Medeiros, L. C., Aleixo, D. S., & Lelis, L. H. S. (2022). What Can We Learn Even from the Weakest? Learning Sketches for Programmatic Strategies. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 7761-7769. https://doi.org/10.1609/aaai.v36i7.20744

Issue

Section

AAAI Technical Track on Machine Learning II