POMCP with Human Preferences in Settlers of Catan

Mihai Dobre; Alex Lascarides

doi:10.1609/aiide.v14i1.13014

POMCP with Human Preferences in Settlers of Catan

Authors

Mihai Dobre The University of Edinburgh
Alex Lascarides The University of Edinburgh

DOI:

https://doi.org/10.1609/aiide.v14i1.13014

Keywords:

Settlers of Catan, complex games, human preferences, POMCP, partially observable monte carlo planning

Abstract

We present a suite of techniques for extending the Partially Observable Monte Carlo Planning algorithm to handle complex multi-agent games. We design the planning algorithm to exploit the inherent structure of the game. When game rules naturally cluster the actions into sets called types, these can be leveraged to extract characteristics and high-level strategies from a sparse corpus of human play. Another key insight is to account for action legality both when extracting policies from game play and when these are used to inform the forward sampling method. We evaluate our algorithm against other baselines and versus ablated versions of itself in the well-known board game Settlers of Catan.

Downloads

Published

2018-09-25

How to Cite

Dobre, M., & Lascarides, A. (2018). POMCP with Human Preferences in Settlers of Catan. Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 14(1), 17–23. https://doi.org/10.1609/aiide.v14i1.13014

Download Citation

Issue

Vol. 14 No. 1 (2018): Fourteenth Artificial Intelligence and Interactive Digital Entertainment Conference

Section

Full Oral Papers

POMCP with Human Preferences in Settlers of Catan

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information