Lower Bounding Klondike Solitaire with Monte-Carlo Planning

Ronald Bjarnason; Alan Fern; Prasad Tadepalli

doi:10.1609/icaps.v19i1.13363

Authors

Ronald Bjarnason Oregon State University
Alan Fern Oregon State University
Prasad Tadepalli Oregon State University

DOI:

https://doi.org/10.1609/icaps.v19i1.13363

Keywords:

Probabilistic Planning, Monte-Carlo, UCT, Games, Solitaire

Abstract

Despite its ubiquitous presence, very little is known about the odds of winning the simple card game of Klondike Solitaire. The main goal of this paper is to investigate the use of probabilistic planning to shed light on this issue. Unfortunatley, most probabilistic planning techniques are not well suited for Klondike due to the difficulties of representing the domain in standard planning languages and the complexity of the required search. Klondike thus serves as an interesting addition to the complement of probabilistic planning domains. In this paper, we study Klondike using several sampling-based planning approaches including UCT, hindsight optimization, and sparse sampling, and establish lower bounds on their performance. We also introduce novel combinations of these approaches and evaluate them in Klondike. We provide a theoretical bound on the sample complexity of a method that naturally combines sparse sampling and UCT. Our results demonstrate that there is a policy that within tight confidence intervals wins over 35% of Klondike games. This result is the first reported lower bound of an optimal Klondike policy.

Lower Bounding Klondike Solitaire with Monte-Carlo Planning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information