Lower Bounding Klondike Solitaire with Monte-Carlo Planning

Authors

  • Ronald Bjarnason Oregon State University
  • Alan Fern Oregon State University
  • Prasad Tadepalli Oregon State University

DOI:

https://doi.org/10.1609/icaps.v19i1.13363

Keywords:

Probabilistic Planning, Monte-Carlo, UCT, Games, Solitaire

Abstract

Despite its ubiquitous presence, very little is known about the odds of winning the simple card game of Klondike Solitaire. The main goal of this paper is to investigate the use of probabilistic planning to shed light on this issue. Unfortunatley, most probabilistic planning techniques are not well suited for Klondike due to the difficulties of representing the domain in standard planning languages and the complexity of the required search. Klondike thus serves as an interesting addition to the complement of probabilistic planning domains. In this paper, we study Klondike using several sampling-based planning approaches including UCT, hindsight optimization, and sparse sampling, and establish lower bounds on their performance. We also introduce novel combinations of these approaches and evaluate them in Klondike. We provide a theoretical bound on the sample complexity of a method that naturally combines sparse sampling and UCT. Our results demonstrate that there is a policy that within tight confidence intervals wins over 35% of Klondike games. This result is the first reported lower bound of an optimal Klondike policy.

Downloads

Published

2009-10-16

How to Cite

Bjarnason, R., Fern, A., & Tadepalli, P. (2009). Lower Bounding Klondike Solitaire with Monte-Carlo Planning. Proceedings of the International Conference on Automated Planning and Scheduling, 19(1), 26-33. https://doi.org/10.1609/icaps.v19i1.13363