Solving Large Extensive-Form Games with Strategy Constraints

Trevor Davis; Kevin Waugh; Michael Bowling

doi:10.1609/aaai.v33i01.33011861

Authors

Trevor Davis University of Alberta
Kevin Waugh DeepMind
Michael Bowling University of Alberta

DOI:

https://doi.org/10.1609/aaai.v33i01.33011861

Abstract

Extensive-form games are a common model for multiagent interactions with imperfect information. In two-player zerosum games, the typical solution concept is a Nash equilibrium over the unconstrained strategy set for each player. In many situations, however, we would like to constrain the set of possible strategies. For example, constraints are a natural way to model limited resources, risk mitigation, safety, consistency with past observations of behavior, or other secondary objectives for an agent. In small games, optimal strategies under linear constraints can be found by solving a linear program; however, state-of-the-art algorithms for solving large games cannot handle general constraints. In this work we introduce a generalized form of Counterfactual Regret Minimization that provably finds optimal strategies under any feasible set of convex constraints. We demonstrate the effectiveness of our algorithm for finding strategies that mitigate risk in security games, and for opponent modeling in poker games when given only partial observations of private information.

Solving Large Extensive-Form Games with Strategy Constraints

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription