Solving Partially Observable Stochastic Games with Public Observations

Karel Horák; Branislav Bošanský

doi:10.1609/aaai.v33i01.33012029

Authors

Karel Horák Czech Technical University in Prague
Branislav Bošanský Czech Technical University in Prague

DOI:

https://doi.org/10.1609/aaai.v33i01.33012029

Abstract

In many real-world problems, there is a dynamic interaction between competitive agents. Partially observable stochastic games (POSGs) are among the most general formal models that capture such dynamic scenarios. The model captures stochastic events, partial information of players about the environment, and the scenario does not have a fixed horizon. Solving POSGs in the most general setting is intractable.Therefore, the research has been focused on subclasses of POSGs that have a value of the game and admit designing (approximate) optimal algorithms. We propose such a subclass for two-player zero-sum games with discounted-sum objective function—POSGs with public observations (POPOSGs)—where each player is able to reconstruct beliefs of the other player over the unobserved states. Our results include: (1) theoretical analysis of PO-POSGs and their value functions showing convexity (concavity) in beliefs of maximizing (minimizing) player, (2) a novel algorithm for approximating the value of the game, and (3) a practical demonstration of scalability of our algorithm. Experimental results show that our algorithm can closely approximate the value of non-trivial games with hundreds of states.

Solving Partially Observable Stochastic Games with Public Observations

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription