Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games

Karel Horák; Branislav Bošanský; Michal Pěchouček

doi:10.1609/aaai.v31i1.10597

Authors

Karel Horák, Czech Technical University in Prague
Branislav Bošanský, Czech Technical University in Prague
Michal Pěchouček, Czech Technical University in Prague

DOI:

https://doi.org/10.1609/aaai.v31i1.10597

Abstract

Security problems can be modeled as two-player partially observable stochastic games with one-sided partial observability and infinite horizon (one-sided POSGs). We seek optimal strategies of player 1 that correspond to robust strategies against the worst-case opponent (player 2), who is assumed to have perfect information about the game. We present a novel algorithm for approximately solving one-sided POSGs based on heuristic search value iteration (HSVI) for POMDPs. Our results include (1) theoretical properties of one-sided POSGs and their value functions, (2) guarantees showing the convergence of our algorithm to optimal strategies, and (3) a practical demonstration of the applicability and scalability of our algorithm on three different domains: pursuit-evasion, patrolling, and search games.

Published

2017-02-10

How to Cite

Horák, K., Bošanský, B., & Pěchouček, M. (2017). Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10597

Issue

Vol. 31 No. 1 (2017): Thirty-First AAAI Conference on Artificial Intelligence

Section

AAAI Technical Track: Game Theory and Economic Paradigms
