TY - JOUR AU - Peng, Binghui AU - Shen, Weiran AU - Tang, Pingzhong AU - Zuo, Song PY - 2019/07/17 Y2 - 2024/03/29 TI - Learning Optimal Strategies to Commit To JF - Proceedings of the AAAI Conference on Artificial Intelligence JA - AAAI VL - 33 IS - 01 SE - AAAI Technical Track: Game Theory and Economic Paradigms DO - 10.1609/aaai.v33i01.33012149 UR - https://ojs.aaai.org/index.php/AAAI/article/view/4047 SP - 2149-2156 AB - <p>Over the past decades, various theories and algorithms have been developed under the framework of Stackelberg games and part of these innovations have been fielded under the scenarios of national security defenses and wildlife protections. However, one of the remaining difficulties in the literature is that most of theoretical works assume full information of the payoff matrices, while in applications, the leader often has no prior knowledge about the follower’s payoff matrix, but may gain information about the follower’s utility function through repeated interactions. In this paper, we study the problem of learning the optimal leader strategy in Stackelberg (security) games and develop novel algorithms as well as new hardness results.</p> ER -