Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation
DOI: https://doi.org/10.1609/aaai.v37i10.26427
Keywords: PRS: Planning With Markov Models (MDPs, POMDPs), MAS: Multiagent Planning
Abstract
Restless multi-armed bandits (RMABs) are an important model for optimizing the allocation of limited resources in sequential decision-making settings. Typical RMABs assume the budget, i.e., the number of arms pulled, is fixed at each step of the planning horizon. However, in realistic real-world planning, resources are not necessarily limited at each planning step; surplus resources from one round may be redistributed to an earlier or later round. In real-world planning settings, this budget flexibility is often constrained to a subset of consecutive planning steps, e.g., weekly planning of a monthly budget. In this paper we define a general class of RMABs with flexible budgets, which we term F-RMABs, and provide an algorithm to solve them optimally. We derive a min-max formulation for finding optimal F-RMAB policies and leverage gradient primal-dual algorithms to compute reward-maximizing policies under flexible budgets. We introduce a scheme for sampling expected gradients that makes primal-dual algorithms applicable to the F-RMAB setting, rendering an otherwise computationally expensive approach tractable. Additionally, we provide heuristics that trade off solution quality for efficiency and present experimental comparisons of different F-RMAB solution approaches.
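To make the min-max idea concrete, the following is a minimal toy sketch, not the paper's actual algorithm: a projected gradient primal-dual loop that does ascent on per-step arm-pull probabilities against a concave surrogate reward and descent on one dual multiplier per flexible-budget window, so that steps within a window can borrow budget from one another. All quantities here (reward weights r, window length W, step sizes) are illustrative assumptions.

    # Toy primal-dual sketch of a flexible-budget allocation (illustrative only).
    # Maximize a concave surrogate reward over pull probabilities x, subject to
    # a budget that is flexible within each window of W consecutive steps.
    import numpy as np

    rng = np.random.default_rng(0)
    N, H, W, B = 5, 8, 4, 2          # arms, horizon, window length, per-step budget
    r = rng.uniform(0.5, 1.5, N)     # assumed per-arm reward weights

    x = np.full((H, N), 0.5)         # primal: pull probabilities per step and arm
    lam = np.zeros(H // W)           # dual: one multiplier per flexible-budget window

    eta_x, eta_lam = 0.05, 0.05
    for _ in range(2000):
        # Lagrangian: sum_t r . log(1 + x_t) + sum_w lam_w * (W*B - pulls in window w)
        grad_x = r / (1.0 + x)                       # gradient of surrogate reward
        grad_x -= np.repeat(lam, W)[:, None]         # budget price within each window
        x = np.clip(x + eta_x * grad_x, 0.0, 1.0)    # projected primal ascent

        window_use = x.reshape(-1, W, N).sum(axis=(1, 2))            # pulls per window
        lam = np.maximum(lam + eta_lam * (window_use - W * B), 0.0)  # projected dual step

    print("per-window usage (target <= %d):" % (W * B), np.round(window_use, 2))

At convergence the multiplier lam acts as a per-window shadow price on the budget, which is the intuition behind solving the min-max formulation with alternating primal and dual gradient steps.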
Published
2023-06-26
How to Cite
Rodriguez Diaz, P., Killian, J. A., Xu, L., Suggala, A. S., Taneja, A., & Tambe, M. (2023). Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(10), 12103-12111. https://doi.org/10.1609/aaai.v37i10.26427
Issue
Vol. 37 No. 10 (2023)
Section
AAAI Technical Track on Planning, Routing, and Scheduling