Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits

Authors

  • Biyonka Liang Harvard University
  • Lily Xu Harvard University
  • Aparna Taneja Google Research
  • Milind Tambe Harvard University Google Research
  • Lucas Janson Harvard University

DOI:

https://doi.org/10.1609/aaai.v39i27.35039

Abstract

Public health programs often provide interventions to encourage program adherence, and effectively allocating interventions is vital for producing the greatest overall health outcomes, especially in underserved communities where resources are limited. Such resource allocation problems are often modeled as restless multi-armed bandits (RMABs) with unknown underlying transition dynamics, hence requiring online reinforcement learning (RL). We present Bayesian Learning for Contextual RMABs (BCoR), an online RL approach for RMABs that novelly combines techniques in Bayesian modeling with Thompson sampling to flexibly model the complex RMAB settings present in public health program adherence problems, namely context and non-stationarity. BCoR's key strength is the ability to leverage shared information within and between arms to learn the unknown RMAB transition dynamics quickly in intervention-scarce settings with relatively short time horizons, which is common in public health applications. Empirically, BCoR achieves substantially higher finite-sample performance over a range of experimental settings, including a setting using real-world adherence data that was developed in collaboration with ARMMAN, an NGO in India which runs a large-scale maternal mHealth program, showcasing BCoR practical utility and potential for real-world deployment.

Downloads

Published

2025-04-11

How to Cite

Liang, B., Xu, L., Taneja, A., Tambe, M., & Janson, L. (2025). Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits. Proceedings of the AAAI Conference on Artificial Intelligence, 39(27), 28195–28203. https://doi.org/10.1609/aaai.v39i27.35039