Chen, Gongpu, Soung Chang Liew, and Deniz Gündüz. 2026. “GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-Armed Bandits”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (24):20032-40. https://doi.org/10.1609/aaai.v40i24.39088.