Chen, G., Liew, S. C. and Gündüz, D. (2026) “GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(24), pp. 20032-20040. doi: 10.1609/aaai.v40i24.39088.