Chandak, Yash, Georgios Theocharous, Blossom Metevier, and Philip Thomas. “Reinforcement Learning When All Actions Are Not Always Available”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 3381-3388. Accessed April 18, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/5740.