Chandak, Yash, Georgios Theocharous, Blossom Metevier, and Philip Thomas. 2020. “Reinforcement Learning When All Actions Are Not Always Available”. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04):3381-88. https://doi.org/10.1609/aaai.v34i04.5740.