[1]

Y. Chandak, G. Theocharous, B. Metevier, and P. Thomas, “Reinforcement Learning When All Actions Are Not Always Available,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, pp. 3381–3388, Apr. 2020.