[1]
D. Steckelmacher, D. Roijers, A. Harutyunyan, P. Vrancx, H. Plisnier, and A. Nowé, “Reinforcement Learning in POMDPs With Memoryless Options and Option-Observation Initiation Sets”, AAAI, vol. 32, no. 1, Apr. 2018.