[1]
G. Tennenholtz, U. Shalit, and S. Mannor, “Off-Policy Evaluation in Partially Observable Environments”, AAAI, vol. 34, no. 06, pp. 10276-10283, Apr. 2020.