1.
Udagawa T, Kiyohara H, Narita Y, Saito Y, Tateno K. Policy-Adaptive Estimator Selection for Off-Policy Evaluation. AAAI [Internet]. 2023 Jun. 26 [cited 2026 May 19];37(8):10025-33. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26195