Hadar, Gal, Forest Agostinelli, and Shahaf S. Shperberg. “Beyond Single-Step Updates: Reinforcement Learning of Heuristics With Limited-Horizon Search”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 43 (March 14, 2026): 36955–36963. Accessed May 26, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/41023.