Hadar, Gal, Forest Agostinelli, and Shahaf S. Shperberg. 2026. “Beyond Single-Step Updates: Reinforcement Learning of Heuristics With Limited-Horizon Search”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (43):36955-63. https://doi.org/10.1609/aaai.v40i43.41023.