Hadar, G., Agostinelli, F., & Shperberg, S. S. (2026). Beyond Single-Step Updates: Reinforcement Learning of Heuristics with Limited-Horizon Search. Proceedings of the AAAI Conference on Artificial Intelligence, 40(43), 36955–36963. https://doi.org/10.1609/aaai.v40i43.41023