Hadar, G., Agostinelli, F. and Shperberg, S. S. (2026) “Beyond Single-Step Updates: Reinforcement Learning of Heuristics with Limited-Horizon Search”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(43), pp. 36955–36963. doi: 10.1609/aaai.v40i43.41023.