Hadar, Gal, et al. “Beyond Single-Step Updates: Reinforcement Learning of Heuristics With Limited-Horizon Search.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 43, Mar. 2026, pp. 36955–63, doi:10.1609/aaai.v40i43.41023.