Lin, Y., Ouyang, X., Zhang, T., & Sui, K. (2026). RPM-MCTS: Knowledge-Retrieval as Process Reward Model with Monte Carlo Tree Search for Code Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32042–32050. https://doi.org/10.1609/aaai.v40i38.40475