(1)
Wen, W.; Xue, C.; Pan, S.; Sun, Y.; Peng, M. Reinforcement Learning Enhanced Multi-Hop Reasoning for Temporal Knowledge Question Answering. AAAI 2026, 40, 33881-33889.