Liu, Yu, Yanbing Liu, Fangfang Yuan, Cong Cao, Youbang Sun, Kun Peng, WeiZhuo Chen, Jianjun Li, and Zhiyuan Ma. 2026. “OPERA: A Reinforcement Learning–Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (38):32258-66. https://doi.org/10.1609/aaai.v40i38.40499.