Liu, Y., Liu, Y., Yuan, F., Cao, C., Sun, Y., Peng, K., … Ma, Z. (2026). OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32258–32266. https://doi.org/10.1609/aaai.v40i38.40499