RASO: Role-Aware Shared Reflection for Multi-Agent Orchestration in E-Commerce Long-Horizon Planning

Authors

  • Kangjia Niu School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
  • Yanning Zhang Alibaba Group
  • Xiuchong Wang Alibaba Group
  • Chennan Ma Alibaba Group
  • Siqi Hong Alibaba Group
  • Hankz Hankui Zhuo State Key Laboratory of Novel Software Technology, Nanjing University, Nanjing 210023, China School of Artificial Intelligence, Nanjing University, Nanjing 210023, China
  • Junxiong Zhu Alibaba Group
  • Bo Zheng Alibaba Group

DOI:

https://doi.org/10.1609/icaps.v36i1.42873

Abstract

E-commerce platforms increasingly deploy automated product management systems to help sellers maximize long-term profitability through multi-day planning and scheduling of operational actions. Despite promising results in long-horizon e-commerce planning, existing LLM-based multi-agent systems with self-reflection suffer from ambiguous credit assignment across agents and shallow reflection that fails to diagnose root causes of poor operational outcomes in practice. We propose RASO, a novel multi-agent framework that enhances long-term planning via role-aware and rule-based shared reflection. RASO addresses these challenges through a hybrid reward mechanism that combines global and role-specific counterfactual rewards to enable precise credit attribution across functionally distinct agents, as well as a rule-decision paradigm that requires agents to formalize their reasoning into auditable, structured rules prior to action execution to support logic-level error diagnosis during reflection. Evaluated on a real-world e-commerce platform over extended planning horizons, RASO significantly outperforms baselines in cumulative profit with transparent and interpretable decision processes. Our results demonstrate that integrating role-aware collaboration with structured reflection empowers LLM agents to effectively manage complex, long-term business objectives.

Downloads

Published

2026-06-08

How to Cite

Niu, K., Zhang, Y., Wang, X., Ma, C., Hong, S., Zhuo, H. H., … Zheng, B. (2026). RASO: Role-Aware Shared Reflection for Multi-Agent Orchestration in E-Commerce Long-Horizon Planning. Proceedings of the International Conference on Automated Planning and Scheduling, 36(1), 557–561. https://doi.org/10.1609/icaps.v36i1.42873