RASO: Role-Aware Shared Reflection for Multi-Agent Orchestration in E-Commerce Long-Horizon Planning
DOI:
https://doi.org/10.1609/icaps.v36i1.42873Abstract
E-commerce platforms increasingly deploy automated product management systems to help sellers maximize long-term profitability through multi-day planning and scheduling of operational actions. Despite promising results in long-horizon e-commerce planning, existing LLM-based multi-agent systems with self-reflection suffer from ambiguous credit assignment across agents and shallow reflection that fails to diagnose root causes of poor operational outcomes in practice. We propose RASO, a novel multi-agent framework that enhances long-term planning via role-aware and rule-based shared reflection. RASO addresses these challenges through a hybrid reward mechanism that combines global and role-specific counterfactual rewards to enable precise credit attribution across functionally distinct agents, as well as a rule-decision paradigm that requires agents to formalize their reasoning into auditable, structured rules prior to action execution to support logic-level error diagnosis during reflection. Evaluated on a real-world e-commerce platform over extended planning horizons, RASO significantly outperforms baselines in cumulative profit with transparent and interpretable decision processes. Our results demonstrate that integrating role-aware collaboration with structured reflection empowers LLM agents to effectively manage complex, long-term business objectives.Downloads
Published
2026-06-08
How to Cite
Niu, K., Zhang, Y., Wang, X., Ma, C., Hong, S., Zhuo, H. H., … Zheng, B. (2026). RASO: Role-Aware Shared Reflection for Multi-Agent Orchestration in E-Commerce Long-Horizon Planning. Proceedings of the International Conference on Automated Planning and Scheduling, 36(1), 557–561. https://doi.org/10.1609/icaps.v36i1.42873