(1)
Zhang, J.; Bedi, A. S.; Wang, M.; Koppel, A. Multi-Agent Reinforcement Learning With General Utilities via Decentralized Shadow Reward Actor-Critic. AAAI 2022, 36, 9031-9039.