[1]
H. Li, H. Zhou, Y. Zou, D. Yu, and T. Lan, “ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning”, AAAI, vol. 38, no. 16, pp. 17461–17468, Mar. 2024.