Li, Huiqun, Hanhan Zhou, Yifei Zou, Dongxiao Yu, and Tian Lan. 2024. “ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (16): 17461-68. https://doi.org/10.1609/aaai.v38i16.29695.