Advancing Protein Design via Multi-Agent Reinforcement Learning with Pareto-Based Collaborative Optimization
DOI:
https://doi.org/10.1609/aaai.v40i2.37142Abstract
Protein design is revolutionizing biotechnology, yet existing approaches struggle to balance structural foldability with functional performance. Structure-based models excel at generating stable protein backbones but often overlook critical functional properties, while protein language models capture evolutionary and functional signals but frequently predict sequences lacking structural stability. Integrating these complementary approaches remains challenging due to their inherently conflicting objectives. We present MAProt, a multi-agent framework that synergistically combines structure-based and protein language model-based methods for protein design. Each agent specializes in a distinct aspect of the design objective: the structure-based agent (e.g., ProteinMPNN) ensures compatibility with the target backbone, while protein language model-based agents (e.g., ESM, SaProt) capture evolutionary plausibility and functional potential. To reconcile conflicts and achieve optimal trade-offs, we introduce a Pareto-based negotiation module that enables effective multi-objective coordination and consensus among agents. Extensive experiments on benchmark datasets demonstrate that MAProt achieves a remarkable improvement over state-of-the-art baselines, and generalizes robustly across a range of tasks, including thermodynamic folding stability design, functional protein design, and high-affinity antibody design. These results highlight the power of collaborative optimization for advancing rational protein engineering.Downloads
Published
2026-03-14
How to Cite
Zhu, M., Rao, J., Chen, X., Yuan, Q., & Yang, Y. (2026). Advancing Protein Design via Multi-Agent Reinforcement Learning with Pareto-Based Collaborative Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 40(2), 1650-1658. https://doi.org/10.1609/aaai.v40i2.37142
Issue
Section
AAAI Technical Track on Application Domains II