Cross-Gate MLP with Protein Complex Invariant Embedding Is a One-Shot Antibody Designer

Authors

  • Cheng Tan Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Zhangyang Gao Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Lirong Wu Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Jun Xia Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Jiangbin Zheng Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Xihong Yang College of Computer, National University of Defense Technology
  • Yue Liu College of Computer, National University of Defense Technology
  • Bozhen Hu Zhejiang University AI Lab, Research Center for Industries of the Future, Westlake University
  • Stan Z. Li AI Lab, Research Center for Industries of the Future, Westlake University

DOI:

https://doi.org/10.1609/aaai.v38i14.29445

Keywords:

ML: Applications, ML: Bio-inspired Learning

Abstract

Antibodies are crucial proteins produced by the immune system in response to foreign substances or antigens. The specificity of an antibody is determined by its complementarity-determining regions (CDRs), which are located in the variable domains of the antibody chains and form the antigen-binding site. Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling. Moreover, the common iterative refinement strategies lead to an inefficient inference. In this paper, we propose a simple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner. To achieve this, we decouple the antibody CDR design problem into two stages: (i) geometric modeling of protein complex structures and (ii) sequence-structure co-learning. We develop a novel macromolecular structure invariant embedding, typically for protein complexes, that captures both intra- and inter-component interactions among the backbone atoms, including Calpha, N, C, and O atoms, to achieve comprehensive geometric modeling. Then, we introduce a simple cross-gate MLP for sequence-structure co-learning, allowing sequence and structure representations to implicitly refine each other. This enables our model to design desired sequences and structures in a one-shot manner. Extensive experiments are conducted to evaluate our results at both the sequence and structure level, which demonstrate that our model achieves superior performance compared to the state-of-the-art antibody CDR design methods.

Published

2024-03-24

How to Cite

Tan, C., Gao, Z., Wu, L., Xia, J., Zheng, J., Yang, X., Liu, Y., Hu, B., & Li, S. Z. (2024). Cross-Gate MLP with Protein Complex Invariant Embedding Is a One-Shot Antibody Designer. Proceedings of the AAAI Conference on Artificial Intelligence, 38(14), 15222-15230. https://doi.org/10.1609/aaai.v38i14.29445

Issue

Section

AAAI Technical Track on Machine Learning V