CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues

Linglin Jing; Sheng Xu; Yifan Wang; Yuzhe Zhou; Tao Shen; Zhigang Ji; Hui Fang; Zhen Li; Siqi Sun

doi:10.1609/aaai.v38i3.28044

Authors

Linglin Jing Shanghai Artifcial Intelligence Laboratory Department of Computer Science, Loughborough University
Sheng Xu Research Institute of Intelligent Complex Systems, Fudan University Shanghai Artifcial Intelligence Laboratory
Yifan Wang Department of Computer Science, Loughborough University
Yuzhe Zhou SSE & FNII, The Chinese University of Hong Kong (Shenzhen)
Tao Shen Research Institute of Intelligent Complex Systems, Fudan University
Zhigang Ji Shanghai Jiao Tong University
Hui Fang Department of Computer Science, Loughborough University
Zhen Li SSE & FNII, The Chinese University of Hong Kong (Shenzhen)
Siqi Sun Research Institute of Intelligent Complex Systems, Fudan University Shanghai Artifcial Intelligence Laboratory

DOI:

https://doi.org/10.1609/aaai.v38i3.28044

Keywords:

CV: Medical and Biological Imaging, CV: Biometrics, Face, Gesture & Pose, CV: Multi-modal Vision

Abstract

Accurate identification of protein nucleic acid binding residues poses a significant challenge with important implications for various biological processes and drug design. Many typical computational methods for protein analysis rely on a single model that could ignore either the semantic context of the protein or the global 3D geometric information. Consequently, these approaches may result in incomplete or inaccurate protein analysis. To address the above issue, in this paper, we present CrossBind, a novel collaborative cross modal approach for identifying binding residues by exploiting both protein geometric structure and its sequence prior knowledge extracted from a large scale protein language model. Specifically, our multi modal approach leverages a contrastive learning technique and atom wise attention to capture the positional relationships between atoms and residues, thereby incorporating fine grained local geometric knowledge, for better binding residue prediction. Extensive experimental results demonstrate that our approach outperforms the next best state of the art methods, GraphSite and GraphBind, on DNA and RNA datasets by 10.8/17.3% in terms of the harmonic mean of precision and recall (F1 Score) and 11.9/24.8% in Matthews correlation coefficient (MCC), respectively. We release the code at https://github.com/BEAM-Labs/CrossBind.

CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information