TY - JOUR
T1 - GAPS
T2 - a geometric attention-based network for peptide binding site identification by the transfer learning approach
AU - Zhu, Cheng
AU - Zhang, Chengyun
AU - Shang, Tianfeng
AU - Zhang, Chenhao
AU - Zhai, Silong
AU - Cao, Lujing
AU - Xu, Zhenyu
AU - Su, Zhihao
AU - Song, Ying
AU - Su, An
AU - Li, Chengxi
AU - Duan, Hongliang
N1 - Publisher Copyright:
© The Author(s) 2024. Published by Oxford University Press.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - Protein–peptide interactions (PPepIs) are vital to understanding cellular functions, which can facilitate the design of novel drugs. As an essential component in forming a PPepI, protein–peptide binding sites are the basis for understanding the mechanisms involved in PPepIs. Therefore, accurately identifying protein–peptide binding sites becomes a critical task. The traditional experimental methods for researching these binding sites are labor-intensive and time-consuming, and some computational tools have been invented to supplement it. However, these computational tools have limitations in generality or accuracy due to the need for ligand information, complex feature construction, or their reliance on modeling based on amino acid residues. To deal with the drawbacks of these computational algorithms, we describe a geometric attention-based network for peptide binding site identification (GAPS) in this work. The proposed model utilizes geometric feature engineering to construct atom representations and incorporates multiple attention mechanisms to update relevant biological features. In addition, the transfer learning strategy is implemented for leveraging the protein–protein binding sites information to enhance the protein–peptide binding sites recognition capability, taking into account the common structure and biological bias between proteins and peptides. Consequently, GAPS demonstrates the state-of-the-art performance and excellent robustness in this task. Moreover, our model exhibits exceptional performance across several extended experiments including predicting the apo protein–peptide, protein–cyclic peptide and the AlphaFold-predicted protein–peptide binding sites. These results confirm that the GAPS model is a powerful, versatile, stable method suitable for diverse binding site predictions.
AB - Protein–peptide interactions (PPepIs) are vital to understanding cellular functions, which can facilitate the design of novel drugs. As an essential component in forming a PPepI, protein–peptide binding sites are the basis for understanding the mechanisms involved in PPepIs. Therefore, accurately identifying protein–peptide binding sites becomes a critical task. The traditional experimental methods for researching these binding sites are labor-intensive and time-consuming, and some computational tools have been invented to supplement it. However, these computational tools have limitations in generality or accuracy due to the need for ligand information, complex feature construction, or their reliance on modeling based on amino acid residues. To deal with the drawbacks of these computational algorithms, we describe a geometric attention-based network for peptide binding site identification (GAPS) in this work. The proposed model utilizes geometric feature engineering to construct atom representations and incorporates multiple attention mechanisms to update relevant biological features. In addition, the transfer learning strategy is implemented for leveraging the protein–protein binding sites information to enhance the protein–peptide binding sites recognition capability, taking into account the common structure and biological bias between proteins and peptides. Consequently, GAPS demonstrates the state-of-the-art performance and excellent robustness in this task. Moreover, our model exhibits exceptional performance across several extended experiments including predicting the apo protein–peptide, protein–cyclic peptide and the AlphaFold-predicted protein–peptide binding sites. These results confirm that the GAPS model is a powerful, versatile, stable method suitable for diverse binding site predictions.
KW - attention mechanism
KW - binding sites
KW - geometric deep learning
KW - transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85198495986&partnerID=8YFLogxK
U2 - 10.1093/bib/bbae297
DO - 10.1093/bib/bbae297
M3 - Article
C2 - 38990514
AN - SCOPUS:85198495986
SN - 1467-5463
VL - 25
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 4
M1 - bbae297
ER -