TY - JOUR
T1 - PepExplainer
T2 - An explainable deep learning model for selection-based macrocyclic peptide bioactivity prediction and optimization
AU - Zhai, Silong
AU - Tan, Yahong
AU - Zhu, Cheng
AU - Zhang, Chengyun
AU - Gao, Yan
AU - Mao, Qingyi
AU - Zhang, Youming
AU - Duan, Hongliang
AU - Yin, Yizhen
N1 - Publisher Copyright:
© 2024 Elsevier Masson SAS
PY - 2024/9/5
Y1 - 2024/9/5
N2 - Macrocyclic peptides possess unique features, making them highly promising as a drug modality. However, evaluating their bioactivity through wet lab experiments is generally resource-intensive and time-consuming. Despite advancements in artificial intelligence (AI) for bioactivity prediction, challenges remain due to limited data availability and the interpretability issues in deep learning models, often leading to less-than-ideal predictions. To address these challenges, we developed PepExplainer, an explainable graph neural network based on substructure mask explanation (SME). This model excels at deciphering amino acid substructures, translating macrocyclic peptides into detailed molecular graphs at the atomic level, and efficiently handling non-canonical amino acids and complex macrocyclic peptide structures. PepExplainer's effectiveness is enhanced by utilizing the correlation between peptide enrichment data from selection-based focused library and bioactivity data, and employing transfer learning to improve bioactivity predictions of macrocyclic peptides against IL-17C/IL-17 RE interaction. Additionally, PepExplainer underwent further validation for bioactivity prediction using an additional set of thirteen newly synthesized macrocyclic peptides. Moreover, it enabled the optimization of the IC50 of a macrocyclic peptide, reducing it from 15 nM to 5.6 nM based on the contribution score provided by PepExplainer. This achievement underscores PepExplainer's skill in deciphering complex molecular patterns, highlighting its potential to accelerate the discovery and optimization of macrocyclic peptides.
AB - Macrocyclic peptides possess unique features, making them highly promising as a drug modality. However, evaluating their bioactivity through wet lab experiments is generally resource-intensive and time-consuming. Despite advancements in artificial intelligence (AI) for bioactivity prediction, challenges remain due to limited data availability and the interpretability issues in deep learning models, often leading to less-than-ideal predictions. To address these challenges, we developed PepExplainer, an explainable graph neural network based on substructure mask explanation (SME). This model excels at deciphering amino acid substructures, translating macrocyclic peptides into detailed molecular graphs at the atomic level, and efficiently handling non-canonical amino acids and complex macrocyclic peptide structures. PepExplainer's effectiveness is enhanced by utilizing the correlation between peptide enrichment data from selection-based focused library and bioactivity data, and employing transfer learning to improve bioactivity predictions of macrocyclic peptides against IL-17C/IL-17 RE interaction. Additionally, PepExplainer underwent further validation for bioactivity prediction using an additional set of thirteen newly synthesized macrocyclic peptides. Moreover, it enabled the optimization of the IC50 of a macrocyclic peptide, reducing it from 15 nM to 5.6 nM based on the contribution score provided by PepExplainer. This achievement underscores PepExplainer's skill in deciphering complex molecular patterns, highlighting its potential to accelerate the discovery and optimization of macrocyclic peptides.
KW - Bioactivity prediction
KW - Graph neural network (GNN)
KW - Machine learning (ML)
KW - Macrocyclic peptide
KW - Optimization
KW - Structure-activity relationship (SAR)
UR - http://www.scopus.com/inward/record.url?scp=85197483532&partnerID=8YFLogxK
U2 - 10.1016/j.ejmech.2024.116628
DO - 10.1016/j.ejmech.2024.116628
M3 - Article
AN - SCOPUS:85197483532
SN - 0223-5234
VL - 275
JO - European Journal of Medicinal Chemistry
JF - European Journal of Medicinal Chemistry
M1 - 116628
ER -