TY - JOUR
T1 - APEX-pHLA
T2 - A novel method for accurate prediction of the binding between exogenous short peptides and HLA class I molecules
AU - Su, Zhihao
AU - Wu, Yejian
AU - Cao, Kaiqiang
AU - Du, Jie
AU - Cao, Lujing
AU - Wu, Zhipeng
AU - Wu, Xinyi
AU - Wang, Xinqiao
AU - Song, Ying
AU - Wang, Xudong
AU - Duan, Hongliang
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2024/8
Y1 - 2024/8
N2 - Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.
AB - Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.
KW - BiLSTM
KW - CNN
KW - HLA-I peptide binding prediction
KW - Natural language processing
UR - http://www.scopus.com/inward/record.url?scp=85193989719&partnerID=8YFLogxK
U2 - 10.1016/j.ymeth.2024.05.013
DO - 10.1016/j.ymeth.2024.05.013
M3 - Article
C2 - 38772499
AN - SCOPUS:85193989719
SN - 1046-2023
VL - 228
SP - 38
EP - 47
JO - Methods
JF - Methods
ER -