TY - JOUR
T1 - PepPCBench is a Comprehensive Benchmarking Framework for Protein–Peptide Complex Structure Prediction
AU - Zhai, Silong
AU - Zhao, Huifeng
AU - Wang, Jike
AU - Lin, Shaolong
AU - Liu, Tiantao
AU - Gu, Shukai
AU - Jiang, Dejun
AU - Liu, Huanxiang
AU - Kang, Yu
AU - Yao, Xiaojun
AU - Hou, Tingjun
N1 - Publisher Copyright:
© 2025 American Chemical Society
PY - 2025/8/25
Y1 - 2025/8/25
N2 - Accurate modeling of protein–peptide interactions is essential for understanding fundamental biological processes and designing peptide-based drugs. However, predicting the complex structures of these interactions remains challenging, primarily due to the high conformational flexibility of peptides. To support a fair and systematic evaluation of recent deep learning (DL) approaches, we introduce PepPCBench, a benchmarking framework tailored to assess protein folding neural networks (PFNNs) in protein–peptide complex prediction. As part of this framework, we curated PepPCSet, a data set of 261 experimentally resolved complexes with peptides ranging from 5 to 30 residues. We benchmark five full-atom PFNNs, including AlphaFold3 (AF3), AlphaFold-Multimer (AFM), Chai-1, HelixFold3 (HF3), and RoseTTAFold-All-Atom (RFAA), using comprehensive evaluation metrics. Our benchmarking reveals meaningful performance differences among these methods and highlights the influence of peptide length, conformational flexibility, and training set similarity on prediction accuracy. While AF3 shows strong performance in structure prediction, further analysis indicates that confidence metrics correlate poorly with experimental binding affinities, underscoring the need for improved scoring strategies and generalizability. By providing a reproducible and extensible framework, PepPCBench enables a robust evaluation of PFNN-based methods and supports their continued development for peptide–protein structure prediction.
AB - Accurate modeling of protein–peptide interactions is essential for understanding fundamental biological processes and designing peptide-based drugs. However, predicting the complex structures of these interactions remains challenging, primarily due to the high conformational flexibility of peptides. To support a fair and systematic evaluation of recent deep learning (DL) approaches, we introduce PepPCBench, a benchmarking framework tailored to assess protein folding neural networks (PFNNs) in protein–peptide complex prediction. As part of this framework, we curated PepPCSet, a data set of 261 experimentally resolved complexes with peptides ranging from 5 to 30 residues. We benchmark five full-atom PFNNs, including AlphaFold3 (AF3), AlphaFold-Multimer (AFM), Chai-1, HelixFold3 (HF3), and RoseTTAFold-All-Atom (RFAA), using comprehensive evaluation metrics. Our benchmarking reveals meaningful performance differences among these methods and highlights the influence of peptide length, conformational flexibility, and training set similarity on prediction accuracy. While AF3 shows strong performance in structure prediction, further analysis indicates that confidence metrics correlate poorly with experimental binding affinities, underscoring the need for improved scoring strategies and generalizability. By providing a reproducible and extensible framework, PepPCBench enables a robust evaluation of PFNN-based methods and supports their continued development for peptide–protein structure prediction.
UR - https://www.scopus.com/pages/publications/105014114514
U2 - 10.1021/acs.jcim.5c01084
DO - 10.1021/acs.jcim.5c01084
M3 - Article
C2 - 40792461
AN - SCOPUS:105014114514
SN - 1549-9596
VL - 65
SP - 8497
EP - 8513
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
IS - 16
ER -