Skip to main navigation Skip to search Skip to main content

Could statistical potential models achieve comparable or better performance than deep learning models?

  • Zhihao Wang
  • , Sheng Wang
  • , Jingjing Guo
  • , Yuguang Mu
  • , Xiangdong Liu
  • , Liangzhen Zheng
  • , Weifeng Li

Research output: Contribution to journalArticlepeer-review

Abstract

Accurately predicting protein–ligand interactions is vital for structure-based drug discovery. Although deep learning (DL) models have shown strong performance, the potential of traditional statistical potentials under data-limited conditions remains underexplored. Here, we systematically assess several statistical potential models in docking and virtual screening. We find that docking benefits from distance-dependent pairwise atom–atom potentials with clear physical meanings, while screening relies more on orientation-dependent atom–residue potentials that capture local chemical environments. Based on these findings, we propose HybridSP, a hybrid potential combining distance-dependent atom–atom, atom–residue, and orientation-dependent atom–residue terms. An affinity-weighted scheme is applied to correct biases in statistical distributions. On the CASF-2016 benchmark, HybridSP achieves a 91.6% docking success rate and an enrichment factor of 29.35 at the top 1%, rivaling and even surpassing state-of-the-art DL models. Its strong screening ability is further validated on directory of useful decoys-enhanced and directory of useful decoys-adjusted. These results demonstrate that well-designed statistical potentials can achieve high performance and interpretability without complex DL architectures, offering an efficient alternative for scoring function design. The models are available at: https://github.com/zelixirSH/HybridSP.git.

Original languageEnglish
Article numberbbag088
JournalBriefings in Bioinformatics
Volume27
Issue number2
DOIs
Publication statusPublished - 1 Mar 2026

Keywords

  • protein–ligand interaction
  • scoring function
  • statistical potential

Fingerprint

Dive into the research topics of 'Could statistical potential models achieve comparable or better performance than deep learning models?'. Together they form a unique fingerprint.

Cite this