TY - JOUR
T1 - Comparative multi-task deep learning models for protein–nucleic acid interaction prediction
T2 - Unveiling the superior efficacy of the PNI-MAMBA architecture
AU - Cui, Weirong
AU - Ye, Yilin
AU - Guo, Jingjing
AU - Yao, Xiaojun
AU - Tong, Henry Hoi Yee
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/10
Y1 - 2025/10
N2 - Protein-nucleic acid interactions (PNI) play crucial roles in various life processes, including gene expression regulation, DNA replication, repair, recombination, and RNA processing and translation. However, accurately predicting these interactions remains challenging due to their complexity. This paper proposes a deep learning-based multi-task learning framework for predicting protein-nucleic acid interactions. The integrated framework comprises four independent deep learning models: PNI-FCN, PNI-Transformer, PNI-MAMBA, and PNI-MAMBA2. PNI-FCN leverages fully connected neural networks, PNI-Transformer utilizes Transformer networks, and both PNI-MAMBA and PNI-MAMBA2 are built upon Mamba network architectures. A novel binding site attention mechanism is introduced to capture key binding site information. The multi-task learning objective function combines the binary classification cross-entropy loss with a binding site loss to guide the model's focus on critical regions. Experiments on merged DNA and RNA datasets demonstrate the effectiveness of the proposed framework in accurately predicting protein-nucleic acid interactions and identifying binding PNI sites. Notably, the architectural framework leveraging PNI-MAMBA(s)—encompassing both PNI-MAMBA and PNI-MAMBA2—demonstrates superior overall performance, thereby enhancing both the accuracy and robustness of predictions. This work offers significant insights into the underlying molecular mechanisms and lays a strong foundation for the development of targeted therapeutic interventions.
AB - Protein-nucleic acid interactions (PNI) play crucial roles in various life processes, including gene expression regulation, DNA replication, repair, recombination, and RNA processing and translation. However, accurately predicting these interactions remains challenging due to their complexity. This paper proposes a deep learning-based multi-task learning framework for predicting protein-nucleic acid interactions. The integrated framework comprises four independent deep learning models: PNI-FCN, PNI-Transformer, PNI-MAMBA, and PNI-MAMBA2. PNI-FCN leverages fully connected neural networks, PNI-Transformer utilizes Transformer networks, and both PNI-MAMBA and PNI-MAMBA2 are built upon Mamba network architectures. A novel binding site attention mechanism is introduced to capture key binding site information. The multi-task learning objective function combines the binary classification cross-entropy loss with a binding site loss to guide the model's focus on critical regions. Experiments on merged DNA and RNA datasets demonstrate the effectiveness of the proposed framework in accurately predicting protein-nucleic acid interactions and identifying binding PNI sites. Notably, the architectural framework leveraging PNI-MAMBA(s)—encompassing both PNI-MAMBA and PNI-MAMBA2—demonstrates superior overall performance, thereby enhancing both the accuracy and robustness of predictions. This work offers significant insights into the underlying molecular mechanisms and lays a strong foundation for the development of targeted therapeutic interventions.
KW - Binding site attention mechanism
KW - Deep learning
KW - MAMBA
KW - Multi-task learning
KW - Protein-nucleic acid interactions
UR - https://www.scopus.com/pages/publications/105015298330
U2 - 10.1016/j.ijbiomac.2025.147419
DO - 10.1016/j.ijbiomac.2025.147419
M3 - Article
C2 - 40914361
AN - SCOPUS:105015298330
SN - 0141-8130
VL - 327
JO - International Journal of Biological Macromolecules
JF - International Journal of Biological Macromolecules
M1 - 147419
ER -