TY - JOUR
T1 - 3DSGIMD
T2 - An accurate and interpretable molecular property prediction method using 3D spatial graph focusing network and structure-based feature fusion
AU - Tian, Yanan
AU - Wang, Chenbin
AU - Lu, Ruiqiang
AU - Tong, Henry H.Y.
AU - Gong, Xiaoqing
AU - Qiu, Jiayue
AU - Peng, Shaoliang
AU - Yao, Xiaojun
AU - Liu, Huanxiang
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/12
Y1 - 2024/12
N2 - A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.
AB - A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.
KW - 3D molecular graph
KW - 3D spatial information
KW - Deep learning
KW - Interpretability
KW - Molecular property prediction
UR - http://www.scopus.com/inward/record.url?scp=85198713135&partnerID=8YFLogxK
U2 - 10.1016/j.future.2024.07.004
DO - 10.1016/j.future.2024.07.004
M3 - Article
AN - SCOPUS:85198713135
SN - 0167-739X
VL - 161
SP - 189
EP - 200
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -