TY - JOUR
T1 - Study of quantitative structure-mobility relationship of the peptides based on the structural descriptors and support vector machines
AU - Liu, Huanxiang
AU - Yao, Xiaojun
AU - Xue, Chunxia
AU - Zhang, Ruisheng
AU - Liu, Mancang
AU - Hu, Zhide
AU - Fan, Botao
N1 - Funding Information:
The authors thank the Association Franco-Chinoise pour la Recherche Scientifique and Technique (AFCRST) for supporting this study (Programme PRA SI 02-03). The authors also thank the R Development Core Team for affording the free R1.7.1 software.
PY - 2005/6/29
Y1 - 2005/6/29
N2 - Support vector machines (SVM), as a novel learning machine, was used to develop the non-linear quantitative structure-mobility relationship model of the peptides based on the calculated descriptors for the first time. The molecular descriptors representing the structural features of the compounds included the constitutional and topological descriptors calculated by CODESSA program, which can be obtained easily without optimizing the structure of the molecule, and CPSA (charged partial surface area) descriptors obtained by SYBYL software. The MLR method was used to select the descriptors responsible for the electrophoretic mobility of peptides and develop the linear model. The prediction result of the SVM model (ε = 0.04, γ = 0.002 and C = 100) is much better than that obtained by MLR method. The RMS error of the training set, the test set and the whole set is 0.0569, 0.0553, 0.0565 and the prediction correlation coefficient is 0.925, 0.912 and 0.922, respectively. The prediction results are in agreement with the experimental values. This paper provided a new and effective method for predicting the electrophoretic behavior of peptide and some insight into what structural features are related to the electrophoretic mobility of peptides. Moreover, it also offered an idea about dealing with the structural optimization and obtaining their structural descriptors for biomacromolecules.
AB - Support vector machines (SVM), as a novel learning machine, was used to develop the non-linear quantitative structure-mobility relationship model of the peptides based on the calculated descriptors for the first time. The molecular descriptors representing the structural features of the compounds included the constitutional and topological descriptors calculated by CODESSA program, which can be obtained easily without optimizing the structure of the molecule, and CPSA (charged partial surface area) descriptors obtained by SYBYL software. The MLR method was used to select the descriptors responsible for the electrophoretic mobility of peptides and develop the linear model. The prediction result of the SVM model (ε = 0.04, γ = 0.002 and C = 100) is much better than that obtained by MLR method. The RMS error of the training set, the test set and the whole set is 0.0569, 0.0553, 0.0565 and the prediction correlation coefficient is 0.925, 0.912 and 0.922, respectively. The prediction results are in agreement with the experimental values. This paper provided a new and effective method for predicting the electrophoretic behavior of peptide and some insight into what structural features are related to the electrophoretic mobility of peptides. Moreover, it also offered an idea about dealing with the structural optimization and obtaining their structural descriptors for biomacromolecules.
KW - Electrophoretic mobility
KW - Peptides
KW - Prediction
KW - Quantitative structure-mobility relationship
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=20444410445&partnerID=8YFLogxK
U2 - 10.1016/j.aca.2005.04.006
DO - 10.1016/j.aca.2005.04.006
M3 - Article
AN - SCOPUS:20444410445
SN - 0003-2670
VL - 542
SP - 249
EP - 259
JO - Analytica Chimica Acta
JF - Analytica Chimica Acta
IS - 2
ER -