TY - JOUR
T1 - Study on the structure-activity relationship of new anti-HIV nucleoside derivatives based on the support vector machine method
AU - Wang, Jie
AU - Liu, Huanxiang
AU - Qin, Shen
AU - Yao, Xiaojun
AU - Liu, Mancang
AU - Hu, Zhide
AU - Fan, Botao
PY - 2007/2
Y1 - 2007/2
N2 - Support Vector Machine (SVM) was used for the classification of the activity of the new anti-HIV nucleosides derivatives for a large and diverse data set using the twelve descriptors that were calculated from the molecular structure alone. The molecular descriptors were selected by the stepwise Linear Discriminant Analysis (LDA) method implemented in SPSS. The correlation between all the descriptors was lower than 0.85. At the same time, in order to build a nonlinear model to classify the new anti-HIV drugs according to their activities, the data set was divided into two subgroups: the training set and the testing set. The nonlinear model gives satisfactory results, which can classify correctly 91.5% of the compounds in the training set and 91.4% of the compounds in the testing set. In addition, this paper provides a new and effective method for classifying the new anti-HIV nucleoside derivatives from their structures according to their activities and gives some insight into structural features related to the activity of the drugs.
AB - Support Vector Machine (SVM) was used for the classification of the activity of the new anti-HIV nucleosides derivatives for a large and diverse data set using the twelve descriptors that were calculated from the molecular structure alone. The molecular descriptors were selected by the stepwise Linear Discriminant Analysis (LDA) method implemented in SPSS. The correlation between all the descriptors was lower than 0.85. At the same time, in order to build a nonlinear model to classify the new anti-HIV drugs according to their activities, the data set was divided into two subgroups: the training set and the testing set. The nonlinear model gives satisfactory results, which can classify correctly 91.5% of the compounds in the training set and 91.4% of the compounds in the testing set. In addition, this paper provides a new and effective method for classifying the new anti-HIV nucleoside derivatives from their structures according to their activities and gives some insight into structural features related to the activity of the drugs.
KW - Classification
KW - Nucleoside derivatives
KW - Structure-activity relationship
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=34250682210&partnerID=8YFLogxK
U2 - 10.1002/qsar.200510166
DO - 10.1002/qsar.200510166
M3 - Article
AN - SCOPUS:34250682210
SN - 1611-020X
VL - 26
SP - 161
EP - 172
JO - QSAR and Combinatorial Science
JF - QSAR and Combinatorial Science
IS - 2
ER -