Recognition of protein folding kinetics pathways based on amino acid properties information derived from primary sequence

Lili Xi, Shuyan Li, Yuhui Wei, Xin'an Wu, Huanxiang Liu, Xiaojun Yao

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Recognition of protein folding kinetics pathways is an effective approach for the study of protein folding behaviors, and thereby to get a better understanding of mechanism that how a protein folds into a functional structure. In this study, we presented a novel method for the classification of protein folding kinetics pathways based on a new class of features weighted by amino acid properties, which were derived from protein primary sequence. According to the leave-one-out and bootstrap cross-validation results, the model with eight features was the best one, and it achieved a satisfactory prediction accuracy of 91.67% for training set; while n-fold cross-validation had also been performed and the results showed that the built model was stable. Besides, the external test set was employed to evaluate the predictive ability of the built model. The accuracy for external test set achieved 88.24% and MCC was 0.79. Next, the selected important features were analyzed for a better understanding of the protein folding mechanisms. The analysis suggests that long-range interaction and unfolding Gibbs free energy change are important factors in determining the protein folding kinetics pathways. Besides, hydrophobicity, secondary structure and charges are also implied to be the important properties that affect the behavior of protein folding.

Original languageEnglish
Pages (from-to)76-82
Number of pages7
JournalChemometrics and Intelligent Laboratory Systems
Volume126
DOIs
Publication statusPublished - 5 Jul 2013
Externally publishedYes

Keywords

  • Amino acid properties
  • Least Squares-Support Vector Machines (LS-SVMs)
  • Protein folding kinetics pathway
  • Support Vector Machine-Recursive Feature Elimination (SVM-RFE)

Fingerprint

Dive into the research topics of 'Recognition of protein folding kinetics pathways based on amino acid properties information derived from primary sequence'. Together they form a unique fingerprint.

Cite this