TY - JOUR
T1 - Global and local prediction of protein folding rates based on sequence autocorrelation information
AU - Xi, Lili
AU - Li, Shuyan
AU - Liu, Huanxiang
AU - Li, Jiazhong
AU - Lei, Beilei
AU - Yao, Xiaojun
PY - 2010/6
Y1 - 2010/6
N2 - To understand the folding behavior of proteins is an important and challenging problem in modern molecular biology. In the present investigation, a large number of features representing protein sequences were developed based on sequence autocorrelation weighted by properties of amino acid residues. Genetic algorithm (GA) combined with multiple linear regression (MLR) was employed to select significant features related to protein folding rates, and to build global predictive model. Moreover, local lazy regression (LLR) method was also used to predict the protein folding rates. The obtained results indicated that LLR performed much better than the global MLR model. The important properties of amino acid residues affecting protein folding rates were also analyzed. The results of this study will be helpful to understand the mechanism of protein folding. Our results also demonstrate that the features of amino acid sequence autocorrelation is effective in representing the relationship between protein sequence and folding rates, and the local method is a powerful tool to predict the protein folding rates.
AB - To understand the folding behavior of proteins is an important and challenging problem in modern molecular biology. In the present investigation, a large number of features representing protein sequences were developed based on sequence autocorrelation weighted by properties of amino acid residues. Genetic algorithm (GA) combined with multiple linear regression (MLR) was employed to select significant features related to protein folding rates, and to build global predictive model. Moreover, local lazy regression (LLR) method was also used to predict the protein folding rates. The obtained results indicated that LLR performed much better than the global MLR model. The important properties of amino acid residues affecting protein folding rates were also analyzed. The results of this study will be helpful to understand the mechanism of protein folding. Our results also demonstrate that the features of amino acid sequence autocorrelation is effective in representing the relationship between protein sequence and folding rates, and the local method is a powerful tool to predict the protein folding rates.
KW - Amino acid sequence autocorrelation
KW - Genetic algorithm
KW - Local lazy regression
KW - Multiple linear regression
KW - Protein folding rate
UR - https://www.scopus.com/pages/publications/77952953441
U2 - 10.1016/j.jtbi.2010.03.042
DO - 10.1016/j.jtbi.2010.03.042
M3 - Article
C2 - 20362588
AN - SCOPUS:77952953441
SN - 0022-5193
VL - 264
SP - 1159
EP - 1168
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
IS - 4
ER -