We present an analysis of the blind predictions submitted to the fold recognition category for the second meeting on the Critical Assessment of techniques for protein Structure Prediction. Our method achieves fold recognition from predicted secondary structure sequences using hidden Markov models (H
Prediction of protein folding rates from primary sequences using hybrid sequence representation
✍ Scribed by Yingfu Jiang; Paul Iglinski; Lukasz Kurgan
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 123 KB
- Volume
- 30
- Category
- Article
- ISSN
- 0192-8651
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
The ability to predict protein folding rates constitutes an important step in understanding the overall folding mechanisms. Although many of the prediction methods are structure based, successful predictions can also be obtained from the sequence. We developed a novel method called prediction of protein folding rates (PPFR), for the prediction of protein folding rates from protein sequences. PPFR implements a linear regression model for each of the mainstream folding dynamics including two‐, multi‐, and mixed‐state proteins. The proposed method provides predictions characterized by strong correlations with the experimental folding rates, which equal 0.87 for the two‐ and multistate proteins and 0.82 for the mixed‐state proteins, when evaluated with out‐of‐sample jackknife test. Based on in‐sample and out‐of‐sample tests, the PPFR's predictions are shown to be better than most of other sequence only and structure‐based predictors and complementary to the predictions of the most recent sequence‐based QRSM method. We show that simultaneous incorporation of several characteristics, including the sequence, physiochemical properties of residues, and predicted secondary structure provides improved quality. This hybridized prediction model was analyzed to reveal the complementary factors that can be used in tandem to predict folding rates. We show that bigger proteins require more time for folding, higher helical and coil content and the presence of Phe, Asn, and Gln may accelerate the folding process, the inclusion of Ile, Val, Thr, and Ser may slow down the folding process, and for the two‐state proteins increased β‐strand content may decelerate the folding process. Finally, PPFR provides strong correlation when predicting sequences with low similarity. © 2008 Wiley Periodicals, Inc. J Comput Chem, 2009
📜 SIMILAR VOLUMES
## Abstract Knowledge of structural classes is useful in understanding of folding patterns in proteins. Although existing structural class prediction methods applied virtually all state‐of‐the‐art classifiers, many of them use a relatively simple protein sequence representation that often includes
Analysis of our fold recognition results in the 3rd Critical Assessment in Structure Prediction (CASP3) experiment, using the programs THREADER 2 and GenTHREADER, shows an encouraging level of overall success. Of the 23 submitted predictions, 20 targets showed no clear sequence similarity to protein