𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Prediction of protein structural class using novel evolutionary collocation-based sequence representation

✍ Scribed by Ke Chen; Lukasz A. Kurgan; Jishou Ruan


Publisher
John Wiley and Sons
Year
2008
Tongue
English
Weight
178 KB
Volume
29
Category
Article
ISSN
0192-8651

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

Knowledge of structural classes is useful in understanding of folding patterns in proteins. Although existing structural class prediction methods applied virtually all state‐of‐the‐art classifiers, many of them use a relatively simple protein sequence representation that often includes amino acid (AA) composition. To this end, we propose a novel sequence representation that incorporates evolutionary information encoded using PSI‐BLAST profile‐based collocation of AA pairs. We used six benchmark datasets and five representative classifiers to quantify and compare the quality of the structural class prediction with the proposed representation. The best, classifier support vector machine achieved 61–96% accuracy on the six datasets. These predictions were comprehensively compared with a wide range of recently proposed methods for prediction of structural classes. Our comprehensive comparison shows superiority of the proposed representation, which results in error rate reductions that range between 14% and 26% when compared with predictions of the best‐performing, previously published classifiers on the considered datasets. The study also shows that, for the benchmark dataset that includes sequences characterized by low identity (i.e., 25%, 30%, and 40%), the prediction accuracies are 20–35% lower than for the other three datasets that include sequences with a higher degree of similarity. In conclusion, the proposed representation is shown to substantially improve the accuracy of the structural class prediction. A web server that implements the presented prediction method is freely available at http://biomine.ece.ualberta.ca/Structural_Class/SCEC.html. © 2008 Wiley Periodicals, Inc. J Comput Chem 2008


📜 SIMILAR VOLUMES


Improving protein structural class predi
✍ Qi Dai; Li Wu; Lihua Li 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 116 KB

## Abstract Protein structural class prediction solely from protein sequences is a challenging problem in bioinformatics. Numerous efficient methods have been proposed for protein structural class prediction, but challenges remain. Using novel combined sequence information coupled with predicted se

Using support vector machines for predic
✍ Jian-Ding Qiu; San-Hua Luo; Jian-Hua Huang; Ru-Ping Liang 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 211 KB

## Abstract The prediction of secondary structure is a fundamental and important component in the analytical study of protein structure and functions. How to improve the predictive accuracy of protein structural classification by effectively incorporating the sequence‐order effects is an important