An information-theoretic approach to the prediction of protein structural class
β Scribed by Xiaoqi Zheng; Chun Li; Jun Wang
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 143 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0192-8651
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
An informationβtheoretical approach, which combines a sequence decomposition technique and a fuzzy clustering algorithm, is proposed for prediction of protein structural class. This approach could bypass the process of selecting and comparing sequence features as done previously. First, distances between each pair of protein sequences are estimated using a conditional decomposition technique in information theory. Then, the fuzzy kβnearest neighbor algorithm is used to identify the structural class of a protein given as set of sample sequences. To verify the strength of our method, we choose three widely used datasets constructed by Chou and Zhou. It is shown by the Jackknife test that our approach represents an improvement in the prediction of accuracy over existing methods. Β© 2009 Wiley Periodicals, Inc. J Comput Chem, 2010
π SIMILAR VOLUMES
The multidimensional statistical technique of discriminant analysis is used to allocate amino acid sequences to one of four secondary structural classes: high a content, high / 3 content, mixed a and @, low content of ordered structure. Discrimination is based on four attributes: estimates of percen
Based on the concept that the structural class of a protein is mainly determined by its secondary structure sequence, a new algorithm for prediction of the structural class of a protein is proposed. By use of the number of -helices, -strands, and fragments, the structural class of a protein can be p
## Abstract The structural class is an important feature widely used to characterize the overall folding type of a protein. How to improve the prediction quality for protein structural classification by effectively incorporating the sequenceβorder effects is an important and challenging problem. Ba
## Abstract Supervised classifiers, such as artificial neural network, partition trees, and support vector machines, are often used for the prediction and analysis of biological data. However, choosing an appropriate classifier is not straightforward because each classifier has its own strengths an