## Abstract The structural class is an important feature widely used to characterize the overall folding type of a protein. How to improve the prediction quality for protein structural classification by effectively incorporating the sequenceβorder effects is an important and challenging problem. Ba
An optimization approach to predicting protein structural class from amino acid composition
β Scribed by Chun-Ting Zhang; Kuo-Chen Chou
- Publisher
- Cold Spring Harbor Laboratory Press
- Year
- 2008
- Tongue
- English
- Weight
- 644 KB
- Volume
- 1
- Category
- Article
- ISSN
- 0961-8368
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
Proteins are generally classified into four structural classes: allβΞ± proteins, allβΞ² proteins, Ξ±+Ξ² proteins, and Ξ±/Ξ² proteins. In this article, a protein is expressed as a vector of 20βdimensional space, in which its 20 components are defined by the composition of its 20 amino acids. Based on this, a new method, the soβcalled maximum component coefficient method, is proposed for predicting the structural class of a protein according to its amino acid composition. In comparison with the existing methods, the new method yields a higher general accuracy of prediction. Especially for the allβΞ± proteins, the rate of correct prediction obtained by the new method is much higher than that by any of the existing methods. For instance, for the 19 allβΞ± proteins investigated previously by P.Y. Chou, the rate of correct prediction by means of his method was 84.2%, but the correct rate when predicted with the new method would be 100%! Furthermore, the new method is characterized by an explicable physical picture. This is reflected by the process in which the vector representing a protein to be predicted is decomposed into four component vectors, each of which corresponds to one of the norms of the four protein structural classes.
π SIMILAR VOLUMES
## Abstract The proteins structure can be mainly classified into four classes: allβΞ±, all**β**Ξ², Ξ±/Ξ², and Ξ± + Ξ² protein according to their chain fold topologies. For the purpose of predicting the protein structural class, a new predicting algorithm, in which the increment of diversity combines with
The multidimensional statistical technique of discriminant analysis is used to allocate amino acid sequences to one of four secondary structural classes: high a content, high / 3 content, mixed a and @, low content of ordered structure. Discrimination is based on four attributes: estimates of percen
## Abstract Using the pseudo amino acid (PseAA) composition to represent the sample of a protein can incorporate a considerable amount of sequence pattern information so as to improve the prediction quality for its structural or functional classification. However, how to optimally formulate the Pse