𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Predicting enzyme family classes by hybridizing gene product composition and pseudo-amino acid composition

✍ Scribed by Yu-Dong Cai; Guo-Ping Zhou; Kuo-Chen Chou


Book ID
104034575
Publisher
Elsevier Science
Year
2005
Tongue
English
Weight
196 KB
Volume
234
Category
Article
ISSN
0022-5193

No coin nor oath required. For personal study only.

✦ Synopsis


A new method has been developed to predict the enzymatic attribute of proteins by hybridizing the gene product composition and pseudo amino acid composition. As a demonstration, a working dataset was generated with a cutoff of 60% sequence identity to avoid redundancy and bias in statistical prediction. The dataset thus constructed contains 39989 protein sequences, of which 27469 are non-enzymes and 12520 enzymes that were further classified into 6 enzyme family classes according to their 6 main EC (Enzyme Commission) numbers (2314 are oxidoreductases, 3653 transferases, 3246 hydrolases, 1307 lyases, 676 isomerases, and 1324 ligases). The overall success rate by the jackknife test for the identification between enzyme and non-enzyme was 94%, and that for the identification among the 6 enzyme family classes was 98%. It is anticipated that, with the rapid increase of protein sequences entering into databanks, the current method will become a useful automated tool in identifying the enzymatic attribute of a newly found protein sequence.


πŸ“œ SIMILAR VOLUMES


Predicting subcellular localization of p
✍ Kuo-Chen Chou; Yu-Dong Cai πŸ“‚ Article πŸ“… 2004 πŸ› John Wiley and Sons 🌐 English βš– 106 KB

## Abstract Recent advances in large‐scale genome sequencing have led to the rapid accumulation of amino acid sequences of proteins whose functions are unknown. Since the functions of these proteins are closely correlated with their subcellular localizations, many efforts have been made to develop

Using grey dynamic modeling and pseudo a
✍ Xuan Xiao; Wei-Zhong Lin; Kuo-Chen Chou πŸ“‚ Article πŸ“… 2008 πŸ› John Wiley and Sons 🌐 English βš– 214 KB

## Abstract Using the pseudo amino acid (PseAA) composition to represent the sample of a protein can incorporate a considerable amount of sequence pattern information so as to improve the prediction quality for its structural or functional classification. However, how to optimally formulate the Pse