Alignment using genetic programming with causal trees for identification of protein functions
✍ Scribed by Chun-Min Hung; Yueh-Min Huang; Ming-Shi Chang
- Publisher: Elsevier Science
- Year: 2006
- Language: English
- File size: 837 KB
- Volume: 65
- Category: Article
- ISSN: 0362-546X
✦ Synopsis
A hybrid evolutionary model is used to construct a hierarchical homology of protein sequences so that protein functions can be identified systematically. Given the inconsistency of existing methods for predicting novel proteins, the proposed model offers considerable potential. Because some novel proteins may align without any meaningful conserved domains, maximizing the sequence-alignment score is not the best criterion for predicting protein functions. This work instead presents a decision model that uses the hierarchical homologies to minimize the cost of deciding on a predicted protein function. The model has three characteristics: (i) it is a hybrid evolutionary model with multiple fitness functions that uses genetic programming to predict protein functions on a distantly related protein family, (ii) it incorporates modified robust point matching to compare all feature points accurately using the moment-invariant and thin-plate spline theorems, and (iii) the hierarchical homologies that support a novel protein sequence are expressed as a causal tree, which effectively shows the relationships between proteins. The model is applied to compare nucleocapsid proteins from the putative polyprotein SARS virus with those of other coronaviruses in other hosts.
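The synopsis contrasts choosing the highest-scoring alignment with choosing the decision of lowest cost. The paper's actual cost function is not given here, so the following is only a minimal sketch of a generic minimum-expected-cost decision rule. The function name `min_cost_decision`, the candidate labels, the probabilities, and the cost values are all hypothetical and chosen purely for illustration.

```python
def min_cost_decision(posteriors, cost):
    """Pick the decision with the lowest expected cost.

    posteriors: dict mapping each candidate function to an assumed probability.
    cost: nested dict, cost[decision][true_function] = penalty for that outcome.
    Both inputs are illustrative assumptions, not values from the paper.
    """
    # Expected cost of each decision, averaged over the possible true functions.
    expected = {
        decision: sum(cost[decision][true] * p for true, p in posteriors.items())
        for decision in cost
    }
    # The decision whose expected cost is smallest.
    best = min(expected, key=expected.get)
    return best, expected


if __name__ == "__main__":
    # Hypothetical belief after alignment: "nucleocapsid" looks more likely.
    posteriors = {"nucleocapsid": 0.6, "other": 0.4}
    # Hypothetical asymmetric penalties: a false "nucleocapsid" call is costly.
    cost = {
        "nucleocapsid": {"nucleocapsid": 0.0, "other": 5.0},
        "other": {"nucleocapsid": 1.0, "other": 0.0},
    }
    best, expected = min_cost_decision(posteriors, cost)
    print(best, expected)
```

With these assumed numbers, the most probable label and the lowest-cost decision differ, which is the kind of situation in which a score-maximizing criterion and a cost-minimizing criterion would disagree.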