## Abstract Because protein identifications rely on matches with sequence databases, highβthroughput proteomics is currently largely restricted to those species for which comprehensive sequence databases are available. The identification of proteins derived from organisms with unsequenced genomes m
A fast algorithm for genome-wide analysis of proteins with repeated sequences
β Scribed by Matteo Pellegrini; Edward M. Marcotte; Todd O. Yeates
- Publisher
- John Wiley and Sons
- Year
- 1999
- Tongue
- English
- Weight
- 126 KB
- Volume
- 35
- Category
- Article
- ISSN
- 0887-3585
No coin nor oath required. For personal study only.
β¦ Synopsis
We present a fast algorithm to search for repeating fragments within protein sequences. The technique is based on an extension of the Smith-Waterman algorithm that allows the calculation of sub-optimal alignments of a sequence against itself. We are able to estimate the statistical significance of all sub-optimal alignment scores. We also rapidly determine the length of the repeating fragment and the number of times it is found in a sequence. The technique is applied to sequences in the Swissprot database, and to 16 complete genomes. We find that eukaryotic proteins contain more internal repeats than those of prokaryotic and archael organisms. The finding that 18% of yeast sequences and 28% of the known human sequences contain detectable repeats emphasizes the importance of internal duplication in protein evolution.
π SIMILAR VOLUMES
## Abstract Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of tra
## Abstract We describe CHASE, a novel algorithm for automated __de novo__ sequencing based on the mass spectrometric (MS) fragmentation analysis of tryptic peptides. This algorithm is used for protein identification from sequence similarity criteria and consists of four steps: (1) derivatization o
A useful method for preparing a synthetic peptide-carrying protein for specific antibody production was established. The monitor peptide is a trypsin-sensitive cholecystokinin-releasing peptide purified from rat pancreatic juice on the basis of its stimulatory activity toward pancreatic enzyme secre