Mass Accuracy and Sequence Requirements for Protein Database Searching
Authors: M. Kirk Green; Murray V. Johnston; Barbara S. Larsen
- Publisher
- Elsevier Science
- Year
- 1999
- Language
- English
- File size
- 472 KB
- Volume
- 275
- Category
- Article
- ISSN
- 0003-2697
Synopsis
To elucidate the role of high mass accuracy in mass spectrometric peptide mapping and database searching, selected proteins were subjected to tryptic digestion and the resulting mixtures were analyzed by electrospray ionization on a 7 Tesla Fourier transform mass spectrometer with a mass accuracy of 1 ppm. Two extreme cases were examined in detail: equine apomyoglobin, which digested easily and gave very few spurious masses, and bovine α-lactalbumin, which, under the conditions used, gave many spurious masses. The effectiveness of accurate mass measurements in minimizing false protein matches was examined by varying the mass error allowed in the search over a wide range (2-500 ppm). For the "clean" data obtained from apomyoglobin, very few masses were needed to return valid protein matches, and the mass error allowed in the search had little effect up to 500 ppm. However, in the case of α-lactalbumin more mass values were needed, and low mass errors increased the search specificity. Mass errors below 30 ppm were particularly useful in eliminating false protein matches when few mass values were used in the search. Collision-induced dissociation of an unassigned peak in the α-lactalbumin digest provided sufficient data to unambiguously identify the peak as a fragment from α-lactalbumin and eliminate a large number of spurious proteins found in the peptide mass search. The results show that even with a relatively high mass error (0.8 Da for mass differences between singly charged product ions), collision-induced dissociation can help identify proteins in cases where unfavorable digest conditions or modifications render digest peaks unidentifiable by a simple mass mapping search.
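The effect of the allowed mass error on a peptide mass search can be illustrated with a minimal Python sketch. This is not the search software used in the article: the function names `ppm_error` and `count_matches` are hypothetical, and the mass values are placeholders chosen only so that the offsets work out to round ppm figures. It assumes a match is declared whenever an observed mass lies within the tolerance of any theoretical peptide mass.

```python
def ppm_error(observed: float, theoretical: float) -> float:
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def count_matches(observed_masses, theoretical_masses, tolerance_ppm):
    """Count observed masses within tolerance_ppm of any theoretical mass."""
    return sum(
        any(abs(ppm_error(obs, theo)) <= tolerance_ppm for theo in theoretical_masses)
        for obs in observed_masses
    )


# Placeholder values for illustration only; these are not data from the article.
theoretical = [1000.0000, 1500.0000, 2000.0000]
# Offsets of roughly 1 ppm, 20 ppm and 250 ppm from the values above.
observed = [1000.0010, 1500.0300, 2000.5000]

# Tolerances spanning the 2-500 ppm range examined in the synopsis.
for tol in (2, 30, 500):
    print(f"{tol:>3} ppm tolerance: {count_matches(observed, theoretical, tol)} match(es)")
```

Widening the tolerance admits more masses as matches. For a real database this includes chance matches to unrelated proteins, which is why tight tolerances matter most when only a few mass values are available.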