Analysis on the Distribution of Bases in 1487 Human Protein Coding Sequences
β Scribed by Chun-Ting Zhang; Yong Zhan
- Publisher
- Elsevier Science
- Year
- 1994
- Tongue
- English
- Weight
- 254 KB
- Volume
- 167
- Category
- Article
- ISSN
- 0022-5193
No coin nor oath required. For personal study only.
β¦ Synopsis
The occurrence frequencies of bases A, C, G and T, denoted by (a, c, g) and (t), respectively, in 1487 human: protein coding sequences have been calculated and analyzed. The analysis has been performed by a diagrammatic method presented recently, in which each coding sequence is represented by a point in 3-D space. The distribution of points gives the observer an overall and intuitive picture of the base frequencies. The distance between a point and the origin of the co-ordinate, which corresponds to the case of (a=c=g=t=1 / 4), is called the radical distance. The radical distribution of 1487 points in 3-D space has been found to be normal, with the center basically coinciding with the origin of the co-ordinate. We have found that among 1487 coding sequences, an empirical rule (a^{2}+c^{2}+g^{2}+t^{2}<1 / 3) holds for 1486 sequences. The only sequence in which the above rule does not hold is the one coding for the human parathymosin protein. The composition of amino acids and the structural class of this protein has been studied in some detail.
π SIMILAR VOLUMES
## Abstract Although Korea is a hepatitis B virus (HBV) endemic area, relatively few fullβlength genome sequences are available. In particular, no comparative analysis has been performed on the fullβgenome sequences of different HBV quasispecies from a single Korean patient. This report describes t
## Abstract The protein kinase gene family is the most frequently mutated in human cancer. Previous work has documented activating mutations in the __KIT__ receptor tyrosine kinase in testicular germβcell tumors (TGCT). To investigate further the potential role of mutated protein kinases in the dev
The Drosophila obscura species group has served as an important model system in many evolutionary and population genetic studies. Despite the amount of study this group has received, some phylogenetic relationships remain unclear. While individual analysis of different nuclear, mitochondrial, allozy
## ~ A new class of disease (including Hunting ton disease, Kennedy disease, and spinocerebellar ataxias types 1 and 3) results from abnormal expansions of CAG trinucleotides in the coding regions of genes. In all of these diseases the CAG repeats are thought to be translated into polyglutamine tr
Toward the goal of recovering the phylogenetic relationships among elapid snakes, we separately found the shortest trees from the amino acid sequences for the venom proteins phospholipase A2 and the short neurotoxin, collectively representing 32 species in 16 genera. We then applied a method we term