Context of deletions and insertions in human coding sequences
โ Scribed by Alexey S. Kondrashov; Igor B. Rogozin
- Publisher
- John Wiley and Sons
- Year
- 2004
- Tongue
- English
- Weight
- 159 KB
- Volume
- 23
- Category
- Article
- ISSN
- 1059-7794
No coin nor oath required. For personal study only.
โฆ Synopsis
Communicated by Mark H. Paalman
We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purinepyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate.
Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications. Hum Mutat 23: 177-185, 2004.
๐ SIMILAR VOLUMES
Most attempts to engineer the properties of proteins have employed single or multiple substitution mutations, which typically produce minor changes in structure. Recent structural and stability studies of insertion and deletion mutants clearly indicate that relatively large structural perturbations
The size distributions of deletions, insertions, and indels (i.e., insertions or deletions) were studied, using 78 human processed pseudogenes and other published data sets. The following results were obtained: (1) Deletions occur more frequently than do insertions in sequence evolution; none of th
We study the length distribution functions for the 16 possible distinct dimeric tandem repeats in DNA sequences of diverse taxonomic partitions of GenBank (known human and mouse genomes, and complete genomes of Caenorhabditis elegans and yeast). For coding DNA, we find that all 16 distribution funct