Similarity analysis of DNA sequences based on the weighted pseudo-entropy
β Scribed by Chun Li; Hong Ma; Yang Zhou; Xiaolei Wang; Xiaoqi Zheng
- Publisher
- John Wiley and Sons
- Year
- 2010
- Tongue
- English
- Weight
- 91 KB
- Volume
- 32
- Category
- Article
- ISSN
- 0192-8651
No coin nor oath required. For personal study only.
β¦ Synopsis
A DNA primary sequence is a string consisting of letters on an alphabet X 5 {a, c, g, t}. Based on all of the 2-combinations of the set X, here the repetition is allowed, we transform a DNA primary sequence into a special sequence over a set with cardinality 10. With the 10-letter sequence, we associate 10 nonnegative numerical sequences and then derive a 10-component vector by means of a weighted pseudo-entropy, which can reflect the information on elements of a sequence and, especially, the order relation among them. The new quantitative characterization of DNA sequences is sensitive to substitution of the string elements. The examination of the relationship among b-globin genes of 15 species illustrates the utility of the proposed approach.
π SIMILAR VOLUMES
## Abstract On the basis of a class of 2D graphical representations of DNA sequences, sensitivity analysis has been performed, showing the highβcapability of the proposed representations to take into account small modifications of the DNA sequences. And sensitivity analysis also indicates that the
We have proposed a structuring method of instructional terms and a clustering method with multiplicity on the basis of a weighted graph. In this paper we propose a consistent sequencing method of instructional terms based on them. Here we consider level and relevancy (degree) as the principle of the
The phylogenetic relationships of callitrichine primates have been determined by DNA sequence analyses of exons 1, 2, and 3 of the Ξ² 2 -microglobulin gene. Parsimony, distance, and maximum likelihood analyses of ca. 900 base pairs of 21 taxa, representing all callitrichine genera, indicated that Sag