## Abstract Association analysis, with the aim of investigating genetic variations, is designed to detect genetic associations with observable traits, which has played an increasing part in understanding the genetic basis of diseases. Among these methods, haplotypeβbased association studies are bel
A novel method for haplotype clustering and visualization
β Scribed by Yik Y. Teo; Kerrin S. Small
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 360 KB
- Volume
- 34
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
In population genetics, it is common to represent the haplotype diversity at a genomic region between multiple populations using wellβconstructed visual representations. This typically requires the chromosomes from these populations to be grouped according to some definition of haplotypic similarity. Here, we introduce a novel algorithm for clustering haplotypes with the specific aim of addressing haplotype diversity within or between populations. The algorithm allows for missing data in the haplotypes and appropriately downweighs single nucleotide polymorphisms with higher extent of missingness. By identifying the canonical haplotypes in a genomic region, defined as the haplotype forms, which most chromosomes are similar to, the algorithm maps each chromosome to either a unique canonical haplotype or as a mosaic of the identified canonical haplotypes. This mapping can subsequently be utilized for producing graphical visualizations of the haplotype clustering for understanding the extent of haplotype diversity in the region. The clustering application has been implemented in R for distribution as haplosim, and we also provide a visualization script hapvisual for graphical display of the clustering results. The outcome of such analysis can be informative in understanding the extent of haplotype diversity between populations, in addressing the reproducibility of established association signals across multiple populations, and also in the investigation of positive selection in the human genome. Genet. Epidemiol. 34:34β41, 2010. Β© 2009 WileyβLiss, Inc.
π SIMILAR VOLUMES
objective reference for users and applications alike in terms Automatic temporal segmentation and visual summary gen-of absolute frame numbers or time codes. The lack of such eration methods that require minimal user interaction are key well-defined generic segments is one of the reasons higher requ
The authors consider thut cluster analysis does not objectivize but represents the biologist's subjectivity as to (1) characters considcred to be significant and to (2) the way of classification. The latter, however, in authors' opiniin, must be specific to the field of application. To this effect s