Fine mapping of disease genes via haplotype clustering
✍ Scribed by E.R.B. Waldron; J.C. Whittaker; D.J. Balding
- Book ID
- 102222529
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 184 KB
- Volume
- 30
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
We propose an algorithm for analysing SNP‐based population association studies, which is a development of that introduced by Molitor et al. [2003: Am J Hum Genet 73:1368–1384]. It uses clustering of haplotypes to overcome the major limitations of many current haplotype‐based approaches. We define a between‐haplotype score that is simple, yet appears to capture much of the information about evolutionary relatedness of the haplotypes in the vicinity of a (unobserved) putative causal locus. Haplotype clusters can then be defined via a putative ancestral haplotype and a cut‐off distance. The number of an individual's two haplotypes that lie within the cluster predicts the individual's genotype at the causal locus. This predicted genotype can then be investigated for association with the phenotype of interest. We implement our approach within a Markov‐chain Monte Carlo algorithm that, in effect, searches over locations and ancestral haplotypes to identify large, case‐rich clusters. The algorithm successfully fine‐maps a causal mutation in a test analysis using real data, and achieves almost 98% accuracy in predicting the genotype at the causal locus. A simulation study indicates that the new algorithm is substantially superior to alternative approaches, and it also allows us to identify situations in which multi‐point approaches can substantially improve over single‐SNP analyses. Our algorithm runs quickly and there is scope for extension to a wide range of disease models and genomic scales. Genet. Epidemiol. 2006. © 2005 Wiley‐Liss, Inc.
📜 SIMILAR VOLUMES
## Abstract We present a novel statistical method for linkage disequilibrium (LD) mapping of disease susceptibility loci in case‐control studies. Such studies exploit the statistical correlation or LD that exist between variants physically close along the genome to identify those that correlate wit
## Abstract Taking advantage of increasingly available high‐density single nucleotide polymorphism (SNP) markers within genes and across genomes, more and more genetic association studies began to use multiple closely linked markers in candidate genes. A practical analytical challenge arising in su