๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A dictionary model for haplotyping, genotype calling, and association testing

โœ Scribed by Kristin L. Ayers; Chiara Sabatti; Kenneth Lange


Publisher
John Wiley and Sons
Year
2007
Tongue
English
Weight
182 KB
Volume
31
Category
Article
ISSN
0741-0395

No coin nor oath required. For personal study only.

โœฆ Synopsis


Abstract

We propose a new method for haplotyping, genotype calling, and association testing based on a dictionary model for haplotypes. In this framework, a haplotype arises as a concatenation of conserved haplotype segments, drawn from a predefined dictionary according to segment specific probabilities. The observed data consist of unphased multimarker genotypes gathered on a random sample of unrelated individuals. These genotypes are subject to mutation, genotyping errors, and missing data. The true pair of haplotypes corresponding to a person's multimarker genotype is reconstructed using a Markov chain that visits haplotype pairs according to their posterior probabilities. Our implementation of the chain alternates Gibbs steps, which rearrange the phase of a single marker, and Metropolis steps, which swap maternal and paternal haplotypes from a given maker onward. Output of the chain include the most likely haplotype pairs, the most likely genotypes at each marker, and the expected number of occurrences of each haplotype segment. Reconstruction accuracy is comparable to that achieved by the best existing algorithms. More importantly, the dictionary model yields expected counts of conserved haplotype segments. These imputed counts can serve as genetic predictors in association studies, as we illustrate by examples on cystic fibrosis, Friedreich's ataxia, and angiotensinโ€I converting enzyme levels. Genet. Epidemiol. ยฉ 2007 Wileyโ€Liss, Inc.


๐Ÿ“œ SIMILAR VOLUMES


Detecting rare variant associations: met
โœ Rita M. Cantor; Marsha Wilcox ๐Ÿ“‚ Article ๐Ÿ“… 2011 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 92 KB ๐Ÿ‘ 1 views

## Abstract We summarize the work done by the contributors to Group 13 at Genetic Analysis Workshop 17 (GAW17) and provide a synthesis of their data analyses. The Group 13 contributors used a variety of approaches to test associations of both rare variants and common singleโ€nucleotide polymorphisms

Single-marker and two-marker association
โœ Sulgi Kim; Nathan J. Morris; Sungho Won; Robert C. Elston ๐Ÿ“‚ Article ๐Ÿ“… 2009 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 170 KB ๐Ÿ‘ 2 views

## Abstract In caseโ€control single nucleotide polymorphism (SNP) data, the allele frequency, Hardy Weinberg Disequilibrium, and linkage disequilibrium (LD) contrast tests are three distinct sources of information about genetic association. While all three tests are typically developed in a retrospe

Case/pseudocontrol analysis in genetic a
โœ Heather J. Cordell; Bryan J. Barratt; David G. Clayton ๐Ÿ“‚ Article ๐Ÿ“… 2004 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 183 KB ๐Ÿ‘ 1 views

## Abstract Estimation and testing of genetic effects (genotype relative risks) are often performed conditionally on parental genotypes, using data from caseโ€parent trios. This strategy avoids having to estimate nuisance parameters such as parental mating type frequencies, and also avoids generatin