Algorithms for inferring haplotypes
✍ Scribed by Tianhua Niu
- Publisher: John Wiley and Sons
- Year: 2004
- Tongue: English
- Weight: 411 KB
- Volume: 27
- Category: Article
- ISSN: 0741-0395
✦ Synopsis
Abstract
Haplotype phase information in diploid organisms offers valuable insight into human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to human diseases. Molecular haplotyping methods are labor‐intensive, low‐throughput, and very costly. By contrast, algorithms based on formal statistical theory have been shown to be very effective and cost‐efficient for haplotype reconstruction. This review covers 1) population‐based haplotype inference methods: Clark's algorithm, the expectation‐maximization (EM) algorithm, coalescence‐based algorithms (pseudo‐Gibbs sampler and perfect/imperfect phylogeny), and the partition‐ligation algorithm implemented by a fully Bayesian model (Haplotyper) or by EM (PLEM); 2) family‐based haplotype inference methods; 3) the handling of genotype scoring uncertainties (i.e., genotyping errors and raw two‐dimensional genotype scatterplots) in inferring haplotypes; and 4) haplotype inference methods for pooled DNA samples. The advantages and limitations of each algorithm are discussed. By using simulations based on empirical data on the G6PD gene and TNFRSF5 gene, I demonstrate that different algorithms have different degrees of sensitivity to varying levels of population diversity and genotyping error rates. Future development of statistical algorithms for haplotype reconstruction will draw increasingly on ideas from combinatorial mathematics, graphical models, and machine learning, and these algorithms will have a profound impact on population genetics and genetic epidemiology with the advent of the human HapMap. © 2004 Wiley‐Liss, Inc.
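To make the EM approach named in the abstract concrete, here is a minimal sketch of EM estimation of haplotype frequencies from unphased genotypes. It is an illustration only, not the implementation reviewed in the article. The function names `compatible_pairs` and `em_haplotype_frequencies`, the 0/1/2 genotype coding, and the assumptions of biallelic SNPs, Hardy-Weinberg equilibrium, and no missing data are all choices made here for the example.

```python
from collections import defaultdict
from itertools import product


def compatible_pairs(genotype):
    """List the unordered haplotype pairs consistent with an unphased genotype.

    genotype: tuple of minor-allele counts (0, 1, or 2), one per SNP
    (an assumed coding for this sketch).
    """
    het = [i for i, g in enumerate(genotype) if g == 1]
    base = [g // 2 for g in genotype]  # homozygous sites are fixed: 0 -> 0, 2 -> 1
    if not het:
        return [(tuple(base), tuple(base))]
    pairs = []
    # Pin the first heterozygous site so each unordered pair is listed once.
    for bits in product((0, 1), repeat=len(het) - 1):
        h1, h2 = list(base), list(base)
        for site, allele in zip(het, (0,) + bits):
            h1[site], h2[site] = allele, 1 - allele
        pairs.append((tuple(h1), tuple(h2)))
    return pairs


def em_haplotype_frequencies(genotypes, n_iter=200, tol=1e-10):
    """Estimate haplotype frequencies by EM, assuming Hardy-Weinberg equilibrium."""
    expansions = [compatible_pairs(g) for g in genotypes]
    haplotypes = sorted({h for pairs in expansions for pair in pairs for h in pair})
    freq = {h: 1.0 / len(haplotypes) for h in haplotypes}  # uniform start
    for _ in range(n_iter):
        counts = defaultdict(float)
        for pairs in expansions:
            # E-step: posterior probability of each phase resolution.
            weights = [freq[a] * freq[b] * (1 if a == b else 2) for a, b in pairs]
            total = sum(weights)
            for (a, b), w in zip(pairs, weights):
                counts[a] += w / total
                counts[b] += w / total
        # M-step: expected haplotype counts divided by the number of chromosomes.
        new_freq = {h: counts[h] / (2 * len(genotypes)) for h in haplotypes}
        change = max(abs(new_freq[h] - freq[h]) for h in haplotypes)
        freq = new_freq
        if change < tol:
            break
    return freq


if __name__ == "__main__":
    # Hypothetical toy sample: two homozygotes and one double heterozygote.
    sample = [(0, 0), (2, 2), (1, 1)]
    for hap, f in em_haplotype_frequencies(sample).items():
        print(hap, round(f, 4))
```

In this toy sample the two homozygous individuals pull the phase of the double heterozygote toward the haplotypes they carry. That is the basic intuition behind population-based inference. The number of candidate phase resolutions doubles with each additional heterozygous site, so this direct enumeration only suits a small number of loci.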
📜 SIMILAR VOLUMES
Inference of haplotypes is important in genetic epidemiology studies. However, all large genotype data sets have errors due to the use of inexpensive genotyping machines that are fallible and shortcomings in genotyping scoring softwares, which can have an enormous impact on haplotype in…
Knowledge of haplotypes is useful for understanding block structure in the genome and disease risk associations. Direct measurement of haplotypes in the absence of family data is presently impractical, and hence, several methods have been developed for reconstructing haplotypes from pop…
We compare bias and power of three methods for haplotype inference on disease risk using unphased genotype data from a case‐control study. We examine the prospective score test of Schaid et al. ([2002] Am. J. Hum. Genet 70:425–434), a novel modification of the prospective estimating equ…
We develop a method that allows inference on parameters in log‐linear models of the relative risk of disease given an individual's haplotypes, that can be used to analyze case‐parent trio data. Our methods are robust to population stratification and can also be used for inference on the…