✦ LIBER ✦

Estimating haplotype-disease associations with pooled genotype data

✍ Scribed by D. Zeng; D.Y. Lin

Publisher: John Wiley and Sons
Year: 2004
Tongue: English
Weight: 282 KB
Volume: 28
Category: Article
ISSN: 0741-0395
DOI: 10.1002/gepi.20040

No coin nor oath required. For personal study only.

✦ Synopsis

The genetic dissection of complex human diseases requires large-scale association studies which explore the population associations between genetic variants and disease phenotypes. DNA pooling can substantially reduce the cost of genotyping assays in these studies, and thus enables one to examine a large number of genetic variants on a large number of subjects. The availability of pooled genotype data instead of individual data poses considerable challenges in the statistical inference, especially in the haplotype-based analysis because of increased phase uncertainty. Here we present a general likelihood-based approach to making inferences about haplotype-disease associations based on possibly pooled DNA data. We consider cohort and case-control studies of unrelated subjects, and allow arbitrary and unequal pool sizes. The phenotype can be discrete or continuous, univariate or multivariate. The effects of haplotypes on disease phenotypes are formulated through flexible regression models, which allow a variety of genetic hypotheses and gene-environment interactions. We construct appropriate likelihood functions for various designs and phenotypes, accommodating Hardy-Weinberg disequilibrium. The corresponding maximum likelihood estimators are approximately unbiased, normally distributed, and statistically efficient. We develop simple and efficient numerical algorithms for calculating the maximum likelihood estimators and their variances, and implement these algorithms in a freely available computer program. We assess the performance of the proposed methods through simulation studies, and provide an application to the Finland-United States Investigation of NIDDM Genetics Study. The results show that DNA pooling is highly efficient in studying haplotype-disease associations. As a by-product, this work provides valid and efficient methods for estimating haplotype-disease associations with unpooled DNA samples.

📜 SIMILAR VOLUMES

MaCH: using sequence and genotype data t

MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes

✍ Yun Li; Cristen J. Willer; Jun Ding; Paul Scheet; Gonçalo R. Abecasis 📂 Article 📅 2010 🏛 John Wiley and Sons 🌐 English ⚖ 508 KB

## Abstract Genome‐wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereo

Haplotype Inference for Population Data

Haplotype Inference for Population Data with Genotyping Errors

✍ Wensheng Zhu; Anthony Y. C. Kuk; Jianhua Guo 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 265 KB

## Abstract Inference of haplotypes is important in genetic epidemiology studies. However, all large genotype data sets have errors due to the use of inexpensive genotyping machines that are fallible and shortcomings in genotyping scoring softwares, which can have an enormous impact on haplotype in

Resequencing of pooled DNA for detecting

Resequencing of pooled DNA for detecting disease associations with rare variants

✍ Tao Wang; Chang-Yun Lin; Thomas E. Rohan; Kenny Ye 📂 Article 📅 2010 🏛 John Wiley and Sons 🌐 English ⚖ 340 KB 👁 1 views

## Abstract A combination of common and rare variants is thought to contribute to genetic susceptibility to complex diseases. Recently, next‐generation sequencers have greatly lowered sequencing costs, providing an opportunity to identify rare disease variants in large genetic epidemiology studies.

Streamlined analysis of pooled genotype

Streamlined analysis of pooled genotype data in SNP-based association studies

✍ Valentina Moskvina; Nadine Norton; Nigel Williams; Peter Holmans; Michael Owen; 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 130 KB 👁 1 views

## Abstract Several groups have developed methods for estimating allele frequencies in DNA pools as a fast and cheap way for detecting allelic association between genetic markers and disease. To obtain accurate estimates of allele frequencies, a correction factor __k__ for the degree to which measu

Haplotype association analysis for late

Haplotype association analysis for late onset diseases using nuclear family data

✍ Chun Li; Michael Boehnke 📂 Article 📅 2006 🏛 John Wiley and Sons 🌐 English ⚖ 208 KB 👁 1 views

## Abstract In haplotype‐based association studies for late onset diseases, one attractive design is to use available unaffected spouses as controls (Valle et al. [1998] Diab. Care 21:949–958). Given cases and spouses only, the standard expectation‐maximization (EM) algorithm (Dempster et al. [1977

Inference on haplotype/disease associati

Inference on haplotype/disease association using parent-affected-child data: the projection conditional on parental haplotypes method

✍ Andrew S. Allen; Glen A. Satten 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 181 KB

## Abstract We develop a method that allows inference on parameters in log‐linear models of the relative risk of disease given an individual's haplotypes, that can be used to analyze case‐parent trio data. Our methods are robust to population stratification and can also be used for inference on the