✦ LIBER ✦

Haplotype analysis in the presence of informatively missing genotype data

✍ Scribed by Nianjun Liu; Isabel Beerman; Richard Lifton; Hongyu Zhao

Publisher: John Wiley and Sons
Year: 2006
Tongue: English
Weight: 231 KB
Volume: 30
Category: Article
ISSN: 0741-0395
DOI: 10.1002/gepi.20144

No coin nor oath required. For personal study only.

✦ Synopsis

Abstract

It is common to have missing genotypes in practical genetic studies, but the exact underlying missing data mechanism is generally unknown to the investigators. Although some statistical methods can handle missing data, they usually assume that genotypes are missing at random, that is, at a given marker, different genotypes and different alleles are missing with the same probability. These include those methods on haplotype frequency estimation and haplotype association analysis. However, it is likely that this simple assumption does not hold in practice, yet few studies to date have examined the magnitude of the effects when this simplifying assumption is violated. In this study, we demonstrate that the violation of this assumption may lead to serious bias in haplotype frequency estimates, and haplotype association analysis based on this assumption can induce both false‐positive and false‐negative evidence of association. To address this limitation in the current methods, we propose a general missing data model to characterize missing data patterns across a set of two or more markers simultaneously. We prove that haplotype frequencies and missing data probabilities are identifiable if and only if there is linkage disequilibrium between these markers under our general missing data model. Simulation studies on the analysis of haplotypes consisting of two single nucleotide polymorphisms illustrate that our proposed model can reduce the bias both for haplotype frequency estimates and association analysis due to incorrect assumption on the missing data mechanism. Finally, we illustrate the utilities of our method through its application to a real data set. Genet. Epidemiol. 2006. © 2006 Wiley‐Liss, Inc.

📜 SIMILAR VOLUMES

MISSING CAUSE OF DEATH INFORMATION IN TH

MISSING CAUSE OF DEATH INFORMATION IN THE ANALYSIS OF SURVIVAL DATA

✍ JANET ANDERSEN; ELS GOETGHEBEUR; LOUISE RYAN 📂 Article 📅 1996 🏛 John Wiley and Sons 🌐 English ⚖ 699 KB

Goetghebeur and Ryan proposed a method for proportional hazards analyses of competing risks failuretime data when the failure type is missing for some cases. This paper evaluates the properties of the method using data from a clinical trial in Hodgkin's disease. We generated several patterns of miss

Testing Equality of Means in the Presenc

Testing Equality of Means in the Presence of Correlation and Missing Data

✍ Prof. Dinesh S. Bhoj 📂 Article 📅 1991 🏛 John Wiley and Sons 🌐 English ⚖ 492 KB 👁 3 views

Ascertainment considerations in the anal

Ascertainment considerations in the analysis of affected sib shared haplotype data

✍ W. J. Ewens; N. C. E. Shute; Nancy J. Cox; R. Arlen Price; Richard S. Spielman 📂 Article 📅 1986 🏛 John Wiley and Sons 🌐 English ⚖ 226 KB

Direct analysis of unphased SNP genotype

Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes

✍ Andrew P. Morris 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 212 KB 👁 1 views

We describe a novel method for assessing the strength of disease association with single nucleotide polymorphisms (SNPs) in a candidate gene or small candidate region, and for estimating the corresponding haplotype relative risks of disease, using unphased genotype data directly. We begin by estimat

Assessing environmental stressors via Ba

Assessing environmental stressors via Bayesian Model Averaging in the presence of missing data

✍ E. L. Boone; K. Ye; E. P. Smith 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 121 KB 👁 1 views

On the advantage of haplotype analysis i

On the advantage of haplotype analysis in the presence of multiple disease susceptibility alleles

✍ Richard W. Morris; Norman L. Kaplan 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 193 KB

## Abstract We investigated the effect of multiple susceptibility alleles at a single disease locus on the statistical power of a likelihood ratio test to detect association between alleles at a marker locus and a disease phenotype in a case‐control design. Using simplifying assumptions to obtain t