## Abstract We propose optimized two‐stage designs for genome‐wide case‐control association studies, using a hypothesis testing paradigm. To save genotyping costs, the complete marker set is genotyped in a sub‐sample only (stage I). On stage II, the most promising markers are then genotyped in the
Optimal designs for two-stage genome-wide association studies
✍ Scribed by Andrew D. Skol; Laura J. Scott; Gonçalo R. Abecasis; Michael Boehnke
- Publisher
- John Wiley and Sons
- Year
- 2007
- Tongue
- English
- Weight
- 316 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
Genome‐wide association (GWA) studies require genotyping hundreds of thousands of markers on thousands of subjects, and are expensive at current genotyping costs. To conserve resources, many GWA studies are adopting a staged design in which a proportion of the available samples are genotyped on all markers in stage 1, and a proportion of these markers are genotyped on the remaining samples in stage 2. We describe a strategy for designing cost‐effective two‐stage GWA studies. Our strategy preserves much of the power of the corresponding one‐stage design and minimizes the genotyping cost of the study while allowing for differences in per genotyping cost between stages 1 and 2. We show that the ratio of stage 2 to stage 1 per genotype cost can strongly influence both the optimal design and the genotyping cost of the study. Increasing the stage 2 per genotype cost shifts more of the genotyping and study cost to stage 1, and increases the cost of the study. This higher cost can be partially mitigated by adopting a design with reduced power while preserving the false positive rate or by increasing the false positive rate while preserving power. For example, reducing the power preserved in the two‐stage design from 99 to 95% that of the one‐stage design decreases the two‐stage study cost by ∼15%. Alternatively, the same cost savings can be had by relaxing the false positive rate by 2.5‐fold, for example from 1/300,000 to 2.5/300,000, while retaining the same power. Genet. Epidemiol. 2007. © 2007 Wiley‐Liss, Inc.
📜 SIMILAR VOLUMES
## Abstract The much‐anticipated fixed‐array, genome‐wide SNP genotyping technologies make large‐scale genome‐wide association scans now possible for large numbers of subjects. In this paper we reconsider the problem (Satagopan and Elston [2003] Genet Epidemiol 25:149–157) of optimizing a two‐stage
## Abstract We consider two‐stage case‐control designs for testing associations between single nucleotide polymorphisms (SNPs) and disease, in which a subsample of subjects is used to select a panel of “tagging” SNPs that will be considered in the main study. We propose a pseudolikelihood [Pepe and
Meta-analysis of genome-wide association studies involves testing single nucleotide polymorphisms (SNPs) using summary statistics that are weighted sums of site-specific score or Wald statistics. This approach avoids having to pool individual-level data. We describe the weights that maximize the pow
## Abstract Genome‐wide association studies of discrete traits generally use simple methods of analysis based on χ^2^ tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that anal
## Abstract With reductions in genotyping costs and the fast pace of improvements in genotyping technology, it is not uncommon for the individuals in a single study to undergo genotyping using several different platforms, where each platform may contain different numbers of markers selected via dif