𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Optimal selection of markers for validation or replication from genome-wide association studies

✍ Scribed by Celia M.T. Greenwood; Jagadish Rangrej; Lei Sun


Publisher
John Wiley and Sons
Year
2007
Tongue
English
Weight
261 KB
Volume
31
Category
Article
ISSN
0741-0395

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

With reductions in genotyping costs and the fast pace of improvements in genotyping technology, it is not uncommon for the individuals in a single study to undergo genotyping using several different platforms, where each platform may contain different numbers of markers selected via different criteria. For example, a set of cases and controls may be genotyped at markers in a small set of carefully selected candidate genes, and shortly thereafter, the same cases and controls may be used for a genome‐wide single nucleotide polymorphism (SNP) association study. After such initial investigations, often, a subset of “interesting” markers is selected for validation or replication. Specifically, by validation, we refer to the investigation of associations between the selected subset of markers and the disease in independent data. However, it is not obvious how to choose the best set of markers for this validation. There may be a prior expectation that some sets of genotyping data are more likely to contain real associations. For example, it may be more likely for markers in plausible candidate genes to show disease associations than markers in a genome‐wide scan. Hence, it would be desirable to select proportionally more markers from the candidate gene set. When a fixed number of markers are selected for validation, we propose an approach for identifying an optimal marker‐selection configuration by basing the approach on minimizing the stratified false discovery rate. We illustrate this approach using a case‐control study of colorectal cancer from Ontario, Canada, and we show that this approach leads to substantial reductions in the estimated false discovery rates in the Ontario dataset for the selected markers, as well as reductions in the expected false discovery rates for the proposed validation dataset. Genet. Epidemiol. 2007. © 2007 Wiley‐Liss, Inc.


📜 SIMILAR VOLUMES


Optimal methods for meta-analysis of gen
✍ Baiyu Zhou; Jianxin Shi; Alice S. Whittemore 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 168 KB 👁 2 views

Meta-analysis of genome-wide association studies involves testing single nucleotide polymorphisms (SNPs) using summary statistics that are weighted sums of site-specific score or Wald statistics. This approach avoids having to pool individual-level data. We describe the weights that maximize the pow

Hierarchical Bayes prioritization of mar
✍ Juan Pablo Lewinger; David V. Conti; James W. Baurley; Timothy J. Triche; Duncan 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 429 KB 👁 1 views

## Abstract We describe a hierarchical regression modeling approach to selection of a subset of markers from the first stage of a genomewide association scan to carry forward to subsequent stages for testing on an independent set of subjects. Rather than simply selecting a subset of most significan

Replication of previous genome-wide asso
✍ Shoji Ichikawa; Daniel L Koller; Leah R Padgett; Dongbing Lai; Siu L Hui; Munro 📂 Article 📅 2010 🏛 American Society for Bone and Mineral Research 🌐 English ⚖ 371 KB

Bone mineral density (BMD) achieved during young adulthood (peak BMD) is one of the major determinants of osteoporotic fracture in later life. Genetic variants associated with BMD have been identified by three recent genome-wide association studies. The most significant single-nucleotide polymorphis

Shrinkage estimation for robust and effi
✍ Sheng Luo; Bhramar Mukherjee; Jinbo Chen; Nilanjan Chatterjee 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 160 KB 👁 1 views

## Abstract Population‐based case‐control design has become one of the most popular approaches for conducting genome‐wide association scans for rare diseases like cancer. In this article, we propose a novel method for improving the power of the widely used single‐single‐nucleotide polymorphism (SNP