Data mining and computationally intensive methods: Summary of Group 7 contributions to Genetic Analysis Workshop 13

✍ Scribed by Tracy J. Costello; Catherine T. Falk; Kenny Q. Ye

Publisher: John Wiley and Sons
Year: 2003
Tongue: English
Weight: 85 KB
Volume: 25
Category: Article
ISSN: 0741-0395
DOI: 10.1002/gepi.10285

✦ Synopsis

The Framingham Heart Study data, as well as a related simulated data set, were generously provided to the participants of the Genetic Analysis Workshop 13 in order that newly developed and emerging statistical methodologies could be tested on that well-characterized data set. The impetus driving the development of novel methods is to elucidate the contributions of genes, environment, and interactions between and among them, as well as to allow comparison between and validation of methods. The seven papers that comprise this group used data-mining methodologies (tree-based methods, neural networks, discriminant analysis, and Bayesian variable selection) in an attempt to identify the underlying genetics of cardiovascular disease and related traits in the presence of environmental and genetic covariates. Data-mining strategies are gaining popularity because they are extremely flexible and may have greater efficiency and potential in identifying the factors involved in complex disorders. While the methods grouped together here constitute a diverse collection, some papers asked similar questions with very different methods, while others used the same underlying methodology to ask very different questions. This paper briefly describes the data-mining methodologies applied to the Genetic Analysis Workshop 13 data sets and the results of those investigations.

📜 SIMILAR VOLUMES

Data mining of RNA expression and DNA genotype data: Presentation Group 5 contributions to Genetic Analysis Workshop 15

✍ Catherine T. Falk; Stephen J. Finch; Wonkuk Kim; Nitai D. Mukhopadhyay 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 137 KB

The complexity of data available in human genetics continues to grow at an explosive rate. With that growth, the challenges to understanding the meaning of the underlying information also grow. A currently popular approach to dissecting such information falls under the broad category of data mining.

Model selection and Bayesian methods in statistical genetics: Summary of Group 11 contributions to Genetic Analysis Workshop 15

✍ Michael D. Swartz; Duncan C. Thomas; E. Warwick Daw 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 133 KB

The research presented in group 11 of the Genetic Analysis Workshop 15 (GAW15) falls into two major themes: Model selection approaches for gene mapping (both Bayesian and Frequentist); and other Bayesian methods. These methods either allow relaxation of some of the common assumptions, such as mode o

Multistage analysis strategies for genome-wide association studies: summary of group 3 contributions to Genetic Analysis Workshop 16

✍ Rosalind J. Neuman; Yun Ju Sung 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 78 KB

This contribution summarizes the work done by six independent teams of investigators to identify the genetic and nongenetic variants that work together or independently to predispose to disease. The theme addressed in these studies is multistage strategies in the context of genome-wide association s

Linkage mapping methods applied to the COGA data set: Presentation Group 4 of Genetic Analysis Workshop 14

✍ E. Warwick Daw; Betty Q. Doan; Robert C. Elston 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 108 KB

Presentation Group 4 participants analyzed the Collaborative Study on the Genetics of Alcoholism data provided for Genetic Analysis Workshop 14. This group examined various aspects of linkage analysis and related issues. Seven papers included linkage analyses, while the eighth calculated identity-by

Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17

✍ Nathan Tintle; Hugues Aschard; Inchi Hu; Nora Nock; Haitian Wang; Elizabeth Pugh 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 85 KB

As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype‐phenotype association in next‐generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next‐generation se