✦ LIBER ✦

Brief review of regression-based and machine learning methods in genetic epidemiology: the Genetic Analysis Workshop 17 experience

✍ Scribed by Abhijit Dasgupta; Yan V. Sun; Inke R. König; Joan E. Bailey-Wilson; James D. Malley

Publisher: John Wiley and Sons
Year: 2011
Tongue: English
Weight: 120 KB
Volume: 35
Category: Article
ISSN: 0741-0395
DOI: 10.1002/gepi.20642

No coin nor oath required. For personal study only.

✦ Synopsis

Abstract

Genetics Analysis Workshop 17 provided common and rare genetic variants from exome sequencing data and simulated binary and quantitative traits in 200 replicates. We provide a brief review of the machine learning and regression‐based methods used in the analyses of these data. Several regression and machine learning methods were used to address different problems inherent in the analyses of these data, which are high‐dimension, low‐sample‐size data typical of many genetic association studies. Unsupervised methods, such as cluster analysis, were used for data segmentation and, subset selection. Supervised learning methods, which include regression‐based methods (e.g., generalized linear models, logic regression, and regularized regression) and tree‐based methods (e.g., decision trees and random forests), were used for variable selection (selecting genetic and clinical features most associated or predictive of outcome) and prediction (developing models using common and rare genetic variants to accurately predict outcome), with the outcome being case‐control status or quantitative trait value. We include a discussion of cross‐validation for model selection and assessment, and a description of available software resources for these methods. Genet. Epidemiol. 35:S5–S11, 2011. © 2011 Wiley Periodicals, Inc.

📜 SIMILAR VOLUMES

Regression and data mining methods for a

Regression and data mining methods for analyses of multiple rare variants in the Genetic Analysis Workshop 17 mini-exome data

✍ Joan E. Bailey-Wilson; Jennifer S. Brennan; Shelley B. Bull; Robert Culverhouse; 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 124 KB 👁 1 views

## Abstract Group 14 of Genetic Analysis Workshop 17 examined several issues related to analysis of complex traits using DNA sequence data. These issues included novel methods for analyzing rare genetic variants in an aggregated manner (often termed collapsing rare variants), evaluation of various