Bayesian variable selection for survival regression in genetics
✍ Scribed by Ioanna Tachmazidou; Michael R. Johnson; Maria De Iorio
- Publisher
- John Wiley and Sons
- Year
- 2010
- Tongue
- English
- Weight
- 224 KB
- Volume
- 34
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
Variable selection in regression with very big numbers of variables is challenging both in terms of model specification and computation. We focus on genetic studies in the field of survival, and we present a Bayesian‐inspired penalized maximum likelihood approach appropriate for high‐dimensional problems. In particular, we employ a simple, efficient algorithm that seeks maximum a posteriori (MAP) estimates of regression coefficients. The latter are assigned a Laplace prior with a sharp mode at zero, and non‐zero posterior mode estimates correspond to significant single nucleotide polymorphisms (SNPs). Using the Laplace prior reflects a prior belief that only a small proportion of the SNPs significantly influence the response. The method is fast and can handle datasets arising from imputation or resequencing. We demonstrate the localization performance, power and false‐positive rates of our method in large simulation studies of dense‐SNP datasets and sequence data, and we compare the performance of our method to the univariate Cox regression and to a recently proposed stochastic search approach. In general, we find that our approach improves localization and power slightly, while the biggest advantage is in false‐positive counts and computing times. We also apply our method to a real prospective study, and we observe potential association between candidate ABC transporter genes and epilepsy treatment outcomes. Genet. Epidemiol. 34:689–701, 2010. © 2010 Wiley‐Liss, Inc.
📜 SIMILAR VOLUMES
## Abstract Variable selection is growing in importance with the advent of high throughput genotyping methods requiring analysis of hundreds to thousands of single nucleotide polymorphisms (SNPs) and the increased interest in using these genetic studies to better understand common, complex diseases
The Cox proportional hazards model restricts the hazard ratio to be linear in the covariates. A survival model based on data from a clinical trial is developed using spline functions with variable knots to estimate the log hazard function. Moreover, the main point of the method is that a knot, seen
Evolutionary and genetic algorithms are powerful tools for searching global optima of complex functions. An evolutionary approach, the MUSEUM (mutation and selection uncover models) programme, is applied to various QSAR data sets to prove the general applicability of this approach for variable selec