In anticipation of the availability of next-generation sequencing data, there has been increasing interest in association analysis of rare variants (RVs). Owing to the extremely low frequency of an RV, single variant-based analysis and many existing tests developed for common variants may not be suit
Comparison of statistical tests for disease association with rare variants
Scribed by Saonli Basu; Wei Pan
- Publisher: John Wiley and Sons
- Year: 2011
- Tongue: English
- Weight: 173 KB
- Volume: 35
- Category: Article
- ISSN: 0741-0395
Synopsis
In anticipation of the availability of next-generation sequencing data, there is increasing interest in investigating association between complex traits and rare variants (RVs). Because RVs have low frequencies, common wisdom suggests that existing statistical tests for common variants (CVs) might not work, which has motivated the recent development of several new tests for RVs, most of them based on the idea of pooling/collapsing RVs. However, there is a lack of evaluations of, and thus guidance on the use of, existing tests. Here we provide a comprehensive comparison of various statistical tests using simulated data. We consider both independent and correlated rare mutations, and representative tests for both CVs and RVs. As expected, if there are no or few non-causal (i.e. neutral or non-associated) RVs in a locus of interest and the effects of causal RVs on the trait are all (or mostly) in the same direction (i.e. either protective or deleterious, but not both), then the simple pooled association tests (without selecting RVs and their association directions) and a new test called kernel-based adaptive clustering (KBAC) perform similarly and are most powerful. KBAC is more robust than simple pooled association tests in the presence of non-causal RVs. However, as the number of non-causal RVs increases and/or in the presence of opposite association directions, the winners are two methods originally proposed for CVs and a new test, the C-alpha test, proposed for RVs, each of which can be regarded as testing a variance component in a random-effects model. Interestingly, several methods based on sequential model selection (i.e. selecting causal RVs and their association directions), including two new methods proposed here, perform robustly and often have statistical power between those of the above two classes.
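The contrast the synopsis draws between pooled tests and variance-component tests can be illustrated with a toy example. The sketch below is not the authors' simulation setup or code. It assumes a simple carrier-based collapsing statistic for the pooled test, the commonly cited form of the C-alpha statistic, T = Σ_j [(y_j − n_j p0)² − n_j p0(1 − p0)], and a permutation p-value chosen here for simplicity. All function names, variable names, and the toy data are hypothetical.

```python
import random

def burden_stat(genotypes, phenotype):
    """Pooled (collapsing) statistic: absolute difference in the proportion
    of rare-variant carriers between cases and controls."""
    carriers = [1 if any(row) else 0 for row in genotypes]
    n_case = sum(phenotype)
    n_ctrl = len(phenotype) - n_case
    case_carriers = sum(c for c, y in zip(carriers, phenotype) if y == 1)
    ctrl_carriers = sum(c for c, y in zip(carriers, phenotype) if y == 0)
    return abs(case_carriers / n_case - ctrl_carriers / n_ctrl)

def c_alpha_stat(genotypes, phenotype):
    """C-alpha-style statistic (assumed form): per-variant excess of the
    squared deviation of case counts over its binomial variance."""
    p0 = sum(phenotype) / len(phenotype)  # proportion of cases
    total = 0.0
    for j in range(len(genotypes[0])):
        n_j = sum(row[j] for row in genotypes)  # copies of variant j overall
        y_j = sum(row[j] for row, y in zip(genotypes, phenotype) if y == 1)  # copies in cases
        total += (y_j - n_j * p0) ** 2 - n_j * p0 * (1 - p0)
    return total

def permutation_pvalue(stat, genotypes, phenotype, n_perm=1000, seed=0):
    """One-sided permutation p-value obtained by shuffling case/control labels."""
    rng = random.Random(seed)
    observed = stat(genotypes, phenotype)
    shuffled = list(phenotype)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if stat(genotypes, shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Toy data (hypothetical): 100 cases followed by 100 controls, two rare variants.
n_cases, n_controls = 100, 100
phenotype = [1] * n_cases + [0] * n_controls
genotypes = [[0, 0] for _ in phenotype]
for i in range(10):
    genotypes[i][0] = 1            # variant 0: carried by 10 cases only (deleterious)
    genotypes[n_cases + i][1] = 1  # variant 1: carried by 10 controls only (protective)

print("pooled statistic:", burden_stat(genotypes, phenotype))
print("pooled p-value:", permutation_pvalue(burden_stat, genotypes, phenotype))
print("C-alpha statistic:", c_alpha_stat(genotypes, phenotype))
print("C-alpha p-value:", permutation_pvalue(c_alpha_stat, genotypes, phenotype))
```

In this constructed case the two variants have opposite effects, so the carrier proportions are identical in cases and controls and the pooled statistic is zero, while the C-alpha-style statistic accumulates both deviations because each is squared before summing. Real analyses would also need to handle covariates, genotype coding, and variant weighting, none of which is covered here.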
SIMILAR VOLUMES
## Abstract A combination of common and rare variants is thought to contribute to genetic susceptibility to complex diseases. Recently, next-generation sequencers have greatly lowered sequencing costs, providing an opportunity to identify rare disease variants in large genetic epidemiology studies.
## Abstract Genome-wide association studies succeeded in finding genetic variants associated with various phenotypes, but a large portion of the predicted genetic contribution to many traits remains unknown. One plausible explanation is that some missing variation is due to rare variants. Latest se
## Abstract Association mapping in linked regions is a current major approach for the identification of genes for complex diseases. Loci contributing to linkage, even with small values of sibling recurrence risk (λ~s~), may be equivalent to substantial underlying genetic effects for association stu
## Abstract Linkage studies have suggested a susceptibility locus for late-onset Alzheimer's disease (LOAD) on chromosome 21. A functional candidate gene in this region is the β-amyloid precursor protein (APP) gene. Previously, coding mutations in APP have been associated with early onset Alzheimer
Genetic association studies for binary diseases are designed as case-control studies: the cases are those affected with the primary disease and the controls are free of the disease. At the time of case-control collection, information about secondary phenotypes is also collected. Association studies