## Abstract The identification and characterization of genes that influence the risk of common, complex multifactorial diseases, primarily through interactions with other genes and other environmental factors, remains a statistical and computational challenge in genetic epidemiology. This challenge
Identification of gene-gene interactions in the presence of missing data using the multifactor dimensionality reduction method
โ Scribed by Junghyun Namkung; Robert C. Elston; Jun-Mo Yang; Taesung Park
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 559 KB
- Volume
- 33
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Geneโgene interaction is believed to play an important role in understanding complex traits. Multifactor dimensionality reduction (MDR) was proposed by Ritchie et al. [2001. Am J Hum Genet 69:138โ147] to identify multiple loci that simultaneously affect disease susceptibility. Although the MDR method has been widely used to detect geneโgene interactions, few studies have been reported on MDR analysis when there are missing data. Currently, there are four approaches available in MDR analysis to handle missing data. The first approach uses only complete observations that have no missing data, which can cause a severe loss of data. The second approach is to treat missing values as an additional genotype category, but interpretation of the results may then be not clear and the conclusions may be misleading. Furthermore, it performs poorly when the missing rates are unbalanced between the case and control groups. The third approach is a simple imputation method that imputes missing genotypes as the most frequent genotype, which may also produce biased results. The fourth approach, Available, uses all data available for the given loci to increase power. In any real data analysis, it is not clear which MDR approach one should use when there are missing data. In this article, we consider a new EM Impute approach to handle missing data more appropriately. Through simulation studies, we compared the performance of the proposed EM Impute approach with the current approaches. Our results showed that Available and EM Impute approaches perform better than the three other current approaches in terms of power and precision. Genet. Epidemiol. 33:646โ656, 2009. ยฉ 2009 WileyโLiss, Inc.
๐ SIMILAR VOLUMES
## Abstract Complex diseases are likely to be caused by the interplay of genetic and environmental factors. Despite this, geneโdisease associations are frequently investigated using models that focus solely on a marginal gene effect, ignoring environmental factors entirely. Failing to take into acc
## Abstract ## BACKGROUND Epidemiological investigations have begun to consider geneโenvironment (GE) interactions as potential risk factors for many diseases, including several different birth defects. However, traditional methodological approaches for the analysis of caseโcontrol data tend to ha