When a rectangular multivariate data set contains missing values, missing data imputation using the multivariate \(t\) distribution appears potentially useful, especially for robust inferences. An efficient technique, called the monotone data augmentation algorithm, for implementing missing data imp
Missing phenotype data imputation in pedigree data analysis
โ Scribed by Brooke L. Fridley; Mariza de Andrade
- Publisher
- John Wiley and Sons
- Year
- 2008
- Tongue
- English
- Weight
- 164 KB
- Volume
- 32
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Mapping complex traits or phenotypes with small genetic effects, whose phenotypes may be modulated by temporal trends in families are challenging. Detailed and accurate data must be available on families, whether or not the data were collected over time. Missing data complicate matters in pedigree analysis, especially in the case of a longitudinal pedigree analysis. Because most analytical methods developed for the analysis of longitudinal pedigree data require no missing data, the researcher is left with the option of dropping those cases (individuals) with missing data from the analysis or imputing values for the missing data. We present the use of data augmentation within Bayesian polygenic and longitudinal polygenic models to produce k complete datasets. The data augmentation, or imputation step of the Markov chain Monte Carlo, takes into account the observed familial information and the observed subject information available at other time points. These k complete datasets can then be used to fit single time point or longitudinal pedigree models. By producing a set of k complete datasets and thus k sets of parameter estimates, the total variance associated with an estimate can be partitioned into a withinโimputation and a betweenโimputation component. The method is illustrated using the Genetic Analysis Workshop simulated data. Genet. Epidemiol. 2007. ยฉ 2007 WileyโLiss, Inc.
๐ SIMILAR VOLUMES
Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics
Longitudinal family studies provide a valuable resource for investigating genetic and environmental factors that influence long-term averages and changes over time in a complex trait. This paper summarizes 13 contributions to Genetic Analysis Workshop 13, which include a wide range of methods for ge