Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics
Summary report: Missing data and pedigree and genotyping errors
โ Scribed by Michael D. Badzioch; Duncan C. Thomas; Gail P. Jarvik
- Publisher
- John Wiley and Sons
- Year
- 2003
- Tongue
- English
- Weight
- 82 KB
- Volume
- 25
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
โฆ Synopsis
Genetic epidemiology is faced with mapping complex traits to genes with relatively small effects whose phenotypes may be modulated by temporal factors. To do this, detailed and accurate data must be available on families, perhaps collected over time. The Framingham Heart Study data supplied to Genetic Analysis Workshop 13 (GAW13), along with its simulated counterpart, contain longitudinal measurements and genomic scan data on 2,885 individuals in 330 families, and offer an opportunity to examine data quality and completeness issues as they affect analytical conclusions. Six GAW13 contributions applied methods to deal with missing data, both phenotypic and genotypic, at a single time point and longitudinally, and with possible errors in pedigree structure and genotypes. The methods included missing phenotypic data imputation by Markov chain Monte Carlo sampling, propensity scoring, regression, and adjusted mean values, as well as the assessment of transmission-disequilibrium tests when missing marker data may be allele-specific. Pedigree structural errors were found by genome-wide allele-sharing probabilities, while Mendelian consistent genotype errors were evaluated through likelihoods of double-recombination events. Each of the methods reviewed here offered insights into how to better take advantage of large, time-dependent, familial data sets. However, no one of them dealt with the longitudinal and familial aspects simultaneously. Overall, more consideration needs to be given to the effects that missing data and data errors have on our ability to map complex traits efficiently and accurately.
๐ SIMILAR VOLUMES
## Abstract Mapping complex traits or phenotypes with small genetic effects, whose phenotypes may be modulated by temporal trends in families are challenging. Detailed and accurate data must be available on families, whether or not the data were collected over time. Missing data complicate matters
## Abstract Inference of haplotypes is important in genetic epidemiology studies. However, all large genotype data sets have errors due to the use of inexpensive genotyping machines that are fallible and shortcomings in genotyping scoring softwares, which can have an enormous impact on haplotype in
## Abstract Because most multipoint linkage analysis programs currently assume linkage equilibrium between markers when inferring parental haplotypes, ignoring linkage disequilibrium (LD) may inflate the Type I error rate. We investigated the effect of LD on the Type I error rate and power of nonpa
The development of a new method for testing the association of genetic markers with disease is presented. This approach is applicable when sampling nuclear families with one or more affected siblings and when neither, one, or both parents are missing marker genotype data. All siblings, affected and