𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Statistical inference of allelic imbalance from transcriptome data

✍ Scribed by Michael Nothnagel; Andreas Wolf; Alexander Herrmann; Karol Szafranski; Inga Vater; Mario Brosch; Klaus Huse; Reiner Siebert; Matthias Platzer; Jochen Hampe; Michael Krawczak


Publisher
John Wiley and Sons
Year
2010
Tongue
English
Weight
224 KB
Volume
32
Category
Article
ISSN
1059-7794

No coin nor oath required. For personal study only.

✦ Synopsis


Next-generation sequencing and the availability of high-density genotyping arrays have facilitated an analysis of somatic and meiotic mutations at unprecedented level, but drawing sensible conclusions about the functional relevance of the detected variants still remains a formidable challenge. In this context, the study of allelic imbalance in intermediate RNA phenotypes may prove a useful means to elucidate the likely effects of DNA variants of unknown significance. We developed a statistical framework for the assessment of allelic imbalance in next-generation transcriptome sequencing (RNA-seq) data that requires neither an expression reference nor the underlying nuclear genotype(s), and that allows for allele miscalls. Using extensive simulation as well as publicly available wholetranscriptome data from European-descent individuals in HapMap, we explored the power of our approach in terms of both genotype inference and allelic imbalance assessment under a wide range of practically relevant scenarios. In so doing, we verified a superior performance of our methodology, particularly at low sequencing coverage, compared to the more simplistic approach of completely ignoring allele miscalls. Because the proposed framework can be used to assess somatic mutations and allelic imbalance in one and the same set of RNA-seq data, it will be particularly useful for the analysis of somatic genetic variation in cancer studies.


πŸ“œ SIMILAR VOLUMES


Statistical inference from multiply cens
✍ A. H. El-Shaarawi; A. Naderi πŸ“‚ Article πŸ“… 1991 πŸ› Springer 🌐 English βš– 346 KB

Maximum likelihood estimation for multiply censored samples are discussed. Approximate confidence intervals for the lognormal mean are obtained using both Taylor expansion method and direct method. It is shown that the direct method performs noticeably better than the Taylor expansion method. Simula

Inferring Three-Dimensional Crack Statis
✍ John K. Dienes πŸ“‚ Article πŸ“… 2000 πŸ› Elsevier Science 🌐 English βš– 67 KB

In rock mechanics it is often assumed that the number of cracks whose size yc r c exceeds c is given by the exponential N e . It is difficult, however, to examine 0

On the statistical analysis of allelic-l
✍ Michael A. Newton; Michael N. Gould; Catherine A. Reznikoff; Jill D. Haag πŸ“‚ Article πŸ“… 1998 πŸ› John Wiley and Sons 🌐 English βš– 200 KB πŸ‘ 2 views

This paper concerns the statistical analysis of certain binary data arising in molecular studies of cancer. In allelic-loss experiments, tumour cell genomes are analysed at informative molecular marker loci to identify deleted chromosomal regions. The resulting binary data are used to infer properti

Statistical downscaling with Bayesian in
✍ Toshichika Iizumi; Motoki Nishimori; Masayuki Yokozawa; Akihiko Kotera; Nguyen D πŸ“‚ Article πŸ“… 2011 πŸ› John Wiley and Sons 🌐 English βš– 468 KB

## Abstract Daily global solar radiation (SR) is one of essential weather inputs for crop, hydrological, and other simulation models to calculate biomass production and potential evapotranspiration. The availability of long‐term observed SR data is, however, limited, especially in developing countr

Influences on inferences. Effect of erro
✍ Seymour H. Levitt; Dorothee M. Aeppli; Roger A. Potish; Chung K. Lee; Mary E. Ni πŸ“‚ Article πŸ“… 1993 πŸ› John Wiley and Sons 🌐 English βš– 705 KB

## Background: Inadvertent random and systemic errors introduced into data sets and manipulation of data are well-defined sources of discrepancies in statistical evaluation of clinical trials. in this study, the authors show the influence of errors on the widely used statistical result, p values.