We discuss how methods based on hidden Markov models performed in the fold-recognition section of the CASP2 experiment. Hidden Markov models were built for a representative set of just over 1,000 structures from the Protein Data Bank (PDB). Each CASP2 target sequence was scored against this library
Haplotype inference using a Bayesian Hidden Markov model
โ Scribed by Shuying Sun; Celia M.T. Greenwood; Radford M. Neal
- Publisher
- John Wiley and Sons
- Year
- 2007
- Tongue
- English
- Weight
- 191 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0741-0395
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Knowledge of haplotypes is useful for understanding block structure in the genome and disease risk associations. Direct measurement of haplotypes in the absence of family data is presently impractical, and hence, several methods have been developed for reconstructing haplotypes from population data. We have developed a new populationโbased method using a Bayesian Hidden Markov model for the source of the ancestral haplotype segments. In our Bayesian model, a higher order Markov model is used as the prior for ancestral haplotypes, to account for linkage disequilibrium. Our model includes parameters for the genotyping error rate, the mutation rate, and the recombination rate at each position. Computation is done by Markov Chain Monte Carlo using the forwardโbackward algorithm to efficiently sum over all possible state sequences of the Hidden Markov model. We have used the model to reconstruct the haplotypes of 129 children at a region on chromosome 5 in the data set of Daly et al. [2001] (for which true haplotypes are obtained based on parental genotypes) and of 30 children at selected regions in the CEU and YRI data of the HAPMAP project. The results are quite close to the familyโbased reconstructions and comparable with the stateโofโtheโart PHASE program. Our haplotype reconstruction method does not require division of the markers into small blocks of loci. The recombination rates inferred from our model can help to predict haplotype block boundaries, and estimate recombination hotspots. Genet. Epidemiol. 2007. ยฉ 2007 WileyโLiss, Inc.
๐ SIMILAR VOLUMES
## Abstract In this paper we propose a Bayesian modeling approach to the analysis of genomeโwide association studies based on single nucleotide polymorphism (SNP) data. Our latent seed model combines various aspects of __k__โmeans clustering, hidden Markov models (HMMs) and logistic regression into
The analysis of routinely collected surveillance data is an important challenge in public health practice. We present a method based on a hidden Markov model for monitoring such time series. The model characterizes the sequence of measurements by assuming that its probability density function depend