𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Analysis of an optimal hidden Markov model for secondary structure prediction

✍ Scribed by Juliette Martin; Jean-François Gibrat; François Rodolphe


Publisher
BioMed Central
Year
2006
Tongue
English
Weight
794 KB
Volume
6
Category
Article
ISSN
1472-6807

No coin nor oath required. For personal study only.

✦ Synopsis


Background

Secondary structure prediction is a useful first step toward 3D structure prediction. A number of successful secondary structure prediction methods use neural networks, but unfortunately, neural networks are not intuitively interpretable. On the contrary, hidden Markov models are graphical interpretable models. Moreover, they have been successfully used in many bioinformatic applications. Because they offer a strong statistical background and allow model interpretation, we propose a method based on hidden Markov models.

Results

Our HMM is designed without prior knowledge. It is chosen within a collection of models of increasing size, using statistical and accuracy criteria. The resulting model has 36 hidden states: 15 that model α-helices, 12 that model coil and 9 that model β-strands. Connections between hidden states and state emission probabilities reflect the organization of protein structures into secondary structure segments. We start by analyzing the model features and see how it offers a new vision of local structures. We then use it for secondary structure prediction. Our model appears to be very efficient on single sequences, with a Q3 score of 68.8%, more than one point above PSIPRED prediction on single sequences. A straightforward extension of the method allows the use of multiple sequence alignments, rising the Q3 score to 75.5%.

Conclusion

The hidden Markov model presented here achieves valuable prediction results using only a limited number of parameters. It provides an interpretable framework for protein secondary structure architecture. Furthermore, it can be used as a tool for generating protein sequences with a given secondary structure content.


📜 SIMILAR VOLUMES


Hidden Markov models that use predicted
✍ Jeanette Hargbo; Arne Elofsson 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 112 KB 👁 2 views

There are many proteins that share the same fold but have no clear sequence similarity. To predict the structure of these proteins, so called ''protein fold recognition methods'' have been developed. During the last few years, improvements of protein fold recognition methods have been achieved throu

Prediction of protein structure classes
✍ Hiroshi Yoshikawa; Mitsunori Ikeguchi; Shugo Nakamura; Kentaro Shimizu; Junta Do 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 267 KB 👁 2 views

This study deals with structure class/secondary structure prediction of proteins using hidden Markov models (HMMs). With the proposed method, prediction is performed using HMMs designed so as to represent hierarchicality and periodicity of protein structural features. Secondary structures (partial t

Fold recognition using predicted seconda
✍ Di Francesco, Valentina; Geetha, V.; Garnier, Jean; Munson, Peter J. 📂 Article 📅 1997 🏛 John Wiley and Sons 🌐 English ⚖ 59 KB 👁 2 views

We present an analysis of the blind predictions submitted to the fold recognition category for the second meeting on the Critical Assessment of techniques for protein Structure Prediction. Our method achieves fold recognition from predicted secondary structure sequences using hidden Markov models (H

Structure analysis of soccer video with
✍ Lexing Xie; Peng Xu; Shih-Fu Chang; Ajay Divakaran; Huifang Sun 📂 Article 📅 2004 🏛 Elsevier Science 🌐 English ⚖ 355 KB

In this paper, we present statistical techniques for parsing the structure of produced soccer programs. The problem is important for applications such as personalized video streaming and browsing systems, in which videos are segmented into different states and important states are selected based on

Combined Bayesian and predictive techniq
✍ SM Ahadi; PC Woodland 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 302 KB

One problem faced by some model adaptation techniques is that only the parameters of those models which are observed in the adaptation data are updated. Hence, with small amounts of adaptation data most of the system parameters remain unchanged. In this paper, a technique called regression-based mod