A machine learning approach for the prediction of protein surface loop flexibility
✍ Scribed by Howook Hwang; Thom Vreven; Troy W. Whitfield; Kevin Wiehe; Zhiping Weng
- Publisher
- John Wiley and Sons
- Year
- 2011
- Tongue
- English
- Weight
- 398 KB
- Volume
- 79
- Category
- Article
- ISSN
- 0887-3585
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
Proteins often undergo conformational changes when binding to each other. A major fraction of backbone conformational changes involves motion on the protein surface, particularly in loops. Accounting for the motion of protein surface loops represents a challenge for protein–protein docking algorithms. A first step in addressing this challenge is to distinguish protein surface loops that are likely to undergo backbone conformational changes upon protein–protein binding (mobile loops) from those that are not (stationary loops). In this study, we developed a machine learning strategy based on support vector machines (SVMs). Our SVM uses three features of loop residues in the unbound protein structures—Ramachandran angles, crystallographic B‐factors, and relative accessible surface area—to distinguish mobile loops from stationary ones. This method yields an average prediction accuracy of 75.3% compared with a random prediction accuracy of 50%, and an average of 0.79 area under the receiver operating characteristic (ROC) curve using cross‐validation. Testing the method on an independent dataset, we obtained a prediction accuracy of 70.5%. Finally, we applied the method to 11 complexes that involve members from the Ras superfamily and achieved prediction accuracy of 92.8% for the Ras superfamily proteins and 74.4% for their binding partners. Proteins 2011; © 2011 Wiley‐Liss, Inc.
📜 SIMILAR VOLUMES
## Abstract Identification and characterization of antigenic determinants on proteins has received considerable attention utilizing both, experimental as well as computational methods. For computational routines mostly structural as well as physicochemical parameters have been utilized for predicti
A novel method for estimating prediction uncertainty using machine learning techniques is presented. Uncertainty is expressed in the form of the two quantiles (constituting the prediction interval) of the underlying distribution of prediction errors. The idea is to partition the input space into dif
## Abstract Accurate protein side‐chain conformation prediction is crucial for protein modeling and existing methods for the task are widely used; however, faster and more accurate methods are still required. Here we present a new machine learning approach to the problem where an energy function fo