This article presents SOMCD, an improved method for the evaluation of protein secondary structure from circular dichroism spectra, based on Kohonen's self-organizing maps (SOM). Protein circular dichroism (CD) spectra are used to train a SOM, which arranges the spectra on a two-dimensional map. Loca
Variable selection method improves the prediction of protein secondary structure from circular dichroism spectra
β Scribed by Parthasarathy Manavalan; W.Curtis Johnson Jr.
- Publisher
- Elsevier Science
- Year
- 1987
- Tongue
- English
- Weight
- 812 KB
- Volume
- 167
- Category
- Article
- ISSN
- 0003-2697
No coin nor oath required. For personal study only.
β¦ Synopsis
A new procedure based on the statistical method of "variable selection" is used to predict the secondary structure of proteins from circular dichroism spectra. Variable selection adds the flexibility found in the Provencher and GlΓΆckner method (S. W. Provencher and J. GlΓΆckner, 1981, Biochemistry 20, 33-37) to the method of Hennessey and Johnson (J. P. Hennessey and W. C. Johnson, 1981, Biochemistry 20, 1085-1094). Two analytical methods are presented for choosing a solution from the series generated by the Provencher and GlΓΆckner method, and this improves the technique. All three methods are compared and it is shown that both the variable selection method and the improved Provencher and GlΓΆckner methods have equivalent reliability superior to the original Hennessey and Johnson method. For the new variable selection method, correlation coefficients calculated between X-ray structure and predicted secondary structures for data measured to 178 nm are: 0.97 for alpha-helix, 0.75 for beta-sheet, 0.50 for beta-turn, and 0.89 for other structures. Although the variable selection method improves the analysis of circular dichroism data truncated at 190 nm, data measured to 178 nm gives superior results. It is shown that improving the fit to the measured CD beyond the accuracy of the data can result in poorer analyses.
π SIMILAR VOLUMES
The estimation of protein secondary structure from circular dichroism spectra is described by a multivariate linear model with noise (Gauss-Markoff model). With this formalism the adequacy of the linear model is investigated, paying special attention to the estimation of the error in the secondary s
In least-squares fitting of protein circular dichroism (CD) spectra using basis CD spectra for the respective secondary structure components, as given by reference proteins of known structural composition, good fits of the CD spectrum do not necessarily correspond to appropriate fits of the underlyi
We have expanded our reference set of proteins used in the estimation of protein secondary structure by CD spectroscopy from 29 to 37 proteins by including 3 additional globular proteins with known X-ray structure and 5 denatured proteins. We have also modified the self-consistent method for analyzi