This article presents SOMCD, an improved method for the evaluation of protein secondary structure from circular dichroism spectra, based on Kohonen's self-organizing maps (SOM). Protein circular dichroism (CD) spectra are used to train a SOM, which arranges the spectra on a two-dimensional map. Loca
Analyzing protein circular dichroism spectra for accurate secondary structures
β Scribed by W. Curtis Johnson
- Publisher
- John Wiley and Sons
- Year
- 1999
- Tongue
- English
- Weight
- 44 KB
- Volume
- 35
- Category
- Article
- ISSN
- 0887-3585
No coin nor oath required. For personal study only.
β¦ Synopsis
We have developed an algorithm to analyze the circular dichroism of proteins for secondary structure. Its hallmark is tremendous flexibility in creating the basis set, and it also combines the ideas of many previous workers. We also present a new basis set containing the CD spectra of 22 proteins with secondary structures from high quality X-ray diffraction data. High flexibility is obtained by doing the analysis with a variable selection basis set of only eight proteins. Many variable selection basis sets fail to give a good analysis, but good analyses can be selected without any a priori knowledge by using the following criteria: (1) the sum of secondary structures should be close to 1.0, (2) no fraction of secondary structure should be less than -0.03, (3) the reconstructed CD spectrum should fit the original CD spectrum with only a small error, and (4) the fraction of β£-helix should be similar to that obtained using all the proteins in the basis set. This algorithm gives a root mean square error for the predicted secondary structure for the proteins in the basis set of 3.3% for β£-helix, 2.6% for 3 10 -helix, 4.2% for β€-strand, 4.2% for β€-turn, 2.7% for poly(Lproline) II type 3 1 -helix, and 5.1% for other structures when compared with the X-ray structure.
π SIMILAR VOLUMES
A primary and a secondary neural network are applied to secondary structure and structural class prediction for a database of 681 non-homologous protein chains. A new method of decoding the outputs of the secondary structure prediction network is used to produce an estimate of the probability of fin
Protein S (PS) and growth arrest specific factor 6 (GAS6) are vitamin K-dependent proteins with similar structures. They are mosaic proteins possessing a carboxylterminal region presenting sequence similarity with plasma sex hormone binding globulin (plasma SHBG), although apparently not involved in