Skip to Main content Skip to Navigation
Journal articles

Further developments of protein secondary structure prediction using information theory

Abstract : We have re-evaluated the information used in the Garnier-Osguthorpe-Robson (GOR) method of secondary structure prediction with the currently available database. The framework of information theory provides a means to formulate the influence of local sequence upon the conformation of a given residue, in a rigorous manner. However, the existing database does not allow the evaluation of parameters required for an exact treatment of the problem. The validity of the approximations drawn from the theory is examined. It is shown that the first-level approximation, involving single-residue parameters, is only marginally improved by an increase in the database. The second-level approximation, involving pairs of residues, provides a better model. However, in this case the database is not big enough and this method might lead to parameters with deficiencies. Attention is therefore given to overcoming this lack of data. We have determined the significant pairs and the number of dummy observations necessary to obtain the best result for the prediction. This new version of the GOR method increases the accuracy of prediction by 7%, bringing the amount of residues correctly predicted to 63% for three states and 68 proteins, each protein to be predicted being removed from the database and the parameters derived from the other proteins. If the protein to be predicted is kept in the database the accuracy goes up to 69.7%.
Document type :
Journal articles
Complete list of metadata

https://hal.inrae.fr/hal-03328011
Contributor : Jean-François Gibrat <>
Submitted on : Friday, August 27, 2021 - 6:37:08 PM
Last modification on : Wednesday, September 1, 2021 - 3:28:43 AM

Licence


Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License

Links full text

Identifiers

Collections

Citation

Jean-François Gibrat, J. Garnier, B. Robson. Further developments of protein secondary structure prediction using information theory. Journal of Molecular Biology, Elsevier, 1987, 198 (3), pp.425-443. ⟨10.1016/0022-2836(87)90292-0⟩. ⟨hal-03328011⟩

Share

Metrics

Record views

18