Statistical significance of threading scores - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Journal of Computational Biology Année : 2012

Statistical significance of threading scores

Résumé

We present a general method for assessing threading score significance. The threading score of a protein sequence, thread onto a given structure, should be compared with the threading score distribution of a random amino-acid sequence, of the same length, thread on the same structure; small p-values point significantly high scores. We claim that, due to general protein contact map properties, this reference distribution is a Weibull extreme value distribution whose parameters depend on the threading method, the structure, the length of the query and the random sequence simulation model used. These parameters can be estimated off-line with simulated sequence samples, for different sequence lengths. They can further be interpolated at the exact length of a query, enabling the quick computation of the p-value.

Dates et versions

hal-02647201 , version 1 (29-05-2020)

Identifiants

Citer

Afshin Fayyaz Movaghar, Guillaume Launay, Sophie S. Schbath, Jean-François Gibrat, Francois F. Rodolphe. Statistical significance of threading scores. Journal of Computational Biology, 2012, 19 (1), pp.13-29. ⟨10.1089/cmb.2011.0236⟩. ⟨hal-02647201⟩
7 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More