Statistical significance of threading scores - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Access content directly
Journal Articles Journal of Computational Biology Year : 2012

Statistical significance of threading scores

Abstract

We present a general method for assessing threading score significance. The threading score of a protein sequence, thread onto a given structure, should be compared with the threading score distribution of a random amino-acid sequence, of the same length, thread on the same structure; small p-values point significantly high scores. We claim that, due to general protein contact map properties, this reference distribution is a Weibull extreme value distribution whose parameters depend on the threading method, the structure, the length of the query and the random sequence simulation model used. These parameters can be estimated off-line with simulated sequence samples, for different sequence lengths. They can further be interpolated at the exact length of a query, enabling the quick computation of the p-value.

Dates and versions

hal-02647201 , version 1 (29-05-2020)

Identifiers

Cite

Afshin Fayyaz Movaghar, Guillaume Launay, Sophie S. Schbath, Jean-François Gibrat, Francois F. Rodolphe. Statistical significance of threading scores. Journal of Computational Biology, 2012, 19 (1), pp.13-29. ⟨10.1089/cmb.2011.0236⟩. ⟨hal-02647201⟩
8 View
0 Download

Altmetric

Share

Gmail Mastodon Facebook X LinkedIn More