Assessing the validity domains of graphical Gaussian models in order to infer relationships among components of complex biological systems - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Rapport (Rapport Technique) Année : 2008

Assessing the validity domains of graphical Gaussian models in order to infer relationships among components of complex biological systems

Résumé

The study of the interactions of cellular components is an essential base step to understand the structure and dynamics of biological networks. So, various methods were recently developed in this purpose. While most of them combine different types of data and ¡em¿apriori¡/em¿ knowledge, methods based on Graphical Gaussian Models are capable of learning the network directly from raw data. They consider the full-order partial correlations which are partial correlations between two variables given the remaining ones, for modelling direct links between variables. Statistical methods were developed for estimating these links when the number of observations is larger than the number of variables. However, the rapid advance of new technologies that allow to simultaneous measure genome expression, led to largescale datasets where the number of variables is far larger than the number of observations. To get round this dimensionality problem, different strategies and new statistical methods were proposed. In this study we focused on statistical methods recently published. All are based on the fact that the number of direct relationship between two variables is very small in regards to the number of possible relationships, ¡em¿p(p-1)/2¡/em¿. In the biological context, this assumption is not always satisfied over the whole graph. So it is essential to precisely know the behaviour of the methods in regards to the characteristics of the studied object before applying them. For this purpose, we evaluated the validity domain of each method from wide-ranging simulated datasets. We then illustrated our results using recently published biological data
Fichier principal
Vignette du fichier
37451_20100415113748429_1.pdf (332.6 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02813966 , version 1 (06-06-2020)

Identifiants

  • HAL Id : hal-02813966 , version 1
  • PRODINRA : 37451

Citer

Fanny Villers, Brigitte Schaeffer, Caroline Bertin, Sylvie Huet. Assessing the validity domains of graphical Gaussian models in order to infer relationships among components of complex biological systems. [Technical Report] 2008-2, 2008. ⟨hal-02813966⟩
8 Consultations
6 Téléchargements

Partager

Gmail Facebook X LinkedIn More