A hybrid and exploratory approach to knowledge discovery in metabolomic data - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Journal article, Discrete Applied Mathematics, Year: 2020

A hybrid and exploratory approach to knowledge discovery in metabolomic data

Dhouha Grissa
  • Role: Author
  • PersonId : 771971
  • IdRef : 179415352
Blandine Comte
Mélanie Pétéra
Estelle Pujos-Guillot
Amedeo Napoli

Abstract

In this paper, we propose a hybrid and exploratory knowledge discovery approach for analyzing complex metabolomic data, based on a combination of supervised classifiers, pattern mining, and Formal Concept Analysis (FCA). The approach relies on three main operations: preprocessing, classification, and postprocessing. Classifiers are applied to datasets of the form individuals×features and produce sets of ranked features, which are then analyzed further. Pattern mining and FCA provide a complementary analysis and support visualization. A practical application of this framework is presented in the context of metabolomic data, where two interrelated problems are considered: discrimination and prediction of class membership. The dataset is characterized by a small set of individuals and a large set of features, among which predictive biomarkers of clinical outcomes should be identified. The problems of combining numerical and symbolic data mining methods, as well as discrimination and prediction, are detailed and discussed. Moreover, visualization based on FCA can be used both for guiding knowledge discovery and for interpretation by domain analysts.
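To make the FCA step more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a binary context in which the objects are features and the attributes are the classifiers that ranked each feature highly; the abstract does not specify this encoding, and the names `feature_1`, `clf_A`, and so on are hypothetical. The sketch enumerates every formal concept (a maximal set of features together with the classifiers they all share) by brute force, which is only practical for a toy table.

```python
from itertools import combinations

# Hypothetical toy context (an assumption, not data from the paper):
# each feature maps to the set of classifiers that ranked it highly.
context = {
    "feature_1": {"clf_A", "clf_B", "clf_C"},
    "feature_2": {"clf_A", "clf_B"},
    "feature_3": {"clf_B", "clf_C"},
    "feature_4": {"clf_C"},
}


def common_attributes(objects, context, all_attributes):
    """Return the attributes shared by every object in `objects`."""
    shared = set(all_attributes)
    for obj in objects:
        shared &= context[obj]
    return shared


def objects_having(attributes, context):
    """Return the objects that possess every attribute in `attributes`."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def formal_concepts(context):
    """Enumerate all formal concepts as (extent, intent) pairs.

    Brute force over all subsets of objects, so the cost is
    exponential in the number of objects.
    """
    all_attributes = set().union(*context.values())
    objects = sorted(context)
    concepts = set()
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            # Closing any object subset yields a concept: (subset'', subset').
            intent = common_attributes(subset, context, all_attributes)
            extent = objects_having(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


concepts = formal_concepts(context)
# List concepts from the most general (largest extent) to the most specific.
for extent, intent in sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[0]))):
    print(sorted(extent), "<->", sorted(intent))
```

Ordering these concepts by inclusion of their extents gives the concept lattice, the structure that FCA-based visualization displays. For real contexts, a dedicated algorithm would replace the brute-force loop.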
Main file
grissa-etal-dam20.pdf (717.14 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02195463, version 1 (10-10-2020)

License

Attribution - NonCommercial - NoDerivatives

Cite

Dhouha Grissa, Blandine Comte, Mélanie Pétéra, Estelle Pujos-Guillot, Amedeo Napoli. A hybrid and exploratory approach to knowledge discovery in metabolomic data. Discrete Applied Mathematics, 2020, 273 (SI), pp.103-116. ⟨10.1016/j.dam.2018.11.025⟩. ⟨hal-02195463⟩