Three‐way clustering around latent variables approach with constraints on the configurations to facilitate interpretation - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Journal article, Journal of Chemometrics, Year: 2021

Three‐way clustering around latent variables approach with constraints on the configurations to facilitate interpretation

Abstract

The set-up of comprehensive studies in life sciences involving a longitudinal dimension, as appears in time-scale metabolomics, calls for the use of dimension reduction techniques for three-way data structures (e.g., samples by variables by time points). For this purpose, a clustering around latent variables approach for three-way data, CLV3W, has been proposed. CLV3W aims at both partitioning the variables into nonoverlapping clusters and estimating within each cluster a rank-one Parafac model consisting of a latent component (resp. a weighting system) associated with the first mode (resp. third mode) and a vector of loadings reflecting the degree of closeness of each variable of the second mode to its cluster. In this paper, two constrained CLV3W models are discussed. First, a nonnegativity constraint is defined, implying that clusters are composed of positively correlated variables. Second, it is proposed to constrain the weighting system to be the same for all clusters. These two constraints aim at providing more parsimonious models with configurations that are easier to interpret. The appropriateness of both constraints is evaluated in a simulation study and illustrated on two case studies pertaining to sensory evaluation and metabolomics data. Regarding the first case study, CLV3W yields the identification of two consumer segments together with one common emotional pleasantness dimension associated with coffee aromas. CLV3W analysis of human preterm breast milk metabolomics data provided three clusters of lipid species that are responsible for specific functions (i.e., milk fat globule membrane constituents, fatty acid oxidation products, and lipid mediators such as eicosanoids and endocannabinoids).
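To make the model structure described in the abstract more concrete, here is a minimal Python sketch of how a fitted CLV3W array could be assembled from its parts. The notation is an assumption made for illustration and is not taken from the paper: `t[k]` stands for the latent component of cluster `k` (first mode), `w[k]` for its weighting system (third mode), `a[j]` for the loading of variable `j` (second mode), and `labels[j]` for the cluster of variable `j`. The function names are hypothetical, and the sketch only reconstructs and checks a model; it does not implement the estimation algorithm.

```python
from typing import List

Array3 = List[List[List[float]]]  # indexed as [sample][variable][time point]


def clv3w_reconstruct(t: List[List[float]], a: List[float],
                      w: List[List[float]], labels: List[int]) -> Array3:
    """Fitted values x_hat[i][j][l] = a[j] * t[k][i] * w[k][l], with k = labels[j].

    Each variable belongs to exactly one cluster, and each cluster is
    described by a rank-one structure (one component, one weight vector).
    """
    n_samples = len(t[0])
    n_times = len(w[0])
    return [[[a[j] * t[labels[j]][i] * w[labels[j]][l]
              for l in range(n_times)]
             for j in range(len(labels))]
            for i in range(n_samples)]


def residual_ss(x: Array3, x_hat: Array3) -> float:
    """Sum of squared differences between the data and the fitted array."""
    return sum((xv - fv) ** 2
               for xs, fs in zip(x, x_hat)
               for xr, fr in zip(xs, fs)
               for xv, fv in zip(xr, fr))


def satisfies_constraints(a: List[float], w: List[List[float]]) -> bool:
    """Check the two constraints discussed in the abstract:
    nonnegative loadings and one weighting system shared by all clusters."""
    return all(v >= 0 for v in a) and all(wk == w[0] for wk in w)


# Toy example (made-up numbers): 2 samples, 3 variables, 2 time points, 2 clusters.
t = [[1.0, -1.0], [0.5, 2.0]]   # one latent component per cluster
w_common = [1.0, 2.0]           # weighting system shared by both clusters
w = [w_common, w_common]
a = [1.0, 0.5, 2.0]             # nonnegative loadings
labels = [0, 0, 1]              # variables 0 and 1 in cluster 0, variable 2 in cluster 1

x_hat = clv3w_reconstruct(t, a, w, labels)
```

In this toy configuration both constraints hold, so `satisfies_constraints(a, w)` is true; a negative loading or cluster-specific weights would make it false.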

Dates and versions

hal-03191052, version 1 (06-04-2021)

Cite

Véronique Cariou, Marie Cécile Alexandre-Gouabau, Tom Wilderjans. Three‐way clustering around latent variables approach with constraints on the configurations to facilitate interpretation. Journal of Chemometrics, 2021, 35 (2), ⟨10.1002/cem.3269⟩. ⟨hal-03191052⟩