Which dissimilarity is to be used when extracting typologies in sequence analysis? A comparative study

Sébastien Massoni; Madalina Olteanu; Nathalie N. Villa-Vialaneix

Conference paper, Year: 2013


Sébastien Massoni

Role: Author
PersonId : 184147
IdHAL : sebastien-massoni
ORCID : 0000-0001-6980-7505
IdRef : 148445756

Centre d'économie de la Sorbonne

Madalina Olteanu

Role: Author
PersonId : 838365
ORCID : 0000-0001-7329-8731
IdRef : 126935807

Université Paris 1 Panthéon-Sorbonne

Nathalie N. Villa-Vialaneix

Role: Author
PersonId : 4221
IdHAL : nathalie-vialaneix
ORCID : 0000-0003-1156-0639
IdRef : 101680503

Unité de Mathématiques et Informatique Appliquées de Toulouse

Abstract

Originally developed in bioinformatics, sequence analysis is increasingly used in the social sciences to study life-course processes. The usual methodology consists in computing dissimilarities between trajectories and, when typologies are sought, clustering the trajectories according to these dissimilarities. The choice of an appropriate dissimilarity measure is a major issue in the analysis of life sequences. Several dissimilarities are available in the literature, but none of them has established itself as the undisputed choice. In this paper, instead of settling on a single dissimilarity measure, we propose to use an optimal convex combination of different dissimilarities. The optimal combination is determined automatically by the clustering procedure and is defined with respect to the within-class variance.
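To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' procedure. It builds two toy dissimilarity matrices on a handful of sequences, mixes them with convex weights, and evaluates a within-class dispersion for a fixed partition. Everything named here is an assumption introduced for illustration: the functions `hamming_matrix`, `state_count_matrix`, `combine` and `within_class_dispersion`, the choice of the two dissimilarities, and the toy data. The dispersion uses the standard identity that, for squared Euclidean distances, the sum of squared deviations from a cluster mean equals the sum of pairwise dissimilarities within the cluster divided by twice the cluster size; applying it to arbitrary dissimilarities is a common generalization, assumed here. How the weights are actually optimized during clustering is described in the paper, not in this sketch.

```python
import numpy as np


def hamming_matrix(seqs):
    """Pairwise number of positions at which equal-length sequences differ."""
    arr = np.array([list(s) for s in seqs])
    return (arr[:, None, :] != arr[None, :, :]).sum(axis=2).astype(float)


def state_count_matrix(seqs, states):
    """Pairwise L1 distance between state-count vectors (ignores timing)."""
    counts = np.array([[s.count(a) for a in states] for s in seqs], dtype=float)
    return np.abs(counts[:, None, :] - counts[None, :, :]).sum(axis=2)


def combine(dissims, alpha):
    """Convex combination sum_k alpha_k * D_k, with alpha on the simplex."""
    alpha = np.asarray(alpha, dtype=float)
    if np.any(alpha < 0) or not np.isclose(alpha.sum(), 1.0):
        raise ValueError("alpha must be non-negative and sum to 1")
    return sum(a * d for a, d in zip(alpha, dissims))


def within_class_dispersion(d, labels):
    """Sum over clusters of (sum of pairwise dissimilarities) / (2 * size)."""
    labels = np.asarray(labels)
    total = 0.0
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        total += d[np.ix_(idx, idx)].sum() / (2 * len(idx))
    return total


# Toy data (hypothetical): four trajectories over states "A" and "B".
seqs = ["AAAB", "AABB", "BBBA", "BBAA"]
labels = [0, 0, 1, 1]

d_ham = hamming_matrix(seqs)
d_cnt = state_count_matrix(seqs, states="AB")
d_mix = combine([d_ham, d_cnt], alpha=[0.5, 0.5])

dispersion_ham = within_class_dispersion(d_ham, labels)
dispersion_mix = within_class_dispersion(d_mix, labels)
```

In practice the component dissimilarities would need to be put on a comparable scale before mixing; otherwise the criterion simply favors whichever matrix has the smallest values. That normalization step is left out of the sketch.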

Keywords

sequence analysis; comparative study; clustering procedure; dissimilarity measure; variance

Domains

Mathematics [math], Computer Science [cs]

Main file

massoni_etal_IWANN2013_1.pdf (483.41 KB)

Origin: Files produced by the author(s)


https://hal.inrae.fr/hal-02806089

Submitted on: Saturday, June 6, 2020, 01:40:41

Last modified on: Wednesday, June 19, 2024, 17:02:23

Dates and versions

hal-02806089 , version 1 (06-06-2020)

Identifiers

HAL Id : hal-02806089 , version 1
PRODINRA : 253514
WOS : 000324897700005

Cite

Sébastien Massoni, Madalina Olteanu, Nathalie N. Villa-Vialaneix. Which dissimilarity is to be used when extracting typologies in sequence analysis? A comparative study. International Workshop on Artificial Neural Networks, Jun 2013, Puerto de la Cruz, Tenerife, Spain. ⟨hal-02806089⟩


Collections

UNIV-PARIS1 CNRS INRA CES INRAE INRAEOCCITANIETOULOUSE MATHNUM MIAT

