A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Journal article, Expert Systems with Applications, Year: 2019

A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise

Abstract

Hierarchical clustering is widely used in data mining. The single linkage criterion is powerful, as it handles clusters of various shapes and densities, but it is sensitive to noise. This work proposes two improvements to deal with noise. First, the single linkage criterion takes local density into account so that the distance is computed between core points of each group. Second, the hierarchical algorithm forbids the merging of representative clusters, those larger than a minimum size, once they have been identified. The experiments include a sensitivity analysis of the parameters and a comparison of the available criteria on datasets known from the literature. This comparison showed that local criteria yield better results than global ones. The three single linkage criteria were then compared in more challenging situations, which highlighted the complementarity of the two levels of improvement: the criterion and the clustering algorithm.
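The two ideas in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the density estimate (distance to the k-th nearest neighbour), the fixed `core_radius` threshold, the fallback for clusters without core points, and the names `knn_distance`, `core_single_linkage` and `cluster` are all assumptions chosen for illustration. The sketch only shows how a linkage restricted to core points and a ban on merging two clusters of at least `min_size` points could fit together.

```python
import math
from itertools import combinations


def knn_distance(points, k):
    """Distance from each point to its k-th nearest neighbour (assumed density proxy)."""
    out = []
    for i, p in enumerate(points):
        d = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        out.append(d[k - 1])
    return out


def core_single_linkage(a, b, points, is_core):
    """Minimum distance between the core points of clusters a and b.

    Assumption: a cluster with no core point falls back to all of its points.
    """
    ca = [i for i in a if is_core[i]] or a
    cb = [i for i in b if is_core[i]] or b
    return min(math.dist(points[i], points[j]) for i in ca for j in cb)


def cluster(points, k=2, core_radius=1.5, min_size=3, n_clusters=1):
    """Agglomerative clustering that never merges two representative clusters."""
    kd = knn_distance(points, k)
    is_core = [d <= core_radius for d in kd]  # hypothetical core-point rule
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for x, y in combinations(range(len(clusters)), 2):
            a, b = clusters[x], clusters[y]
            if len(a) >= min_size and len(b) >= min_size:
                continue  # both are representative: merging is forbidden
            d = core_single_linkage(a, b, points, is_core)
            if best is None or d < best[0]:
                best = (d, x, y)
        if best is None:
            break  # only forbidden merges remain
        _, x, y = best
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]


# Two compact blobs (indices 0-3 and 4-7) joined by a sparse bridge of noise (8-10).
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (3, 0.5), (5.5, 0.5), (8, 0.5)]
result = cluster(points, k=2, core_radius=1.5, min_size=3, n_clusters=1)
print(result)
```

Even though a single cluster is requested, the run stops with the two blobs in separate clusters: once each blob reaches `min_size`, the noise bridge can attach to either blob but can no longer chain them together, which is the failure mode of plain single linkage that the abstract describes.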
Main file
pub00061222.pdf (3.98 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02609244, version 1 (16-05-2020)

Cite

F. Ros, Serge Guillaume. A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise. Expert Systems with Applications, 2019, 128, pp.96-108. ⟨10.1016/j.eswa.2019.03.031⟩. ⟨hal-02609244⟩