Changepoint detection in the presence of outliers

Paul Fearnhead; Guillem Rigaill

doi:10.1080/01621459.2017.1385466

Article Dans Une Revue Journal of the American Statistical Association Année : 2019

Changepoint detection in the presence of outliers

(1) , (2, 3, 4)

1
2
3
4

Paul Fearnhead

Fonction : Auteur

Department of mathematics and statistics

Guillem Rigaill

Fonction : Auteur
PersonId : 737654
IdHAL : guillem-rigaill
ORCID : 0000-0002-7176-7511
IdRef : 19821684X

Institut des Sciences des Plantes de Paris-Saclay

Université Sorbonne Paris Cité (COMUE)

Laboratoire de Mathématiques et Modélisation d'Evry

Résumé

Many traditional methods for identifying changepoints can struggle in the presence of outliers, or when the noise is heavy-tailed. Often they will infer additional changepoints in order to fit the outliers. To overcome this problem, data often needs to be pre-processed to remove outliers, though this is difficult for applications where the data needs to be analysed online. We present an approach to changepoint detection that is robust to the presence of outliers. The idea is to adapt existing penalised cost approaches for detecting changes so that they use loss functions that are less sensitive to outliers. We argue that loss functions that are bounded, such as the classical biweight loss, are particularly suitable - as we show that only bounded loss functions are robust to arbitrarily extreme outliers. We present an efficient dynamic programming algorithm that can find the optimal segmentation under our penalised cost criteria. Importantly, this algorithm can be used in settings where the data needs to be analysed online. We show that we can consistently estimate the number of changepoints, and accurately estimate their locations, using the biweight loss function. We demonstrate the usefulness of our approach for applications such as analysing well-log data, detecting copy number variation, and detecting tampering of wireless devices.

Mots clés

Binary Segmentation Biweight loss Cusum M-estimation Penalised likelihood Robust Statistics

Domaines

Sciences du Vivant [q-bio] Biologie végétale

Fichier principal

2019_Fearnhead_J Am Stat Assoc_1.pdf (6.15 Mo)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Migration ProdInra : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-02622377

Soumis le : mardi 26 mai 2020-05:29:31

Dernière modification le : samedi 27 avril 2024-03:14:06

Dates et versions

hal-02622377 , version 1 (26-05-2020)

Licence

Paternité

Identifiants

HAL Id : hal-02622377 , version 1
DOI : 10.1080/01621459.2017.1385466
PRODINRA : 409584
WOS : 000471325500018

Citer

Paul Fearnhead, Guillem Rigaill. Changepoint detection in the presence of outliers. Journal of the American Statistical Association, 2019, 114 (525), pp.169-183. ⟨10.1080/01621459.2017.1385466⟩. ⟨hal-02622377⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PARIS7 CNRS UNIV-EVRY INRA INSMI USPC LAMME IPS2 UNIV-PARIS-SACLAY INRAE UP-SCIENCES GS-ENGINEERING MATHNUM BIOLOGIE_ET_AMELIORATION_DES_PLANTES

64 Consultations

83 Téléchargements

Changepoint detection in the presence of outliers

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager