Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Estimating minimum effect with outlier selection

Abstract : We introduce one-sided versions of Huber's contamination model, in which corrupted samples tend to take larger values than uncorrupted ones. Two intertwined problems are addressed: estimation of the mean of uncorrupted samples (minimum effect) and selection of corrupted samples (outliers). Regarding the minimum effect estimation, we derive the minimax risks and introduce adaptive estimators to the unknown number of contaminations. Interestingly, the optimal convergence rate highly differs from that in classical Huber's contamination model. Also, our analysis uncovers the effect of particular structural assumptions on the distribution of the contaminated samples. As for the problem of selecting the outliers, we formulate the problem in a multiple testing framework for which the location/scaling of the null hypotheses are unknown. We rigorously prove how estimating the null hypothesis is possible while maintaining a theoretical guarantee on the amount of the falsely selected outliers, both through false discovery rate (FDR) or post hoc bounds. As a by-product, we address a long-standing open issue on FDR control under equi-correlation, which reinforces the interest of removing dependency when making multiple testing.
Document type :
Preprints, Working Papers, ...
Complete list of metadata

https://hal.inrae.fr/hal-02966862
Contributor : Nicolas Verzelen <>
Submitted on : Wednesday, October 14, 2020 - 2:48:49 PM
Last modification on : Monday, December 14, 2020 - 9:48:01 AM

Links full text

Identifiers

  • HAL Id : hal-02966862, version 1
  • ARXIV : 1809.08330

Citation

Alexandra Carpentier, Sylvain Delattre, Etienne Roquain, Nicolas Verzelen. Estimating minimum effect with outlier selection. 2020. ⟨hal-02966862⟩

Share

Metrics