Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm

High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the use of a spike-and-slab prior and the Stochastic Approximation version of the Expectation Maximisation (SAEM) algorithm. Similarly to Lasso regression, the set of relevant covariates is selected by exploring a grid of values for the penalisation parameter. The SAEM approach is much faster than a classical Markov chain Monte Carlo algorithm and our method shows very good selection performances on simulated data. Its flexibility is demonstrated by implementing it for a variety of nonlinear mixed effects models. The usefulness of the proposed method is illustrated on a problem of genetic markers identification, relevant for genomic-assisted selection in plant breeding.

Mots clés

High-dimension Non-linear mixed effect models SAEM algorithm Spike-and-slab prior Variable selection

Domaines

Statistiques [math.ST]

Marion Naveau : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-04395282

Soumis le : lundi 15 janvier 2024-15:31:18

Dernière modification le : lundi 16 décembre 2024-12:12:04

Dates et versions

hal-04395282 , version 1 (15-01-2024)

Identifiants

HAL Id : hal-04395282 , version 1
ARXIV : 2206.01012
DOI : 10.1007/s11222-023-10367-4
WOS : 001126177100001

Citer

Marion Naveau, Guillaume Kon Kam King, Renaud Rincent, Laure Sansonnet, Maud Delattre. Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm. Statistics and Computing, 2024, 34 (1), pp.53. ⟨10.1007/s11222-023-10367-4⟩. ⟨hal-04395282⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

AGROPARISTECH CNRS MIA-PARIS GQE UNIV-PARIS-SACLAY INRAE ANR GS-MATHEMATIQUES GS-COMPUTER-SCIENCE GS-BIOSPHERA GS-LIFE-SCIENCES-HEALTH MAIAGE MICA-UNITES MATHNUM RESEAU-EAU BIOLOGIE_ET_AMELIORATION_DES_PLANTES

84 Consultations

0 Téléchargements