On the accuracy in high dimensional linear models and its application to genomic selection - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Scandinavian Journal of Statistics Année : 2019

On the accuracy in high dimensional linear models and its application to genomic selection

Résumé

Genomic selection, a hot topic in genetics, consists in predicting breeding values of selection candidates, using a large number of genetic markers , due to the recent progress in molecular biology. One of the most popular method chosen by geneticists is Ridge regression. In this context, we focus on some predictive aspects of Ridge regression and present theoretical results regarding the accuracy criteria, i.e., the correlation between predicted value and true value. We show the influence of the singular values, the regularization parameter , and the projection of the signal on the space spanned by the rows of the design matrix. Asymptotic results, in a high dimensional framework, are also given, and we prove that the convergence to an optimal accuracy highly depends on a weighted projection of the signal on each subspace. We discuss also on how to improve the prediction. Last, illustrations on simulated and real data are proposed.
Fichier principal
Vignette du fichier
RabierManginGruseaV2ForHal.pdf (760.06 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01456310 , version 1 (04-02-2017)
hal-01456310 , version 2 (11-03-2018)

Identifiants

Citer

Charles-Elie Rabier, Brigitte Mangin, Simona Grusea. On the accuracy in high dimensional linear models and its application to genomic selection. Scandinavian Journal of Statistics, 2019, 46 (1), pp.289-313. ⟨10.1111/sjos.12352⟩. ⟨hal-01456310v2⟩
2768 Consultations
351 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More