BWGS: A R package for genomic selection and its application to a wheat breeding programme

Gilles Charmet; Louis-Gautier Tran; Jerome Auzanneau; Renaud Rincent; Sophie Bouchet

doi:10.1371/journal.pone.0222733

Article Dans Une Revue PLoS ONE Année : 2020

BWGS: A R package for genomic selection and its application to a wheat breeding programme

(1) , (1) , (2) , (1) , (1)

1
2

Gilles Charmet

Fonction : Auteur
PersonId : 735966
IdHAL : gcharmet
ORCID : 0000-0001-8927-5040
IdRef : 136934862

Génétique Diversité et Ecophysiologie des Céréales

Louis-Gautier Tran

Fonction : Auteur

Génétique Diversité et Ecophysiologie des Céréales

Jerome Auzanneau

Fonction : Auteur

Agri Obtentions

Renaud Rincent

Fonction : Auteur
PersonId : 748077
IdHAL : renaud-rincent
ORCID : 0000-0003-0885-0969
IdRef : 155055224

Génétique Diversité et Ecophysiologie des Céréales

Sophie Bouchet

Fonction : Auteur
PersonId : 748396
IdHAL : sophie-bouchet
ORCID : 0000-0001-5868-3359

Génétique Diversité et Ecophysiologie des Céréales

Résumé

We developed an integrated R library called BWGS to enable easy computation of Genomic Estimates of Breeding values (GEBV) for genomic selection. BWGS, for BreedWheat Genomic selection, was developed in the framework of a cooperative private-public partnership project called Breedwheat (https://breedwheat.fr) and relies on existing R-libraries, all freely available from CRAN servers. The two main functions enable to run 1) replicated random cross validations within a training set of genotyped and phenotyped lines and 2) GEBV prediction, for a set of genotyped-only lines. Options are available for 1) missing data imputation, 2) markers and training set selection and 3) genomic prediction with 15 different methods, either parametric or semi-parametric. The usefulness and efficiency of BWGS are illustrated using a population of wheat lines from a real breeding programme. Adjusted yield data from historical trials (highly unbalanced design) were used for testing the options of BWGS. On the whole, 760 candidate lines with adjusted phenotypes and genotypes for 47 839 robust SNP were used. With a simple desktop computer, we obtained results which compared with previously published results on wheat genomic selection. As predicted by the theory, factors that are most influencing predictive ability, for a given trait of moderate heritability, are the size of the training population and a minimum number of markers for capturing every QTL information. Missing data up to 40%, if randomly distributed, do not degrade predictive ability once imputed, and up to 80% randomly distributed missing data are still acceptable once imputed with Expectation-Maximization method of package rrBLUP. It is worth noticing that selecting markers that are most associated to the trait do improve predictive ability, compared with the whole set of markers, but only when marker selection is made on the whole population. When marker selection is made only on the sampled training set, this advantage nearly disappeared, since it was clearly due to overfitting. Few differences are observed between the 15 prediction models with this dataset. Although non-parametric methods that are supposed to capture non-additive effects have slightly better predictive accuracy, differences remain small. Finally, the GEBV from the 15 prediction models are all highly correlated to each other. These results are encouraging for an efficient use of genomic selection in applied breeding programmes and BWGS is a simple and powerful toolbox to apply in breeding programmes or training activities.

Domaines

Génétique des plantes

Fichier principal

2020_Charmet_Plos_One.pdf (2.64 Mo)

2020_Charmet_Plos_One_Erratum.pdf (200.37 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Christine Molé : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-02571067

Soumis le : jeudi 19 novembre 2020-08:57:47

Dernière modification le : jeudi 23 novembre 2023-13:40:24

Dates et versions

hal-02571067 , version 1 (19-11-2020)

Licence

Paternité

Identifiants

HAL Id : hal-02571067 , version 1
DOI : 10.1371/journal.pone.0222733
PUBMED : 32240182
WOS : 000535945000002

Citer

Gilles Charmet, Louis-Gautier Tran, Jerome Auzanneau, Renaud Rincent, Sophie Bouchet. BWGS: A R package for genomic selection and its application to a wheat breeding programme. PLoS ONE, 2020, 15 (4), pp.1-20. ⟨10.1371/journal.pone.0222733⟩. ⟨hal-02571067⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PRES_CLERMONT GDEC INRAE ANR BIOLOGIE_ET_AMELIORATION_DES_PLANTES

29 Consultations

56 Téléchargements

BWGS: A R package for genomic selection and its application to a wheat breeding programme

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager