Accounting for overlapping annotations in genomic prediction models of complex traits - Archive ouverte HAL Access content directly
Journal Articles BMC Bioinformatics Year : 2022

Accounting for overlapping annotations in genomic prediction models of complex traits

(1) , (2) , (1) , (1, 3)
1
2
3
Fanny Mollandin

Connectez-vous pour contacter l'auteur
Hélène Gilbert
Pascal Croiseau

Abstract

Abstract Background It is now widespread in livestock and plant breeding to use genotyping data to predict phenotypes with genomic prediction models. In parallel, genomic annotations related to a variety of traits are increasing in number and granularity, providing valuable insight into potentially important positions in the genome. The BayesRC model integrates this prior biological information by factorizing the genome according to disjoint annotation categories, in some cases enabling improved prediction of heritable traits. However, BayesRC is not adapted to cases where markers may have multiple annotations. Results We propose two novel Bayesian approaches to account for multi-annotated markers through a cumulative (BayesRC+) or preferential (BayesRC $\pi$ π ) model of the contribution of multiple annotation categories. We illustrate their performance on simulated data with various genetic architectures and types of annotations. We also explore their use on data from a backcross population of growing pigs in conjunction with annotations constructed using the PigQTLdb. In both simulated and real data, we observed a modest improvement in prediction quality with our models when used with informative annotations. In addition, our results show that BayesRC+ successfully prioritizes multi-annotated markers according to their posterior variance, while BayesRC $\pi$ π provides a useful interpretation of informative annotations for multi-annotated markers. Finally, we explore several strategies for constructing annotations from a public database, highlighting the importance of careful consideration of this step. Conclusion When used with annotations that are relevant to the trait under study, BayesRC $\pi$ π and BayesRC+ allow for improved prediction and prioritization of multi-annotated markers, and can provide useful biological insight into the genetic architecture of traits.

Dates and versions

hal-03770583 , version 1 (27-09-2022)

Licence

Attribution - CC BY 4.0

Identifiers

• HAL Id : hal-03770583 , version 1
• DOI :
• PUBMED :
• WOS :

Cite

Fanny Mollandin, Hélène Gilbert, Pascal Croiseau, Andrea Rau. Accounting for overlapping annotations in genomic prediction models of complex traits. BMC Bioinformatics, 2022, 23 (1), 22 p. ⟨10.1186/s12859-022-04914-5⟩. ⟨hal-03770583⟩

59 View