Taming the massive genome of Scots pine with PiSy50k, a new genotyping array for conifer research - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2021

Taming the massive genome of Scots pine with PiSy50k, a new genotyping array for conifer research

Chedly Kastally
Alina Niskanen
Komlan Avia
Sandra Cervantes
  • Fonction : Auteur
Matti Haapanen
  • Fonction : Auteur
Timo Kumpula
  • Fonction : Auteur
Tiina Mattila
  • Fonction : Auteur
Dario Ojeda
  • Fonction : Auteur
Jaakko Tyrmi
  • Fonction : Auteur
Witold Wachowiak
  • Fonction : Auteur
Stephen Cavers
  • Fonction : Auteur
Katri Kärkkäinen
  • Fonction : Auteur
Outi Savolainen
  • Fonction : Auteur
Tanja Pyhäjärvi

Résumé

Summary Scots pine ( Pinus sylvestris ) is the most widespread coniferous tree in the boreal forests of Eurasia and has major economic and ecological importance. However, its large and repetitive genome presents a challenge for conducting genome-wide analyses such as association studies and genomic selection. We present a new 50K SNP genotyping array for Scots pine research, breeding programs, and other applications. To select the SNP set, we first genotyped 480 Scots pine samples on a 407 540 SNP screening array, and identified 47 712 high-quality SNPs for the final array (called ‘PiSy50k’). Here, we provide details of the design and testing, as well as allele frequency estimates from the discovery panel, functional annotation, tissue-specific expression patterns, and expression level information for the SNPs or corresponding genes, when available. We validated the performance of the PiSy50k array using samples from breeding populations from Finland and Scotland. Overall, 39 678 (83.2%) SNPs showed low error rates (mean = 0.92%). Relatedness estimates based on array genotypes were consistent with the expected pedigrees, and the amount of Mendelian error was negligible. In addition, array genotypes successfully discriminate Scots pine populations from different geographic origins. The PiSy50k array will be a valuable tool for future genetic studies and forestry applications. Significance statement Scots pine is an evolutionary, economically and ecologically impressive coniferous species but its gigantic genome has limited studying e.g. the genetic basis of its functional trait variation. We have developed a genotyping array that facilitates Scots pine genetic research and linking its trait variation to genetic polymorphisms and gene expression levels across the genome.

Dates et versions

hal-03335067 , version 1 (05-09-2021)

Identifiants

Citer

Chedly Kastally, Alina Niskanen, Annika Perry, Sonja Kujala, Komlan Avia, et al.. Taming the massive genome of Scots pine with PiSy50k, a new genotyping array for conifer research. 2021. ⟨hal-03335067⟩
33 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Mastodon Facebook X LinkedIn More