Exploiting next-generation sequencing to obtain a whole-genome SNP collection in black poplar (Populus nigra L.)
Résumé
Poplar trees can be found in both domesticated and wild environments. This situation provides extensive experimental opportunity for the study of the relationship between naturally occurring genotypic and phenotypic variation. Having access to a population with little or no human disturbance means that extant populations are the result of natural evolutionary forces and questions pertaining to adaptation and demography will not be anthropomorphically confounded. The genus Populus is useful as a model tree species, since many genetic and genomic resources are available, including mapping pedigrees, association populations, genetic and QTL maps and the genome reference sequence of the black cottonwood, P. trichocarpa (Torr. & Gray). Taking advantage of these resources and exploiting rapidly-evolving new sequencing technologies, we set out to produce a sequence-marker map made of SNPs and short indels covering the whole genome. Such data will be used as markers for association studies and genomic-based breeding for adaptive traits (e.g. phenology, leaf development, water use efficiency and rust resistance). This work focused on black poplar (P. nigra L.), the native and most wide-spread poplar species in Europe. A combined process was adopted to resequencing various P. nigra genomes. The first step was deep-coverage resequencing (≥ 20X) of three poplar individuals from different latitudinal origins, which provided for efficient SNP detection. Alignment of 75 and 100 bp sequences to the P. trichocarpa genome reference seems to work well even though two different species are being compared. This part of the work provided consensus sequences of P. nigra and a wide-genome SNP collection. This SNP detection was supported and confirmed by P. nigra sequences of candidate genes for the above-mentioned adaptive traits, which were resequenced using the traditional Sanger method. The consensus sequences obtained will be used in the second step of the process to align sequences from 50 additional individuals in order to maximize SNP discovery. These poplar genotypes have already been selected from various European natural populations and their lower coverage (2X) resequencing is in progress. A final discussion on sequence data analysis and marker distribution across the poplar genome is provided.