Genotyping By Sequencing development for Salmo salar: A simulation-based predictive approach using the R package SimRAD. - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Poster De Conférence Année : 2014

Genotyping By Sequencing development for Salmo salar: A simulation-based predictive approach using the R package SimRAD.

Résumé

Application of Next Generation Sequencing platform (NGS) for genotyping purpose in the field of biotechnology, ecology or evolutionary biology is developing quickly. The introduction of efficient methods to reduce genome complexity allows making the most of the huge number of sequences generated by analyzing several individuals in a single run. As a result, numerous approaches for genome complexity reduction have been recently developed using different combinations of restriction enzymes, library construction protocols and fragments size selection. Therefore, the choice of which strategy to use may become cumbersome because it is difficult to anticipate the number of loci resulting from each method and no tool was available to provide guidance. To fill this methodological gap, we developed the R package SimRAD (available on the CRAN at http://cran.r-project.org/ web/packages/SimRAD) for simulation-based prediction of the number of loci expected from alternative Genotyping by Sequencing (GBS) or Restriction Associated DNA (RAD) protocols. This package can be used for non-model species for which no reference genome sequence is available, or for species with a draft or a full reference genome sequence released. We illustrated the practical use of SimRAD by comparing the number of loci expected under different GBS approaches applied in Atlantic salmon. We performed our simulations based on a randomly DNA sequence generated following CG content of 42.6% characteristics of Atlantic salmon and the draft genome sequence (AGKD00000000.1) available as yet for this species. Based on these estimations, we selected a GBS protocol that provided a good compromise between number of loci and potential for individual multiplexing in a single run. We then implemented the GBS method on three individuals using the two restriction enzymes PstI and MseI and a fragment size selection step on the Ion Torrent PGM. This preliminary run resulted in a total of 100000 loci which was within the range of the prediction performed using SimRAD (68000 using the simulated sequence and 135000 using the draft genome sequence). This GBS approach will be scaled up on the Ion Torrent Proton platform enabling analyzing up to 72 individuals per run. The imminent release of the Atlantic salmon reference genome sequence will greatly improve GBS outcome predictions allowing an easier balancing of the tradeoff between the number of loci and the number of individual needed for each GBS application
Fichier non déposé

Dates et versions

hal-02799311 , version 1 (05-06-2020)

Identifiants

  • HAL Id : hal-02799311 , version 1
  • PRODINRA : 264334

Citer

Olivier Lepais, Franck F. Salin, Christophe C. Boury, Erwan Guichoux, Yec'Han Laizet, et al.. Genotyping By Sequencing development for Salmo salar: A simulation-based predictive approach using the R package SimRAD.. 2. International Conference on Integrative Salmonid Biology, Jun 2014, Vancouver, Canada. , 2014. ⟨hal-02799311⟩
10 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More