J. Andres and P. N. Bertin, The microbial genomics of arsenic, FEMS Microbiology Reviews, vol.40, issue.2, pp.299-322, 2016.

D. Blankenberg, G. V. Kuster, N. Coraor, G. Ananda, R. Lazarus et al., Galaxy: A Web-Based Genome Analysis Tool for Experimentalists, Current Protocols in Molecular Biology, vol.89, issue.1, pp.10-11, 2010.

N. A. Bokulich, S. Subramanian, J. J. Faith, D. Gevers, J. I. Gordon et al., Quality-filtering vastly improves diversity estimates from Illumina amplicon sequencing, Nature Methods, vol.10, issue.1, pp.57-59, 2013.

F. Boyer, C. Mercier, A. Bonin, Y. Le Bras, P. Taberlet et al., obitools: a unix-inspired software package for DNA metabarcoding, Molecular Ecology Resources, vol.16, issue.1, pp.176-182, 2015.

Y. Cai and Y. Sun, ESPRIT-Tree: hierarchical clustering analysis of millions of 16S rRNA pyrosequences in quasilinear computational time, Nucleic Acids Research, vol.39, issue.14, pp.e95-e95, 2011.

C. Camacho, G. Coulouris, V. Avagyan, N. Ma, J. Papadopoulos et al., BLAST+: architecture and applications, BMC Bioinformatics, vol.10, issue.1, p.421, 2009.

J. G. Caporaso, J. Kuczynski, J. Stombaugh, K. Bittinger, F. D. Bushman et al., QIIME allows analysis of high-throughput community sequencing data, Nature Methods, vol.7, issue.5, pp.335-336, 2010.

A. M. Comeau, G. M. Douglas, and M. G. Langille, Microbiome Helper: a Custom and Streamlined Workflow for Microbiome Research, mSystems, vol.2, issue.1, pp.127-143, 2017.

C. de Vargas, Ocean plankton. Eukaryotic plankton diversity in the sunlit ocean, Science, vol.348, p.1261605, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01258241

T. Z. DeSantis, P. Hugenholtz, N. Larsen, M. Rojas, E. L. Brodie et al., Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB, Applied and Environmental Microbiology, vol.72, issue.7, pp.5069-5072, 2006.

R. C. Edgar, Search and clustering orders of magnitude faster than BLAST, Bioinformatics, vol.26, issue.19, pp.2460-2461, 2010.

R. C. Edgar, B. J. Haas, J. C. Clemente, C. Quince, and R. Knight, UCHIME improves sensitivity and speed of chimera detection, Bioinformatics, vol.27, issue.16, pp.2194-2200, 2011.

R. C. Edgar, UPARSE: highly accurate OTU sequences from microbial amplicon reads, Nature Methods, vol.10, issue.10, pp.996-998, 2013.

A. M. Eren, H. G. Morrison, P. J. Lescault, J. Reveillaud, J. H. Vineis et al., Minimum entropy decomposition: Unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences, The ISME Journal, vol.9, issue.4, pp.968-979, 2014.
URL : https://hal.archives-ouvertes.fr/hal-02641245

L. Fu, B. Niu, Z. Zhu, S. Wu, and W. Li, CD-HIT: accelerated for clustering the next-generation sequencing data, Bioinformatics, vol.28, issue.23, pp.3150-3152, 2012.

B. Giardine, Galaxy: A platform for interactive large-scale genome analysis, Genome Research, vol.15, issue.10, pp.1451-1455, 2005.

J. Goecks, A. Nekrutenko, J. Taylor, and The Galaxy Team, Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences, Genome Biology, vol.11, issue.8, p.R86, 2010.

J. Goris, K. T. Konstantinidis, J. A. Klappenbach, T. Coenye, P. Vandamme et al., DNA-DNA hybridization values and their relationship to whole-genome sequence similarities, International Journal of Systematic and Evolutionary Microbiology, vol.57, issue.1, pp.81-91, 2007.

B. J. Haas, D. Gevers, A. M. Earl, M. Feldgarden, D. V. Ward et al., Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons, Genome Research, vol.21, issue.3, pp.494-504, 2011.

M. Hess, A. Sczyrba, R. Egan, T. Kim, H. Chokhawala et al., Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen, Science, vol.331, issue.6016, pp.463-467, 2011.

F. Hildebrand, R. Tadeo, A. Voigt, P. Bork, and J. Raes, LotuS: an efficient and user-friendly OTU processing pipeline, Microbiome, vol.2, issue.1, p.30, 2014.

L. V. Hooper, D. R. Littman, and A. J. Macpherson, Interactions Between the Microbiota and the Immune System, Science, vol.336, issue.6086, pp.1268-1273, 2012.

P. Hugenholtz, B. M. Goebel, and N. R. Pace, Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity, Journal of Bacteriology, vol.180, issue.18, pp.4765-4774, 1998.

S. M. Huse, D. M. Welch, H. G. Morrison, and M. L. Sogin, Ironing out the wrinkles in the rare biosphere through improved OTU clustering, Environmental Microbiology, vol.12, issue.7, pp.1889-1898, 2010.

P. Jeraldo, K. Kalari, X. Chen, J. Bhavsar, A. Mangalam et al., IM-TORNADO: A Tool for Comparison of 16S Reads from Paired-End Libraries, PLoS ONE, vol.9, issue.12, p.e114804, 2014.

J. Jovel, J. Patterson, W. Wang, N. Hotte, S. O'Keefe et al., Characterization of the Gut Microbiome Using 16S or Shotgun Metagenomics, Frontiers in Microbiology, vol.7, p.459, 2016.

M. Kim, H. Oh, S. Park, and J. Chun, Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes, International Journal of Systematic and Evolutionary Microbiology, vol.64, issue.Pt_2, pp.346-351, 2014.

K. T. Konstantinidis, A. Ramette, and J. M. Tiedje, The bacterial species definition in the genomic era, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.361, issue.1475, pp.1929-1940, 2006.

E. Kopylova, J. A. Navas-Molina, C. Mercier, Z. Z. Xu, F. Mahé et al., Open-Source Sequence Clustering Methods Improve the State Of the Art, mSystems, vol.1, issue.1, pp.3-15, 2016.

J. J. Kozich, S. L. Westcott, N. T. Baxter, S. K. Highlander, and P. D. Schloss, Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform, Applied and Environmental Microbiology, vol.79, issue.17, pp.5112-5120, 2013.

V. Kunin, A. Engelbrektson, H. Ochman, and P. Hugenholtz, Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates, Environmental Microbiology, vol.12, issue.1, pp.118-123, 2010.

T. Magoc and S. L. Salzberg, FLASH: fast length adjustment of short reads to improve genome assemblies, Bioinformatics, vol.27, issue.21, pp.2957-2963, 2011.

F. Mahé, T. Rognes, C. Quince, C. de Vargas, and M. Dunthorn, Swarm: robust and fast clustering method for amplicon-based studies, PeerJ, vol.2, p.e593, 2014.

D. K. Manter, M. Korsa, C. Tebbe, and J. A. Delgado, myPhyloDB: a local web server for the storage and analysis of metagenomic data, Database, vol.2016, p.baw037, 2016.

M. Martin, Cutadapt removes adapter sequences from high-throughput sequencing reads, EMBnet.journal, vol.17, issue.1, p.10, 2011.

S. J. McIlroy, A. M. Saunders, M. Albertsen, M. Nierychlo, B. J. McIlroy et al., MiDAS: the field guide to the microbes of activated sludge, Database, vol.2015, p.bav062, 2015.

P. J. McMurdie and S. Holmes, phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data, PLoS ONE, vol.8, issue.4, p.e61217, 2013.

O. Mizrahi-Man, E. R. Davenport, and Y. Gilad, Taxonomic Classification of Bacterial 16S rRNA Genes Using Short Sequencing Reads: Evaluation of Effective Study Designs, PLoS ONE, vol.8, issue.1, p.e53608, 2013.

M. C. Nelson, H. G. Morrison, J. Benjamino, S. L. Grim, and J. Graf, Analysis, Optimization and Verification of Illumina-Generated 16S rRNA Gene Amplicon Surveys, PLoS ONE, vol.9, issue.4, p.e94249, 2014.

N. Nguyen, T. Warnow, M. Pop, and B. White, A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity, npj Biofilms and Microbiomes, vol.2, issue.1, p.16004, 2016.

J. Oh, C. Choi, M. Park, B. K. Kim, K. Hwang et al., CLUSTOM-CLOUD: In-Memory Data Grid-Based Software for Clustering 16S rRNA Sequence Data in the Cloud Environment, PLOS ONE, vol.11, issue.3, p.e0151064, 2016.

A. J. Pinto and L. Raskin, PCR Biases Distort Bacterial and Archaeal Community Structure in Pyrosequencing Datasets, PLoS ONE, vol.7, issue.8, p.e43093, 2012.

C. Quast, E. Pruesse, P. Yilmaz, J. Gerken, T. Schweer et al., The SILVA ribosomal RNA gene database project: improved data processing and web-based tools, Nucleic Acids Research, vol.41, issue.D1, pp.D590-D596, 2012.

T. Rognes, T. Flouri, B. Nichols, C. Quince, and F. Mahé, VSEARCH: a versatile open source tool for metagenomics, PeerJ, vol.4, p.e2584, 2016.

P. D. Schloss, S. L. Westcott, T. Ryabin, J. R. Hall, M. Hartmann et al., Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities, Applied and Environmental Microbiology, vol.75, issue.23, pp.7537-7541, 2009.

L. Sinclair, O. A. Osman, S. Bertilsson, and A. Eiler, Microbial Community Composition and Diversity via 16S rRNA Gene Amplicons: Evaluating the Illumina Platform, PLOS ONE, vol.10, issue.2, p.e0116955, 2015.

Q. Wang, G. M. Garrity, J. M. Tiedje, and J. R. Cole, Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy, Applied and Environmental Microbiology, vol.73, issue.16, pp.5261-5267, 2007.