S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic local alignment search tool, Journal of Molecular Biology, vol.215, issue.3, pp.403-410, 1990.
DOI : 10.1016/S0022-2836(05)80360-2

I. Borg and P. Groenen, Modern Multidimensional Scaling: Theory and Applications. Springer Series in Statistics, 2013.
DOI : 10.1007/978-1-4757-2711-1

S. Boutin, S. Y. Graeber, M. Weitnauer, J. Panitz, M. Stahl et al., Comparison of microbiomes from different niches of upper and lower airways in children and adolescents with cystic fibrosis, PLoS ONE, vol.10, issue.1, pp.1-19, 2015.

K. Břinda, M. Sykulski, and G. Kucherov, Spaced seeds improve k-mer-based metagenomic classification, Bioinformatics, vol.31, issue.22, pp.3584-3592, 2015.

A. Z. Broder, On the resemblance and containment of documents, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171), pp.21-29, 1997.
DOI : 10.1109/SEQUEN.1997.666900

L. Cai, L. Ye, A. H. Tong, S. Lok, and T. Zhang, Biased Diversity Metrics Revealed by Bacterial 16S Pyrotags Derived from Different Primer Sets, PLoS ONE, vol.8, issue.1, p.e53649, 2013.
DOI : 10.1371/journal.pone.0053649

URL : http://doi.org/10.1371/journal.pone.0053649

A. Chao, R. L. Chazdon, R. K. Colwell, and T. Shen, Abundance-Based Similarity Indices and Their Estimation When There Are Unseen Species in Samples, Biometrics, vol.62, issue.2, pp.361-371, 2006.
DOI : 10.1111/j.1541-0420.2005.00489.x

E. K. Costello, C. L. Lauber, M. Hamady, N. Fierer, J. I. Gordon et al., Bacterial Community Variation in Human Body Habitats Across Space and Time, Science, vol.326, issue.5960, pp.1694-1697, 2009.
DOI : 10.1126/science.1177486

S. Coveley, M. S. Elshahed, and N. H. Youssef, Response of the rare biosphere to environmental stressors in a highly diverse ecosystem (Zodletone spring, OK, USA), PeerJ, vol.3, p.e1182, 2015.
DOI : 10.7717/peerj.1182

S. Deorowicz, M. Kokot, S. Grabowski, and A. Debudaj-Grabysz, KMC 2: fast and resource-frugal k-mer counting, Bioinformatics, vol.31, issue.10, pp.1569-1576, 2015.
DOI : 10.1093/bioinformatics/btv022

P. Deutsch and J. Gailly, ZLIB Compressed Data Format Specification version 3.3, RFC 1950, 1996.
DOI : 10.17487/rfc1950

E. Drezen, G. Rizk, R. Chikhi, C. Deltel, C. Lemaitre et al., GATB: Genome Assembly & Analysis Tool Box, Bioinformatics, vol.30, issue.20, pp.2959-2961, 2014.
DOI : 10.1093/bioinformatics/btu406

URL : https://hal.archives-ouvertes.fr/hal-01088571

V. B. Dubinkina, D. S. Ischenko, V. I. Ulyantsev, A. V. Tyakht, and D. G. Alexeev, Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis, BMC Bioinformatics, vol.17, p.38, 2016.
DOI : 10.1186/s12859-015-0875-7

Y. Fofanov, Y. Luo, C. Katili, J. Wang, Y. Belosludtsev et al., How independent are the appearances of n-mers in different genomes?, Bioinformatics, vol.20, issue.15, pp.2421-2428, 2004.
DOI : 10.1093/bioinformatics/bth266

S. Genitsaris, S. Monchy, E. Viscogliosi, T. Sime-Ngando, S. Ferreira et al., Seasonal variations of marine protist community structure based on taxon-specific traits using the eastern English Channel as a model coastal system, FEMS Microbiology Ecology, vol.91, issue.5, p.fiv034, 2015.
DOI : 10.1093/femsec/fiv034

V. Gomez-Alvarez, S. Pfaller, J. G. Pressman, D. G. Wahman, and R. P. Revetta, Resilience of microbial communities in a simulated drinking water distribution system subjected to disturbances: role of conditionally rare taxa and potential implications for antibiotic-resistant bacteria, Environ. Sci.: Water Res. Technol., vol.2, pp.645-657, 2016.
DOI : 10.1039/C6EW00053C

E. Karsenti, S. G. Acinas, P. Bork, C. Bowler, C. de Vargas et al., A Holistic Approach to Marine Eco-Systems Biology, PLoS Biology, vol.9, issue.10, p.e1001177, 2011.
DOI : 10.1371/journal.pbio.1001177

URL : https://hal.archives-ouvertes.fr/hal-00691580

W. J. Kent, BLAT - The BLAST-Like Alignment Tool, Genome Research, vol.12, issue.4, pp.656-664, 2002.
DOI : 10.1101/gr.229202

O. Koren, D. Knights, A. Gonzalez, L. Waldron, N. Segata et al., A Guide to Enterotypes across the Human Body: Meta-Analysis of Microbial Community Structures in Human Microbiome Datasets, PLoS Computational Biology, vol.9, issue.1, p.e1002863, 2013.
DOI : 10.1371/journal.pcbi.1002863

P. Legendre and M. De Cáceres, Beta diversity as the variance of community data: dissimilarity coefficients and partitioning, Ecology Letters, vol.16, issue.8, pp.951-963, 2013.
DOI : 10.1111/ele.12141

M. R. Liles, B. F. Manske, S. B. Bintrim, J. Handelsman, and R. M. Goodman, A Census of rRNA Genes and Linked Genomic Sequences within a Soil Metagenomic Library, Applied and Environmental Microbiology, vol.69, issue.5, pp.2684-2691, 2003.
DOI : 10.1128/AEM.69.5.2684-2691.2003

N. Maillet, C. Lemaitre, R. Chikhi, D. Lavenier, and P. Peterlongo, Compareads: comparing huge metagenomic experiments, BMC Bioinformatics, vol.13, issue.Suppl 19, p.S10, 2012.
DOI : 10.1186/1471-2105-13-S19-S10

URL : https://hal.archives-ouvertes.fr/hal-00760332

N. Maillet, G. Collet, T. Vannier, D. Lavenier, and P. Peterlongo, Commet: Comparing and combining multiple metagenomic datasets, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp.94-98, 2014.
DOI : 10.1109/BIBM.2014.6999135

URL : https://hal.archives-ouvertes.fr/hal-01080050

H. B. Nielsen, M. Almeida, A. S. Juncker, S. Rasmussen, J. Li et al., Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes, Nature Biotechnology, vol.32, issue.8, pp.822-828, 2014.
DOI : 10.1038/nbt.2939

URL : https://hal.archives-ouvertes.fr/hal-01195477

B. D. Ondov, T. J. Treangen, P. Melsted, A. B. Mallonee, N. H. Bergman et al., Mash: fast genome and metagenome distance estimation using MinHash, Genome Biology, vol.17, p.132, 2016.
DOI : 10.1186/s13059-016-0997-x

S. Pavoine, E. Vela, S. Gachet, G. de Bélair, and M. B. Bonsall, Linking patterns in phylogeny, traits, abiotic variables and space: a novel approach to linking environmental filtering and plant community assembly, Journal of Ecology, vol.99, issue.1, pp.165-175, 2011.
DOI : 10.1111/j.1365-2745.2010.01743.x

URL : https://hal.archives-ouvertes.fr/halsde-00611063

G. Piganeau, A. Eyre-Walker, N. Grimsley, and H. Moreau, How and why DNA barcodes underestimate the diversity of microbial eukaryotes, PLoS ONE, vol.6, issue.2, p.e16342, 2011.

G. Rizk, D. Lavenier, and R. Chikhi, DSK: k-mer counting with very low memory usage, Bioinformatics, vol.29, issue.5, pp.652-653, 2013.
DOI : 10.1093/bioinformatics/btt020

URL : https://hal.archives-ouvertes.fr/hal-00778473

N. Segata, L. Waldron, A. Ballarini, V. Narasimhan, O. Jousson et al., Metagenomic microbial community profiling using unique clade-specific marker genes, Nature Methods, vol.9, issue.8, pp.811-814, 2012.
DOI : 10.1038/nmeth.2066

S. Seth, N. Välimäki, S. Kaski, and A. Honkela, Exploration and retrieval of whole-metagenome sequencing samples, Bioinformatics, vol.30, issue.17, pp.2471-2479, 2014.
DOI : 10.1093/bioinformatics/btu340

A. Shade, S. E. Jones, J. G. Caporaso, J. Handelsman, R. Knight et al., Conditionally Rare Taxa Disproportionately Contribute to Temporal Changes in Microbial Diversity, mBio, vol.5, issue.4, p.e01371-14, 2014.
DOI : 10.1128/mBio.01371-14

H. Teeling, J. Waldmann, T. Lombardot, M. Bauer, and F. O. Glöckner, TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences, BMC Bioinformatics, vol.5, issue.1, p.163, 2004.
DOI : 10.1186/1471-2105-5-163

V. I. Ulyantsev, S. V. Kazakov, V. B. Dubinkina, A. V. Tyakht, and D. G. Alexeev, MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data, Bioinformatics, vol.32, issue.18, pp.2760-2767, 2016.
DOI : 10.1093/bioinformatics/btw312

R. H. Whittaker, Vegetation of the Siskiyou Mountains, Oregon and California, Ecological Monographs, vol.30, issue.3, pp.279-338, 1960.
DOI : 10.2307/1943563

D. E. Wood and S. L. Salzberg, Kraken: ultrafast metagenomic sequence classification using exact alignments, Genome Biology, vol.15, issue.3, p.R46, 2014.
DOI : 10.1186/gb-2014-15-3-r46

Y. Wu and Y. Ye, A Novel Abundance-Based Algorithm for Binning Metagenomic Sequences Using l-Tuples, Journal of Computational Biology, vol.18, issue.3, pp.523-534, 2011.
DOI : 10.1089/cmb.2010.0245

S. Yooseph, G. Sutton, D. B. Rusch, A. L. Halpern, S. J. Williamson et al., The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families, PLoS Biology, vol.5, issue.3, p.e16, 2007.
DOI : 10.1371/journal.pbio.0050016