Biocluster: an NGS-dedicated HPC cluster in IMBBC, HCMR Upcoming Upgrade Usage/Partners Training

Jacques Lagnel; Tereza Manousaki; Georgios Kotoulas; Antonios Magoulas

Poster De Conférence Année : 2016

Biocluster: an NGS-dedicated HPC cluster in IMBBC, HCMR Upcoming Upgrade Usage/Partners Training

(1) , (1) , (1) , (1)

Jacques Lagnel

Fonction : Auteur
PersonId : 745107
IdHAL : jacques-lagnel
ORCID : 0000-0002-2967-1639

Hellenic Centre for Marine Research

Tereza Manousaki

Fonction : Auteur
PersonId : 784023
ORCID : 0000-0001-7518-0542

Hellenic Centre for Marine Research

Georgios Kotoulas

Fonction : Auteur

Hellenic Centre for Marine Research

Antonios Magoulas

Fonction : Auteur

Hellenic Centre for Marine Research

Résumé

Modern biology is largely shaped by the development of next generation sequencing (NGS) technology. Researchers sequence massively model and non-model organisms from single cell transcriptomes, to whole genomes with applications that range from medicine up to ecology and conservation. However, the great challenge following the scaling up of sequencing throughput is data analysis. The amounts of data produced by NGS experiments require high computational power and cannot be analyzed in desktop computers. This fact underlines the need for HPC platforms that allow the analysis of high throughput data in reasonable timeframes. In IMBBC (HCMR), an HPC cluster named Biocluster has been built, dedicated to bioinformatics applications and NGS data analysis. Since 2010, when the cluster was first launched, we have been configuring it to accommodate more than 200 state-of-the-art pieces of software. On top of that, we have been developing parallelized pipelines for NGS data analysis with special focus on the challenges rising from sequencing non-model species. The parallelized pipelines include raw reads pre-preprocessing, gene annotation, variant discovery, population genetic analyses, metabarcoding analyses and others. Biocluster can accommodate all possible OMICS data schemes in an efficient and optimized way, allowing analyses for various experimental designs in a speed comparable to that of modern sequencing data production. The unprecedented collection of sequence analysis software and the availability of parallelized pipelines turn this platform to a unique bioinformatics tool. TMM-FPKM

Domaines

Biologie végétale

Fichier principal

HPC_HCMR_v4_TM_j_1.pdf (9.21 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Migration ProdInra : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-02799037

Soumis le : vendredi 5 juin 2020-17:03:51

Dernière modification le : jeudi 23 novembre 2023-13:40:17

Dates et versions

hal-02799037 , version 1 (05-06-2020)

Identifiants

HAL Id : hal-02799037 , version 1
PRODINRA : 423237

Citer

Jacques Lagnel, Tereza Manousaki, Georgios Kotoulas, Antonios Magoulas. Biocluster: an NGS-dedicated HPC cluster in IMBBC, HCMR Upcoming Upgrade Usage/Partners Training. 9. Hellenic Bioinformatics 2016, Nov 2016, Thessalonica, Greece. 2016. ⟨hal-02799037⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRAE BIOLOGIE_ET_AMELIORATION_DES_PLANTES

27 Consultations

7 Téléchargements

Biocluster: an NGS-dedicated HPC cluster in IMBBC, HCMR Upcoming Upgrade Usage/Partners Training

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager