Journal article

Proteogenomics‐Guided Evaluation of RNA‐Seq Assembly and Protein Database Construction for Emergent Model Organisms

Abstract: Proteogenomics is gaining momentum because genomics, transcriptomics, and proteomics can now be readily performed on any new species. This approach allows key alterations to molecular pathways to be identified when comparing conditions. For animals and plants, RNA‐seq‐informed proteomics is the most popular means of interpreting tandem mass spectrometry (MS/MS) spectra acquired for species whose genome has not yet been sequenced. It relies on high‐performance de novo RNA‐seq assembly and optimized translation strategies. Here, several pre‐treatments for Illumina RNA‐seq reads before assembly are explored to translate the resulting contigs into useful polypeptide sequences. Experimental transcriptomics and proteomics datasets acquired for individual Gammarus fossarum freshwater crustaceans are used, and the most relevant procedure is defined by the ratio of MS/MS spectra assigned to peptide sequences. Removing reads with a mean quality score of less than 17 (which represents a single probable nucleotide error on 150‐bp reads) prior to assembly increases the proteomics outcome. The best translation using Transdecoder is achieved with a minimal open reading frame (ORF) length of 50 amino acids and systematic selection of ORFs longer than 900 nucleotides. Using these parameters, transcriptome assembly and translation informed by proteomics pave the way to further improvements in proteogenomics.
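The read pre-treatment mentioned in the abstract (discarding reads whose mean quality score is below 17) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names (`mean_phred`, `parse_fastq`, `filter_by_mean_quality`) are hypothetical, and the sketch assumes unwrapped four-line FASTQ records, Phred+33 quality encoding, and an arithmetic mean of per-base scores.

```python
from typing import Iterable, Iterator, List, Tuple

# (header, sequence, quality string); a hypothetical record type for this sketch
FastqRecord = Tuple[str, str, str]


def mean_phred(quality: str, offset: int = 33) -> float:
    """Arithmetic mean of per-base Phred scores (Phred+33 encoding assumed)."""
    if not quality:
        return 0.0
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def parse_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield records from four-line FASTQ text (assumes no wrapped sequences)."""
    cleaned = [ln.rstrip("\n") for ln in lines if ln.strip()]
    for i in range(0, len(cleaned) - 3, 4):
        header, seq, _plus, qual = cleaned[i:i + 4]
        yield header, seq, qual


def filter_by_mean_quality(
    records: Iterable[FastqRecord], min_mean: float = 17.0
) -> List[FastqRecord]:
    """Keep reads whose mean quality is at least min_mean (drop those below)."""
    return [rec for rec in records if mean_phred(rec[2]) >= min_mean]


if __name__ == "__main__":
    # Toy input: 'I' encodes Q40, '#' encodes Q2, '2' encodes Q17.
    text = "@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\n####\n@r3\nACGT\n+\n2222\n"
    kept = filter_by_mean_quality(parse_fastq(text.splitlines()))
    print([rec[0] for rec in kept])
```

In practice a dedicated read-trimming tool would normally handle this step; the sketch only makes the threshold rule concrete.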

https://hal.inrae.fr/hal-02922078
Deposited by: Davide Degli Esposti
Submitted on: Tuesday, 25 August 2020 - 17:40:22
Last modified on: Tuesday, 1 September 2020 - 03:34:41

Citation

Yannick Cogne, Duarte Gouveia, Arnaud Chaumot, Davide Degli-Esposti, Olivier Geffard, et al. Proteogenomics‐Guided Evaluation of RNA‐Seq Assembly and Protein Database Construction for Emergent Model Organisms. Proteomics, Wiley-VCH Verlag, 2020, 20 (10), pp. 1-11. ⟨10.1002/pmic.201900261⟩. ⟨hal-02922078⟩
