Abstract : Transcriptome sequencing represents a fundamental source of information for genome-wide studies and transcriptome analysis and will become increasingly important for expression analysis as new sequencing technologies takes over array technology. The identification of the protein-coding region in transcript sequences is a prerequisite for systematic amino acid-level analysis and more specifically for domain identification. In this article, we present FrameDP, a self-training integrative pipeline for predicting CDS in transcripts which can adapt itself to different levels of sequence qualities.
https://hal.inrae.fr/hal-02665259
Déposant : Migration Prodinra <>
Soumis le : dimanche 31 mai 2020 - 06:25:00 Dernière modification le : lundi 23 novembre 2020 - 15:00:07