Journal article in BMC Bioinformatics, 2018

Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations

an example for tomato and grapevine

Mondher Bouzayen

Abstract

Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and functional innovation. This can be achieved by investigating molecular similarities at the sequence level between species, which allows ortholog and paralog genes to be defined. However, the growing number of sequenced genomes, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework.

Results: We present here a multilevel comparative approach to investigate the genome evolutionary relationships and peculiarities of two fleshy fruit species of agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between the tomato and grapevine reference gene annotations. The resulting orthologs are associated with the paralogs detected in each species, permitting the definition of gene networks that are useful for investigating the different relationships. The comparison was also used to reconcile the two collections by updating their functional descriptions. All the results are accessible in ComParaLogs, a dedicated bioinformatics platform available at http://biosrv.cab.unina.it/comparalogs/gene/search

Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, transcript and protein sequences, overcoming the possible limits of exclusively protein-versus-protein comparisons. This allows the definition of reliable ortholog and paralog genes, as well as species-specific gene loci in the two species, despite the possibly draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the gene annotations of the two species and to investigate differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a user-friendly computational workbench in support of comparative analyses between two species.
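
The idea of linking orthologs with their associated paralogs into gene networks can be illustrated with a minimal sketch. It assumes a network is simply a connected component of a graph whose edges are ortholog or paralog relationships; the abstract does not state how ComParaLogs actually builds its networks, so this is an illustration rather than the authors' method. The function name `build_gene_networks` and all gene identifiers are hypothetical.

```python
from collections import defaultdict


def build_gene_networks(ortholog_pairs, paralog_pairs):
    """Group genes into networks.

    Assumption (not from the paper): a network is a connected component
    of the undirected graph whose edges are ortholog or paralog pairs.
    """
    adjacency = defaultdict(set)
    for gene_a, gene_b in list(ortholog_pairs) + list(paralog_pairs):
        adjacency[gene_a].add(gene_b)
        adjacency[gene_b].add(gene_a)

    seen = set()
    networks = []
    # Iterate in sorted order so the result is deterministic.
    for start in sorted(adjacency):
        if start in seen:
            continue
        # Depth-first traversal collects every gene reachable from `start`.
        component = set()
        stack = [start]
        while stack:
            gene = stack.pop()
            if gene in component:
                continue
            component.add(gene)
            stack.extend(adjacency[gene] - component)
        seen |= component
        networks.append(sorted(component))
    return networks


# Hypothetical identifiers, for illustration only.
orthologs = [("tomato_g1", "grape_g1"), ("tomato_g4", "grape_g7")]
paralogs = [("tomato_g1", "tomato_g2"), ("grape_g1", "grape_g3")]
networks = build_gene_networks(orthologs, paralogs)
```

With these toy inputs, the first network joins the ortholog pair `tomato_g1`/`grape_g1` with one paralog from each species, while the second contains only the ortholog pair `tomato_g4`/`grape_g7`.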

Dates and versions

hal-02623377, version 1 (26-05-2020)

License

Attribution

Cite

Luca Ambrosino, Valentino Ruggieri, Hamed Bostan, Marco Miralto, Nicola Vitulo, et al. Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations. BMC Bioinformatics, 2018, 19 (15), pp.85-143. ⟨10.1186/s12859-018-2420-y⟩. ⟨hal-02623377⟩