Deciphering Arabidopsis thaliana gene neighborhoods through bibliographic co-citations - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Computers and Chemistry Année : 2002

Deciphering Arabidopsis thaliana gene neighborhoods through bibliographic co-citations

Résumé

In the framework of genome annotation, scientific literature is obviously the major source of biological knowledge. The aim of the work described in this paper is to exploit this source of data for the model plant Arabidopsis thaliana. The first step has consisted in constituting a relevant bibliographic references dataset for plant genomic research. Genes co-citations have then been systematically annotated in this reference dataset, starting from the simple idea that if genes are cited in the same publication, they must probably share some related functional properties. In order to deal with the synonymous gene name problem; a gene name reference list has been constituted starting from A. thaliana SwissProt entries. This list was used to build clusters of co-cited genes by a single linkage procedure such that any gene in a given cluster possesses at least one co-cited partner in the same cluster. Analysis of the clusters demonstrate the biological consistency of this approach, with only very few fortuitous links. As an example, a cluster including genes related to flowering time is more deeply described in the paper. Finally, a graphical representation of each cluster was performed, which provides a convenient way to retrieve the genes (the nodes of the graphs) and the references in which they were co-cited (the edges of the graphs). All the results can be accessed at the URL.

Dates et versions

hal-02676436 , version 1 (31-05-2020)

Identifiants

Citer

A. Louis, Hélène Chiapello, C. Fabry, E. Ollivier, A. Henaut. Deciphering Arabidopsis thaliana gene neighborhoods through bibliographic co-citations. Computers and Chemistry, 2002, 26 (5), pp.511-519. ⟨10.1016/S0097-8485(02)00011-6⟩. ⟨hal-02676436⟩
15 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Mastodon Facebook X LinkedIn More