Gemedoc: a text similarity annotation platform - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Gemedoc: a text similarity annotation platform

Résumé

We present Gemedoc, a platform for text similarity annotation based on the spatial and the thematic dimension. To this end, a two step annotation protocol was designed to assess the similarity between two documents: (1) identification of salient features according to the two analysis dimensions; (2) similarity assessment according to a 4-degree scale. Ultimately, the labeled data retrieved from different corpora could be used as benchmark for text-mining applications.
Fichier non déposé

Dates et versions

hal-02608035 , version 1 (16-05-2020)

Identifiants

Citer

J. Fize, M. Roche, Maguelonne Teisseire. Gemedoc: a text similarity annotation platform. NLDB 2018: 23rd International Conference on Applications of Natural Language to Information Systems, Jun 2018, Paris, France. pp.333-336, ⟨10.1007/978-3-319-91947-8_35⟩. ⟨hal-02608035⟩
8 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More