Identifying associations between epidemiological entities in news data for animal disease surveillance - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Journal article, Artificial Intelligence in Agriculture, Year: 2021

Identifying associations between epidemiological entities in news data for animal disease surveillance

Abstract

Event-based surveillance systems sit at the crossroads of human, animal, plant, and ecosystem health, epidemiology, statistics, and informatics. Their deployment therefore faces many challenges specific to each domain and to their intersections, such as the relationship between automation, artificial intelligence, and expertise. In this context, our work pertains to the extraction of epidemiological events from textual data (i.e. news) by unsupervised methods. We define the event extraction task as detecting pairs of epidemiological entities (e.g. a disease name and a location). The quality of the ranked lists of pairs was evaluated using ranking-specific evaluation metrics. We used a publicly available annotated corpus of 438 documents (i.e. news articles) related to animal disease events. The statistical approach was able to detect event-related pairs of epidemiological features with a good trade-off between precision and recall. Our results showed that using a window of words outperformed document-based and sentence-based approaches, while reducing the probability of detecting false pairs. They also indicated that Mutual Information was less suitable than the Dice coefficient for ranking pairs of features in the event extraction framework. We believe that Mutual Information would be more relevant for detecting rare pairs (i.e. weak signals), but that it requires more manual curation to avoid extracting false positive pairs. Moreover, generalising spatial features at the country level enabled better discrimination (i.e. ranking) of relevant disease-location pairs for event extraction.
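As a rough illustration of the window-based pair ranking summarised in the abstract, the sketch below counts disease-location pairs that co-occur within a window of tokens and scores them with the Dice coefficient and with pointwise Mutual Information. It is a minimal sketch under stated assumptions, not the authors' implementation: counts are document frequencies, the function names (`count_pairs`, `dice`, `pmi`) are hypothetical, and the placeholder tokens (`disease_a`, `country_x`, ...) are made-up toy data rather than material from the annotated corpus. The paper's exact definitions of the window and of the association measures may differ.

```python
import math
from collections import Counter


def count_pairs(docs, diseases, locations, window=None):
    """Count, per document, which entities and which disease-location pairs occur.

    window=None treats the whole document as the co-occurrence context;
    otherwise a pair counts only if the two mentions are at most `window`
    tokens apart. (Assumption: document-frequency counts.)
    """
    entity_df, pair_df = Counter(), Counter()
    for tokens in docs:
        d_hits = [(i, t) for i, t in enumerate(tokens) if t in diseases]
        l_hits = [(i, t) for i, t in enumerate(tokens) if t in locations]
        # Each entity and each pair is counted at most once per document.
        entity_df.update({t for _, t in d_hits} | {t for _, t in l_hits})
        pair_df.update({
            (d, l)
            for i, d in d_hits
            for j, l in l_hits
            if window is None or abs(i - j) <= window
        })
    return entity_df, pair_df, len(docs)


def dice(pair, entity_df, pair_df):
    """Dice coefficient: 2 * n(x, y) / (n(x) + n(y))."""
    d, l = pair
    return 2 * pair_df[pair] / (entity_df[d] + entity_df[l])


def pmi(pair, entity_df, pair_df, n_docs):
    """Pointwise Mutual Information: log2(N * n(x, y) / (n(x) * n(y)))."""
    d, l = pair
    return math.log2(n_docs * pair_df[pair] / (entity_df[d] * entity_df[l]))


# Toy, made-up documents with placeholder entity tokens.
docs = [
    "disease_a outbreak in country_x".split(),
    "country_x reports disease_a".split(),
    "disease_b detected in country_y far from any previous disease_a case".split(),
    "country_y confirms disease_b".split(),
]
diseases = {"disease_a", "disease_b"}
locations = {"country_x", "country_y"}

# Window-based counts versus document-based counts.
entity_df, pair_df, n_docs = count_pairs(docs, diseases, locations, window=3)
_, doc_pair_df, _ = count_pairs(docs, diseases, locations, window=None)

# Rank the window-based pairs by each association measure.
by_dice = sorted(pair_df, key=lambda p: dice(p, entity_df, pair_df), reverse=True)
by_pmi = sorted(pair_df, key=lambda p: pmi(p, entity_df, pair_df, n_docs), reverse=True)

for pair in by_dice:
    print(pair, dice(pair, entity_df, pair_df), pmi(pair, entity_df, pair_df, n_docs))
```

In this toy setting, the document-based counts in `doc_pair_df` pick up a `disease_a`/`country_y` pair from the third document, whereas the window-based counts in `pair_df` do not, because the two mentions are more than three tokens apart. This is the kind of spurious pair that a narrower context is meant to filter out.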
Main file
Valentin_2021_AIA.pdf (1.82 MB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-03324118, version 1 (23-08-2021)

License

Attribution - NonCommercial - NoDerivatives

Cite

Sarah Valentin, Renaud Lancelot, Mathieu Roche. Identifying associations between epidemiological entities in news data for animal disease surveillance. Artificial Intelligence in Agriculture, 2021, 5, pp.163-174. ⟨10.1016/j.aiia.2021.07.003⟩. ⟨hal-03324118⟩