A Data-Driven Score Model to Assess Online News Articles in Event-Based Surveillance System - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Conference paper, Year: 2022

A Data-Driven Score Model to Assess Online News Articles in Event-Based Surveillance System

Abstract

Online news sources are popular resources for learning about current health situations and for developing event-based surveillance (EBS) systems. However, diverse information originating from multiple sources can misinform stakeholders, eventually leading to false health risks being reported. The existing literature contains several techniques for evaluating data quality to minimize the effects of misleading information. However, these methods rely only on the extraction of spatiotemporal information to represent health events. To address this research gap, a score-based technique is proposed that quantifies the data quality of online news articles through three assessment measures: 1) news article metadata, 2) content analysis, and 3) epidemiological entity extraction with natural language processing (NLP) to weight the contextual information. The results are evaluated using classification metrics under two evaluation approaches: 1) a strict approach and 2) a flexible approach. The results show a significant improvement in data quality from filtering irrelevant news, which can potentially reduce false alert generation in EBS systems.
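The scoring idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the weighted sum, the weight values, the 0.5 threshold, and every name below (`ArticleScores`, `quality_score`, `is_relevant`, `precision_recall_f1`) are assumptions made only for illustration. The sketch combines three sub-scores into one quality score, filters articles by a threshold, and computes standard classification metrics against reference labels.

```python
from dataclasses import dataclass


@dataclass
class ArticleScores:
    """Hypothetical per-article sub-scores, each assumed to lie in [0, 1]."""
    metadata: float  # assumed score for news article metadata
    content: float   # assumed score from content analysis
    entities: float  # assumed score from epidemiological entity extraction


# Illustrative weights only; the abstract does not state any values.
WEIGHTS = {"metadata": 0.2, "content": 0.3, "entities": 0.5}


def quality_score(s: ArticleScores) -> float:
    """Weighted sum of the three sub-scores (an assumed combination rule)."""
    return (WEIGHTS["metadata"] * s.metadata
            + WEIGHTS["content"] * s.content
            + WEIGHTS["entities"] * s.entities)


def is_relevant(s: ArticleScores, threshold: float = 0.5) -> bool:
    """Keep an article when its score reaches a hypothetical threshold."""
    return quality_score(s) >= threshold


def precision_recall_f1(predicted, actual):
    """Precision, recall, and F1 for boolean predictions vs. reference labels."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

In use, each article's three sub-scores would be passed to `is_relevant`, and the resulting predictions compared with manually assigned relevance labels through `precision_recall_f1`. How the paper's strict and flexible evaluation approaches differ is not specified in the abstract, so the sketch does not model that distinction.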
No file deposited

Dates and versions

hal-03667926 , version 1 (13-05-2022)

Cite

Syed Mehtab Alam, Elena Arsevska, Mathieu Roche, Maguelonne Teisseire. A Data-Driven Score Model to Assess Online News Articles in Event-Based Surveillance System. 8th Annual International Conference on Information Management and Big Data, SIMBig 2021, Dec 2021, Online, France. pp.264-280, ⟨10.1007/978-3-031-04447-2_18⟩. ⟨hal-03667926⟩