H-TFIDF: What makes areas specific over time in the massive flow of tweets related to the covid pandemic? - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue AGILE: GIScience Series Année : 2021

H-TFIDF: What makes areas specific over time in the massive flow of tweets related to the covid pandemic?

Résumé

Data produced by social networks may contain weak signals of possible epidemic outbreaks. In this paper, we focus on Twitter data during the waiting period before the appearance of COVID-19 first cases outside China. Among the huge flow of tweets that reflects a global growing concern in all countries, we propose to analyze such data with an adaptation of the TF-IDF measure. It allows the users to extract the discriminant vocabularies used across time and space. The results are then discussed to show how the specific spatio-temporal anchoring of the extracted terms make it possible to follow the crisis dynamics on different scales of time and space.
Fichier principal
Vignette du fichier
Decoupes-2021-H-tfidf-what-makes-areas-specific-o.pdf (4.36 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03279146 , version 1 (06-07-2021)

Licence

Paternité

Identifiants

Citer

Rémy Decoupes, Rodrique Kafando, Mathieu Roche, Maguelonne Teisseire. H-TFIDF: What makes areas specific over time in the massive flow of tweets related to the covid pandemic?. AGILE: GIScience Series, 2021, 2 (2), pp.1-8. ⟨10.5194/agile-giss-2-2-2021⟩. ⟨hal-03279146⟩
88 Consultations
44 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More