Weakly supervised Named Entity Recognition for Carbon Storage using Deep Neural Networks - Données et Connaissances Massives et Hétérogènes Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Weakly supervised Named Entity Recognition for Carbon Storage using Deep Neural Networks

Résumé

Applying Transfer-Learning based on pre-trained language models has become popular in Natural Language Processing. In this paper, we present a weakly supervised Named Entity Recognition system that uses a pre-trained BERT model and applies two consecutive fine tuning steps. We aim to reduce the amount of human labour required for annotating data by proposing a framework which starts by creating a data set that uses lexicons and pattern recognition on documents. This first noisy data set is used in the first fine tuning step. Then, we apply a second fine tuning step on a small manually refined subset of data. We apply and compare our system with the standard fine tuning BERT approach on large amount of old scanned document. Those documents are North Sea Oil & Gas reports and the knowledge extraction would be used to assess the possibility of future carbon sequestration. Furthermore, we empirically demonstrate the flexibility of our framework showing that it can be applied to entity-identifications in other domains.
Fichier principal
Vignette du fichier
DS_2022.pdf (1.09 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
licence : Copyright (Tous droits réservés)

Dates et versions

hal-04142013 , version 1 (26-06-2023)

Identifiants

Citer

René Gómez Londoño, Sylvain Wlodarczyk, Molood Arman, Francesca Bugiotti, Nacéra Bennacer Seghouani. Weakly supervised Named Entity Recognition for Carbon Storage using Deep Neural Networks. 25th International Conference on Discovery Science (DS 2022), Oct 2022, Montpellier, France. pp.227-242, ⟨10.1007/978-3-031-18840-4_17⟩. ⟨hal-04142013⟩
19 Consultations
39 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More