Semantic Management of Data from Biodiversity and Ecosystem Studies: Toward an Integrated Workflow from Collection to Publication. Application to Plankton Data from Lake Geneva

Christian Pichot; Damien Maurice; Ghislaine Monet; Rachid Yahiaoui; Philippe Clastre; Benjamin Jaillet

Communication Dans Un Congrès Année : 2021

Semantic Management of Data from Biodiversity and Ecosystem Studies: Toward an Integrated Workflow from Collection to Publication. Application to Plankton Data from Lake Geneva

(1) , (2) , (3) , (4) , (1) , (1)

1
2
3
4

Christian Pichot

Fonction : Auteur
PersonId : 737528
IdHAL : christian-pichot
ORCID : 0000-0003-1636-9438
IdRef : 132801302

Ecologie des Forêts Méditerranéennes

Damien Maurice

Fonction : Auteur

SILVA

Ghislaine Monet

Fonction : Auteur
PersonId : 743586
IdHAL : ghislaine-monet
ORCID : 0000-0001-7154-3676

Centre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques

Rachid Yahiaoui

Fonction : Auteur

InfoSol

Philippe Clastre

Fonction : Auteur
PersonId : 747290
IdHAL : philippe-clastre-inrae-paca
ORCID : 0000-0003-0242-0693

Ecologie des Forêts Méditerranéennes

Benjamin Jaillet

Fonction : Auteur
PersonId : 1083287

Ecologie des Forêts Méditerranéennes

Résumé

Biodiversity is a key player in ecosystem characteristics and dynamics. Acting as a driver, it also results from ecosystem functioning. Understanding this complex interplay between biological and physical components is one of the main current challenges in the context of land use changes and climate warming. The acquisition of knowledge on biodiversity requires multidisciplinary approaches and mobilises numerous research teams. Data are collected or computed in large quantity but are most often poorly standardised and therefore heterogeneous. In this context the development of semantic interoperability is a major challenge for the sharing and reuse of these data. This objective is implemented within the framework of the AnaEE (Analysis and Experimentation on Ecosystems) Research Infrastructure dedicated to experimentation on ecosystems and biodiversity. A distributed Information System (IS) is developed, based on the semantic interoperability of its components using common vocabularies (AnaeeThes thesaurus and OBOE-based ontology extended for disciplinary needs) for modelling the studied system. This modelling covers the measured variables including biodiversity, as well as the different components of the experimental or observational context, from sensor to plot and network. Driven by the ontology, the approach relies on the atomic decomposition of each of the components into observed entities, their characteristics and qualifiers, their units or naming standards. The modelling of the system allows the semantic annotation of relational databases or flat files for the production of URIs based graph databases. A first pipeline automates the annotation process and the production of the semantic data. A second pipeline is devoted to the exploitation of these semantic data by generating i) metadata records formatted according to the geospatial extension for the Data Catalog Vocabulary standard and the ISO 19139 standard, and ii) Network Common Data Form data files. The implementation of this integrated semantic management of data is presented here for phytoand zoo-plankton data collected from water columns in Lake Geneva over a 30 years period, as well as for environmental data about water temperature and nutrients. The work carried out contributes to the development and use of semantic vocabularies within the biodiversity and ecology research community, leading to semantically enriched metadata records and interoperable data sets. The genericity of the tools make them usable in different contexts of data production, management and ontologies involved in semantic modelling.

Mots clés

pipeline modelling ontology plankton biodiversity interoperability entity property FAIR data

Domaines

Biodiversité et Ecologie

Fichier principal

paper11-s4biodiv.pdf (2.57 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Sabine ROSSI : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-03579553

Soumis le : mercredi 22 juin 2022-15:28:49

Dernière modification le : mercredi 15 mai 2024-13:20:10

Archivage à long terme le : vendredi 23 septembre 2022-18:53:43

Dates et versions

hal-03579553 , version 1 (22-06-2022)

Identifiants

HAL Id : hal-03579553 , version 1

Citer

Christian Pichot, Damien Maurice, Ghislaine Monet, Rachid Yahiaoui, Philippe Clastre, et al.. Semantic Management of Data from Biodiversity and Ecosystem Studies: Toward an Integrated Workflow from Collection to Publication. Application to Plankton Data from Lake Geneva. Joint Ontology Workshops 2021 Episode VII: The Bolzano Summer of Knowledge, JOWO, Sep 2021, Bolzano, Italy. http://ceur-ws.org/Vol-2969/paper11-s4biodiv.pdf. ⟨hal-03579553⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-SAVOIE AGROPARISTECH OSUG UNIV-LORRAINE SILVA INRAE INRAE-AQUA LIFE D2KAB ANR A2F-UL INFOSOL CARRTEL OLA ZABR URFM INRAEVALDELOIRE DPT_ECODIV INRAEPACA

188 Consultations

39 Téléchargements

Semantic Management of Data from Biodiversity and Ecosystem Studies: Toward an Integrated Workflow from Collection to Publication. Application to Plankton Data from Lake Geneva

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager