Calmip's Logo

Callisto

Proposed by
Calmip's logo

About

This section covers the essential data management key concepts that you need to understand to use Callisto. You’ll also find some tools and links to documents that will help you get a better understanding.

A use-case

Data Management Platforms (DMP)

Understanding the FAIR principles

  • The dedicated page of the Zenodo Web site presents a comprehensive overview of the FAIR principles.
  • Opidor is a (french-speaking) tool for managing scientific data, available online.
  • It offers a set of templates for DMPs.
  • It also presents an online interface for writing DMPs (require authentication).
  • A list of data-related services is also provided.

Data archives and storage

EOSC is the main web front-end for european open science. It aims at providing a federated access point for many services. https://zenodo.org/ allows the sharing of scientific datasets.

Metadata, interoperability and harvesting

Understanding metadata harvesting

The Web site for open archives initiative (OAI) presents the basic concepts of Web content interoperability. It also gives an overview of the metadat harvesting mechanisms, and in-depth description of the OAI protocols and metadata standards that those protocols use for data dissemination. Using Dublin Core metadata (also available as an ontology) is mandatory for using the PMH protocol of OAI.

Interoperability itself, though, is a much broader question than metadata description and repositories harvesting. Before diving in to the details (a little) more, it may be useful to specify two major initiatives, namely OGC and IVOA that built a solid set of standards, protocols and relevant softwares for their scientific communities.

For the geospatial community

  • OGC (Open Geospatial Consortium) is dedicated to the definition of standards and protocols for describing and accessing geospatial data.
  • A list of the corresponding products is available on the OGC Web site.
  • A complete application implementing many standards from the OGC is freely downloadable and usable: Its name is GeoServer.

For astrophysicists

International Virtual Observatory Alliance (IVOA) provides for astrophysics the same set of functionalities that OGC provides for geospatial data.

Interoperability, EIF and Ontologies

Interoperability is a wide area of research, which is not limited to data interoperability but also applies to software, industrial products and processes for example. The European commission adopted in 2017 a framework for interoperability that covers many aspects.

In 2006, the role of ontologies in interoperability had already been clearly stated by David Chen (CHEN, David. Enterprise Interoperability Framework. In: EMOI-INTEROP. 2006.) in the framework EIF, that also fits for scientific data. Ontologies play a key role for extracting interoperability needs from services or data, and providing "federated" interoperability (the kind of interoperability that does not need a native modification of the data, neither to conform to a predefinite meta-model).

The Open Biological and Biomedical Ontology (OBO) not only provides ontologies for Biological-related domains, but also some high level ontologies suitable for any domain of discourse.

Basic Formal Ontology (BFO), SUMO and DOLCE are top-level ontologies that are worth studying before diving into the work of writing an ontology.

More comprehensive information about ontologies and vocabularies can be found on the W3C Web site.