Data Space Connectors and Federated Catalogue

Obstacle

The core concept of the GDDS is to allow the sharing of protected datasets, in the form of assets, among a set of trusted participants. The GDDS requires a federated system able to plug different participants into the data space. Eclipse Data Connector (EDC) is a potential solution that was tested and evaluated.

Solution

AD4GD implemented a Minimum Viable Dataspace with a Provider Data Connector instance to define and store assets, and a Consumer Data Connector instance to execute transaction and retrieve of assets. An implemented Federated Catalogue allows the registration of several related connectors, allowing data sharing among other registered partners thanks to specific adaptors that help to define and register different HTTP sources as EDC Assets.

Description and usage

The Minimum Viable Dataspace deployed by AD4GD is based on the EDC components. The connectors are the central elements of the data space, providing secure, reliable, and structured data exchange between participants. They are easily manageable thanks to a set of user-friendly Web Data dashboards. The consumer backend extends the connector’s data plane by allowing long term persistence of the assets into user’s storage systems, such as MinIO. Secure and authenticated connections are ensured by an identify provider called DAPS, that functions as an Identity Access Management (IAM) layer, issuing Dynamic Attribute Tokens (DAT) to authenticated and verified participants. A Federated Catalogue works as a data broker by enabling participants to find and access data offerings published by others.

The final demonstrator, includes 5 participant nodes, each acting as a consumer and provider, from 2 different projects (AD4GD and FAIRiCUBE).

Foreseen applications

The Prototype of the GDDS and the Federated Data Catalogue have already been leveraged by THE AD4GD sister projects. Its architecture and components will be exploited as a foundational asset into the SAGE project. The prototype will inform or be reused in other projects like CEADS, SusPot and AgriLabs.