Language Data Space Connector integration with dSpace

Target audience: CLARIN repository managers.

In collaboration with colleagues from Athena RC and Lindat, we are working on integrating CLARIN dSpace v7 with the Language Data Space (LDS) connector (currently v3.0.0).

The second technical meeting on the integration, covering both the technical and the architectural level, was held on Friday 19 June. We are currently focussing on the technical feasibility study. In this context we have successfully converted a CMDI representation of a dataset in dSpace into LanguageDCAT-AP (the metadata model used in LDS) and imported it as an “asset” (metadata without licensing information) into the LDS connector. In parallel, an export of the default licences used in Lindat dSpace 7 has been shared with the Athena RC colleagues so that they can start converting them into ODRL policies (see attachment to this post). As a next step, combining the two representations (assets and licences) will enable the automated publication of descriptions created in dSpace to the LDS as “offers”.
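To give an idea of what the CMDI-to-asset step involves, here is a minimal sketch in Python. It is not the converter used in the feasibility study. The CMDI element names, the sample record and its handle are hypothetical and heavily simplified (real CMDI records are namespaced and profile-specific), and the output uses only a few generic DCAT and Dublin Core properties rather than the full LanguageDCAT-AP model.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified CMDI-like record (illustration only).
CMDI_SAMPLE = """
<CMD>
  <Components>
    <data>
      <title>Example corpus</title>
      <description>A small example dataset.</description>
      <language>ces</language>
      <handle>http://hdl.handle.net/00000/example</handle>
    </data>
  </Components>
</CMD>
"""


def cmdi_to_asset(xml_text):
    """Map a few CMDI fields to a DCAT-style JSON-LD dictionary.

    The result carries descriptive metadata only, mirroring the idea of
    an "asset" as metadata without licensing information.
    """
    root = ET.fromstring(xml_text)

    def text(tag):
        # Return the stripped text of the first matching element, or None.
        node = root.find(f".//{tag}")
        return node.text.strip() if node is not None and node.text else None

    asset = {
        "@context": {
            "dcat": "http://www.w3.org/ns/dcat#",
            "dct": "http://purl.org/dc/terms/",
        },
        "@type": "dcat:Dataset",
        "@id": text("handle"),
        "dct:title": text("title"),
        "dct:description": text("description"),
        "dct:language": text("language"),
    }
    # Drop properties for which the record has no value.
    return {key: value for key, value in asset.items() if value is not None}


asset = cmdi_to_asset(CMDI_SAMPLE)
print(asset["dct:title"])
```

Keeping the mapping in one small function makes it easy to extend property by property as the translation moves towards the full profile.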

To continue:

  • We will improve the translation from CMDI into LanguageDCAT-AP and work towards implementing the full profile: http://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/profiles/clarin.eu:cr1:p_140352607938.

  • Athena RC colleagues are working on representing the selected licences as ODRL policies and importing them into the LDS Connector installation containers, so that all LDS users can reuse them.

  • We will implement batch import of assets into the LDS, which is more efficient than importing individual collections. In addition, we are looking into limiting the size of a dataset, since dSpace zips it on the fly when it is accessed via the LDS.

  • Once the asset implementation is finished, we will start work on importing offers, in which assets and licences are combined. This will supersede the import of individual assets.

  • We are starting to design and implement the integration of the solution into the dSpace model, so that users can select one or more of the datasets in their CLARIN dSpace repository and publish them to their LDS connector.
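The planned steps (size limit, batch import, and combining assets with licences into offers) could fit together roughly as in the Python sketch below. Everything in it is an assumption made for illustration: the field names (`id`, `licence`, `size_bytes`), the 2 GiB limit, the batch size and the shape of the offer record are hypothetical, and the ODRL policy is a placeholder rather than one of the policies being prepared by Athena RC. No actual LDS connector API is called.

```python
from itertools import islice

# Hypothetical upper bound for datasets that dSpace zips on the fly.
MAX_DATASET_BYTES = 2 * 1024**3


def select_importable(assets, max_bytes=MAX_DATASET_BYTES):
    """Keep only the assets whose dataset size is within the limit."""
    return [a for a in assets if a["size_bytes"] <= max_bytes]


def batches(items, batch_size):
    """Yield successive lists of at most batch_size items."""
    iterator = iter(items)
    while chunk := list(islice(iterator, batch_size)):
        yield chunk


def make_offer(asset, policies):
    """Pair an asset with the policy registered for its licence label."""
    return {"asset": asset, "policy": policies[asset["licence"]]}


# Placeholder policy keyed by licence label (not a real converted licence).
policies = {
    "CC BY 4.0": {
        "@context": "http://www.w3.org/ns/odrl.jsonld",
        "@type": "Set",
        "permission": [{"action": "use"}],
    }
}

assets = [
    {"id": "asset-1", "licence": "CC BY 4.0", "size_bytes": 10_000},
    {"id": "asset-2", "licence": "CC BY 4.0", "size_bytes": 5 * 1024**3},
    {"id": "asset-3", "licence": "CC BY 4.0", "size_bytes": 20_000},
]

offers = [make_offer(a, policies) for a in select_importable(assets)]
for batch in batches(offers, batch_size=2):
    # A real implementation would send each batch to the LDS connector here.
    print([offer["asset"]["id"] for offer in batch])
```

Filtering by size before batching keeps oversized datasets out of every request, and looking up policies by licence label means each converted licence only has to be defined once.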

The next technical meeting is scheduled for 10 July.

dspace_licenses.txt (5.0 KB)