OpenAIRE schirrwagen
1. CRIS and OAR entities as Linked
Data in scholarly communication –
a vision scenario
Jochen Schirrwagen
Bielefeld University Library, Germany
2. About OpenAIRE - Motivation
Implementation of a support infrastructure for
the European Open Access pilot (2009-2012)
Research Assessment
– Identification, Capture, Measurement of EC
funded FP7 project results (Special Clause 39)
Addressing of Interoperability aspects regarding
– European Commission (EC) Tools
– Current Research Information Systems (CRIS)
  • JISC Research Excellence Framework
– Open Access Repositories (OAR)
2 London, 14th of July 2011 – Linked Data and Libraries
3. OpenAIRE facts
Itself an EC funded FP7 project
38 OpenAIRE partners across Europe
27 National Open Access Liaison Offices
6887 projects in FP7
– Using CORDA as authoritative source
An estimated 10,000 publications
Data sources (striving for compliance with the OpenAIRE Guidelines)
– Institutional repositories
  • Using OpenDOAR as authoritative source of ~850 repositories
– Coverage of subject-based repositories planned
– Coverage of OA journals planned
4. Interoperability Scenario
[Diagram: data flows into OpenAIRE]
– CRIS: project data at the institution (KE-CRIS-OAR; PMH; ORE)
– CORDA: project data at the EC
– OA-Repositories: bibliographic data (DC; OAI-PMH)
– Author: deposit/claiming of publications related to a project
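The OAI-PMH link between the repositories and OpenAIRE in the scenario above can be illustrated with a minimal sketch of how a harvester might build a `ListRecords` request. The endpoint URL and the set name are assumptions made for illustration; they are not taken from the slides.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL for a repository endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        params["set"] = set_spec      # selective harvesting by set
    if from_date is not None:
        params["from"] = from_date    # incremental harvesting, e.g. "2011-07-01"
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint and set name, for illustration only.
url = list_records_url("https://repository.example.org/oai", set_spec="ec_fundedresources")
print(url)
```

Requesting `oai_dc` corresponds to the "DC" label in the diagram; a richer format would only change the `metadata_prefix` argument.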
5. Candidates for Entities &
Vocabularies
6. Challenges – Data and
Interoperability
Capturing research output from different domains
involves:
Different responsibilities and tasks
Different metadata formats used
Different metadata exchange interfaces and
protocols
Different metadata granularity
– In CRIS -> fine
– In OAR -> coarse
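The difference in granularity can be sketched as follows: a fine-grained, CRIS-style record is flattened into a coarse, Dublin Core-style record, and detail is lost on the way. The field names and the record itself are hypothetical; the grant number is a placeholder.

```python
def to_coarse_dc(cris_record):
    """Flatten a fine-grained, CRIS-style record into a coarse Dublin Core-style dict."""
    creators = [f"{p['family_name']}, {p['given_name']}" for p in cris_record["persons"]]
    return {
        "title": cris_record["title"],
        "creator": creators,                         # roles and affiliations are dropped
        "relation": [cris_record["project"]["id"]],  # project reduced to one identifier
    }


# Hypothetical CRIS-style record: persons and project are structured entities.
cris_record = {
    "title": "An example publication",
    "persons": [
        {"family_name": "Doe", "given_name": "Jane",
         "role": "author", "affiliation": "Example University"},
    ],
    "project": {"id": "info:eu-repo/grantAgreement/EC/FP7/000000",
                "acronym": "EXAMPLE", "funder": "EC"},
}

dc_record = to_coarse_dc(cris_record)
print(dc_record)
```

The reverse direction is the hard one: nothing in the coarse record says that the creator was affiliated with a given organisation or which role she had.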
7. Challenges – Data and
Interoperability
In the CRIS domain
– Covers the research process
– Run by the administrative department
– Broader view on research information
– Diverse data models and formats
  • CERIF (-like) models
  • DDF-MXD, METIS, PURE
In the OAR domain
– Covers research publications
– Run by the library department
– Focus on bibliographic quality
– Diverse metadata formats
  • DC, DIDL/MODS, EPrints
8. Issues addressed
by KE CRIS-OAR
Working group within the quadrilateral Knowledge
Exchange Initiative (KE: SURF-NL, JISC-UK, DFG-DE, DEFF-DK)
Aiming to increase interoperability between CRIS
and OAR domains
– Increasing metadata quality and re-use
– Increasing level of interface standards
– By taking existing formats into account:
  • Defining a metadata exchange format
  • With a corresponding set of common vocabularies
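How a common vocabulary supports an exchange format can be shown with a small sketch that maps local publication-type labels onto shared terms. The terms and mappings below are hypothetical and are not the KE CRIS-OAR vocabulary.

```python
# Hypothetical common vocabulary and local-to-common mappings (illustration only).
COMMON_PUBLICATION_TYPES = {"article", "book", "conference-paper", "thesis"}

LOCAL_TO_COMMON = {
    "journal article": "article",
    "zeitschriftenartikel": "article",
    "phd thesis": "thesis",
    "dissertation": "thesis",
    "proceedings paper": "conference-paper",
}


def normalise_type(local_label):
    """Return the common term for a local label, or None if it is unmapped."""
    key = local_label.strip().lower()
    if key in COMMON_PUBLICATION_TYPES:
        return key
    return LOCAL_TO_COMMON.get(key)


print(normalise_type(" Journal Article "))
print(normalise_type("poster"))
```

Returning `None` for unmapped labels, instead of guessing, keeps the gaps in the mapping visible to whoever maintains the vocabulary.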
9. Publication entity as the center
of interest
[Diagram: Publication in the center, linked to Person, Organisation, Project and Event]
10. How could “Linked Data” help?
A common way of linking content from distinct domains
– Use of native web technologies
  • Controlled vocabularies may help to tame semantic variability
– URIfying named entities
  • Data publishers keep control of their data
  • Avoids context loss compared to interchange formats
  • May avoid double input and thus redundant data in each domain
  • Vocabulary helps to tame semantic variability
May reduce the identifier problem by assigning persistent URIs to the entities
– Person (Author) identifier (DAI, ORCID)
– Publication identifier (DOI, URN, …)
– Project identifier (?)
– Event identifier (?)
– Organisation identifier (?)
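"URIfying" the entities listed above could look like the following sketch, which turns an entity kind and a local identifier into an HTTP URI. The DOI and ORCID resolver prefixes are the public ones; the base URIs for project, event and organisation are hypothetical, since the slide leaves those identifiers open.

```python
from urllib.parse import quote

# URI patterns per entity kind. The example.org patterns are hypothetical.
URI_PATTERNS = {
    "publication": "https://doi.org/{id}",
    "person": "https://orcid.org/{id}",
    "project": "https://data.example.org/project/{id}",
    "event": "https://data.example.org/event/{id}",
    "organisation": "https://data.example.org/organisation/{id}",
}


def entity_uri(kind, identifier):
    """Mint a URI for an entity; characters unsafe in a URI path are percent-encoded."""
    return URI_PATTERNS[kind].format(id=quote(identifier, safe="/"))


# Placeholder identifiers, for illustration only.
print(entity_uri("publication", "10.1234/example"))
print(entity_uri("person", "0000-0000-0000-0000"))
print(entity_uri("project", "FP7 000000"))
```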
11. Aggregation of Interlinked Data
Task to be addressed:
– “bulk import/export” of publication and
project data -> new wording “exposure”
– Representing different views on the
information packages, e.g.:
  • Publication as an Aggregation of Person,
    Organisation, Project and Event entities
  • Nested aggregation as a collection of all
    publications, where each publication is itself
    an aggregation of related CRIS-OAR entities
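The two views can be sketched with OAI-ORE terms, written out as N-Triples. This is a minimal illustration, not the serialization shown in the presentation: all `example.org` URIs are hypothetical, and only `ore:Aggregation` and `ore:aggregates` are used.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
ORE = "http://www.openarchives.org/ore/terms/"


def triple(subject, predicate, obj):
    """Format one N-Triples statement whose three terms are all URIs."""
    return f"<{subject}> <{predicate}> <{obj}> ."


def aggregation_triples(aggregation_uri, member_uris):
    """Type a resource as ore:Aggregation and list the resources it aggregates."""
    lines = [triple(aggregation_uri, RDF_TYPE, ORE + "Aggregation")]
    lines += [triple(aggregation_uri, ORE + "aggregates", m) for m in member_uris]
    return lines


BASE = "https://data.example.org/"  # hypothetical namespace
publication = BASE + "publication/1#aggregation"
collection = BASE + "publications#aggregation"

# View 1: a publication aggregating its related entities.
triples = aggregation_triples(publication, [
    BASE + "person/1", BASE + "organisation/1", BASE + "project/1", BASE + "event/1",
])
# View 2: a collection aggregating publications, each itself an aggregation.
triples += aggregation_triples(collection, [publication])

print("\n".join(triples))
```

Because the nested view only refers to the publication aggregation by its URI, the collection can be exposed without copying the entity data it contains.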
13. Extract of an ORE RDF
Serialization
14. Next Steps
Adopting KE CRIS-OAR model and vocabulary
Addressing “Linked Data” in OpenAIREplus
starting in 12/2011
– Linkage of publications and research data
– Linkage of CRIS and OAR domains
Further scenarios may include linked data as
citations or statistical data
15. Further Links
• Portal and project home: www.openaire.eu
• EC pilot: ec.europa.eu/research/science-society/open_access
• CERIF: www.eurocris.org
• KE CRIS-OAR: http://knowledge-exchange.info/Default.aspx?ID=340
• CRIS-OAR schema and vocabulary:
https://infoshare.dtv.dk/twiki/bin/view/KeCrisOar/KeCrisOarFormat
Jochen Schirrwagen
jochen.schirrwagen@uni-bielefeld.de
Wolfram Horstmann
wolfram.horstmann@uni-bielefeld.de