Developing Infrastructure to Support Closer Collaboration of Aggregators with Open Repositories
1. Developing Infrastructure to Support Closer
Collaboration of Aggregators with Open
Repositories
Dr. Nancy Pontika & Dr. Petr Knoth
COnnecting Repositories (CORE)
Open University, UK
LIBER 2015, 24 – 26 June, London
2. Mission of CORE
Aggregate all open access content distributed
across different systems worldwide, enrich this
content and provide access to it through a set of
services …
[Source: http://core.ac.uk/about#mission]
3. Need for a UK aggregator
Bringing the UK’s open access
research outputs together:
• Feasibility study commissioned
by Jisc, published June 2014
• Referred to as “Open Mirror”
[Source :
https://repository.jisc.ac.uk/5570/1/JISC_REPORT_open_mirror_09051
4_FINAL_WEB.pdf]
4. Three levels of support
Programmable
Data Access
- CORE API
- CORE Data Dumps
- Researchers
- Developers
- Companies
Transaction
Information
Access
- CORE Portal
- CORE Mobile
- CORE Plugin
- Researchers
- Students
- Life long learners
Analytical
Information
Access
- CORE Policy
-CORE Compliance
Analytics
- CORE Dashboard
- Funders
- Governments
- Data Providers
[Source: http://www.dlib.org/dlib/november12/knoth/11knoth.html]
5. CORE Statistics
• Content: 20M+ records, 600+ repositories, 1.8M+
full-texts
• The UK national aggregator - Jisc
• Full-text aggregator (not just metadata)
• Placed among Top 10 search engines for research
that go beyond Google [Jisc, 2013]
• Listed among Top 100 Thesis and Dissertation
Resources
• Part of Jisc’s Repositories Shared Services Project
(RSSP)
6. Aggregation process
• Metadata download, extraction and cleaning
• Full-text harvesting
• Text extraction
• Language detection
• Extraction of citation references from text
• Identification of related content
• Detection of duplicate items
• Parsing of author names
• Indexing
7. CORE Applications
• CORE Portal
– Search engine providing open access content
• CORE Mobile
– Android and iOS apps
• CORE Plugin
– For repositories and journals
• CORE API
– Programmable access to million of resources
• CORE Dashboard
– Tool for repository managers
8. CORE Dashboard : purpose
• Harvested
Records
• Metadata
• Harvesting
Process
• Standards
• Repository
Managers
• Funders
• Repositories
• Journals
Data
Providers
Collaboration
QualityTransparency
19. Issues : 3 types
When harvesting your repository/document we encountered an error that we couldn't
resolve. These errors need to be fixed in order to to harvest your repository/document.
We encountered an error but we were still able to harvest the repository/document. We
strongly recommend that these issues are resolved as they may lead to incompatibility
problems in the future.
This may not be a problem but it may be a clue for misconfiguration or future
incompatibilities.
27. Dashboard benefits
- Increased and simplified collaboration between
aggregators and content providers
- Improved control of the content provider over
the harvested content
- Reduction of scepticism and fear of sharing
content with other systems
- Improvement of the harvesting process
- Broadening of the open access content
discoverability and thus reuse of the open access
content where permitted
28. Would you like to take a look?
Dashboard still in BETA but we welcome
volunteer testers
Email me at nancy.pontika[at]open.ac.uk
29. Many thanks to…
CORE developers:
• Matteo Cancellieri
• Samuel Pearce
• Drahomira Herrmannova
• Lucas Anastasiou
Volunteer testers:
• Chris Biggs, Metadata & Repository Specialist, Open University
• Nick Sheppard, Repository Developer, Leeds Beckett University
The mission of CORE (COnnecting REpositories) is to aggregate all open access research outputs from repositories and journals worldwide and make them available to the public. In this way CORE facilitates free unrestricted access to research for all.
CORE:
supports the right of citizens and general public to access the results of research towards which they contributed by paying taxes,
facilitates access to open access content for all by offering services to general public, academic institutions, libraries, software developers, researchers, etc.,
provides support to both content consumers and content providers by working with digital libraries, institutional and subject repositories and journals,
enriches the research content using state-of-the-art technology and provides access to it through a set of services including search, API and analytical tools,
contributes to a cultural change by promoting open access, a fast growing movement.
CORE harvests openly accessible content available with respect to the Budapest Open Access Initiative definition:
"By 'open access' to this literature, we mean its free availability on the public internet, permitting any users to read, download, copy, distribute, print, search, or link to the full texts of these articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited."