2. • 619 publishers
Facts and Figures
• Database: 41.2 million documents from over
121,000 titles
• Total documents checked in 2013: 1,098,974
(compared to 629,359 in 2012)
• Total documents checked in 2014 ytd: 1,890,678
(compared to 859,930 in the same period in 2013)
5. Participation and Feedback
Interpreting the Similarity Reports webinar:
http://www.crossref.org/08downloads/webinars/crossche
ck_interpreting_the_similarity_reports_102413.arf
Guide:
http://www.crossref.org/crosscheck/crosscheck_interpreti
ng_the_similarity_reports.pdf
CrossCheck Admin Webinar:
https://www.dropbox.com/s/c5iabdebcxgew42/CrossChe
ck_%20iThenticate%20Admin%20Webinar%2003-11-
2014%2016.05.mov?dl=0
6. • Launched 28th May 2014
• Involves depositing full-text links and license
information
• Deposits: 1,038,803 (11.6m from Elsevier to be
included)
• Get in touch:
http://www.crossref.org/tdm/contact_form.html
• 17 publishers signed up via the web form
• Cost? Free to researchers and the public
• No cost for publishers through 2014
7. Text and Data Mining Steps
• Define problem
• Identify potential corpus to mine
• Discovery (full text links)
• Identification of subset which can be
accessed (license information)
• Download identified corpus
• Text and data mine corpus
8. Cross-Publisher API for Text and
Data Mining
For publishers: http://tdmsupport.crossref.org/publishers/
For researchers: http://tdmsupport.crossref.org/researchers/
Ephorus – acquisition – expanding European content in the database.
310,000 manuscripts being uploaded each month. We need to support this! Both CrossRef and iThenticate.
One of the things coming through for users is the fact that they need to support this. Specifically for CrossCheck users.
Also 11.6m from Elsevier.
CSV upload option
CrossRef service trying to deal with these three steps.
Discovery of where the full text is located, finding out if you have permission to mine it, and then pulling back that corpus of content in order to work on it.
The CrossRef Common API is the main aspect of this service and is designed to allow researchers to easily harvest full text documents from all participating publishers regardless of their business model (e.g. open access, subscription). Common API (protocol) for requesting machine readable full text from many different publishers. Across lots of publishers – not just individual ones. Streamlining of permssions, plus one single point of access for multiple pubs content.
The CrossRef Common API is the main aspect of this service and is designed to allow researchers to easily harvest full text documents from all participating publishers regardless of their business model (e.g. open access, subscription). It makes use of CrossRef DOI content negotiation to provide researchers with links to the full text of content located on the publisher’s site. The publisher remains responsible for actually delivering the full text of the content requested. Thus, open access publishers can simply deliver the requested content while subscription based publishers continue to support subscriptions using their existing access control systems.
Webinars! Plus, look at Geoffrey’s presentation for then.