SlideShare ist ein Scribd-Unternehmen logo
1 von 32
TEXT MINING 101:
WHAT YOU SHOULD KNOW
Ethan Pullman (Carnegie Mellon University)
Denise Novak (Carnegie Mellon University)
Kristen Garlock (Ithaka.org)
Patricia Cleary (Springer US)
NASIG annual
Saturday, June 11, 2016 10:30 am
Working with Your Constituents
Ethan Pullman
Humanities Liaison & Library Instruction Coordinator
Carnegie Mellon University
Audience Survey
On a scale from 1-5, novice to experienced, how familiar are you with text mining?
1 -- 2 -- 3 -- 4 -- 5
What is your role in providing Text Mining (TM) Services?
A. We have an expert librarian/service center
B. I work directly with my department(s)
C. Other: (Please describe)
My TM population mainly consists of: Faculty PhDs/TAs Undergrads
How many of you have used TM for your own research/project?
Text Mining Briefly
● What it is?
● What is its purpose?
Photo adapted from Text Mine ‘01
Text mining is the automated processing of large amounts
of structured digital texts
Purpose: retrieval, analysis, and interpretation of texts.
_______________________________________________
Note: Mining non-textual information falls under “data mining”.
Although often included with Text Mining as “Text & Data
Mining”, data mining is different and requires tools and
methodologies that are distinct from text mining.
Text Mining Examples
Visualization tools build word clouds from words mined from large texts.
SDFB mines British early modern texts to trace “social connections”
between individuals from that period (read more)
A class project that used text mining to analyze case documents and
briefs submitted by Authors’ Guild in Authors’ Guild vs. Google. The
analysis shed light on the rhetorical strategy used by Authors’ Guild
lawyers and informed outcome prediction.It Ain’t About the Money, Money, Money…
or is it?
Authors’ Guild vs. Google Books: A Rhetorical Analysis
The Role of Library Liaisons
What is new?
Acquiring texts?
Providing access?
Librarians need to understand:
> how texts are used in the digital age
> what tools are available
> issues impacting acquisition and access
How I stay informed ….
Stay Informed:
Faculty Profiles
● Curriculum Vitae
● Publications
● Syllabi
How I stay informed
● Attend departmental lectures ● Visit research showcases
● Read about campus initiatives
How I stay informed ...
● Maintaining our Text-Mining Website:
● Professional participation:
● Organizations & Conferences: for example, Text Analytics World;
● Social Networks/Email lists, blogs
● Seek continuing education opportunities
● Collaborate with our acquisition and data services librarians
Acquisitions Point of View
Denise Novak
Acquisitions Librarian
Carnegie Mellon University
Supporting Text Mining of the
JSTOR Digital Library
Kristen Garlock
Associate Director of Education and Outreach --
JSTOR
Ithaka
What is Data for
Research?
Data for Research is a self-service
website for generating datasets from
the content on JSTOR.
http://dfr.jstor.org
How it works
Service is free, permitted under Terms & Conditions.
● Data for Research: Researcher creates free account on
site, defines parameters of dataset, submits request,
downloads dataset.
● Full-text datasets: Letter agreement (may be established
with individuals or libraries). Datasets not limited by
licenses or institutional affiliation.
Support for Text Mining
Why?
● Supporting new types of scholarship is part of our mission
● Opportunities to build beneficial partnerships
● Increasing value of publications; corpus in and
of itself has value as a scholarly tool
NOTE: For a bibliography of projects and research that incorporated datasets from JSTOR, please
contact Kristen Garlock (kristen.garlock@ithaka.org)<mailto:kristen.garlock@ithaka.org)>.
Challenges
Biggest challenges:
● Staffing and support
● Keeping up with evolving researcher needs
Trends:
● Increasing numbers of requests
● Requests for larger and more complex datasets
● Interest from non-technologists
● Scholars not anticipating/understanding gaps or data issues in datasets
● Desire to combine datasets from multiple sources
Springer TDM policy
Patricia Cleary
Global eProduct Development Manager
Springer US
Springer TDM Policy Update (June 2016)
• This presentation provides an overview of the current Springer TDM policy.
• Springer is currently working on a new combined TDM policy for Springer Nature.
• The new TDM policy will be announced sometime in the near future.
Springer is currently working on a new combined
TDM policy for Springer Nature
Springer’s TDM policy was introduced in 2014
• The volume of scientific publications is increasing and TDM software tools
continue to improve
• Springer acknowledges the need for a more formalized process to enable TDM
• Strive to make it as simple as possible for researchers
Springer grants text- and data-mining rights to
subscribed content, provided the purpose is
non-commercial research
For researchers with subscription access
• Individual researchers can download subscription and open access content for
TDM purposes directly from the SpringerLink platform
• No registration or API key is required
• Full-text content can be accessed easily and programmatically at friendly URLs
based on the content’s Digital Object Identifier (DOI)
For researchers with no subscription access
• Researchers who do not have subscription access to SpringerLink can send
requests for TDM access to a contact within Springer
• These inquiries will be considered on a case by case basis
Implementation by academic and government institutions
• For subscribers at academic and government institutions, these rights will be
included in all new and renewed SpringerLink subscription agreements as an
additional TDM clause
• Existing subscribers may also add the TDM clause before their agreement is up for
renewal
Use of text and data mining results and research output
• Publications or analyses resulting from TDM of subscribed content may include
quotations from the original text of up to 200 characters, or 20 words, or 1
complete sentence
• Should cite the original Springer content in the form of a DOI link
• Permission to reproduce images may be granted on a case-by-case basis
• For Open Access (OA) publications from Springer, BioMed Central and
SpringerOpen, TDM is usually allowed without restrictions since the majority of
Springer's OA content is licensed under CC-BY
Technical guide to downloading content
• For TDM researchers interested in cross-publisher automated downloading, the
CrossRef TDM initiative may be useful
• Springer is actively collaborating with CrossRef on this project and we expect
Springer content to be fully supported soon
• Guidelines for performing TDM of Springer content are located on the Springer’s
text- and data-mining policy page on Springer.com
Springer Metadata API
• Springer provides the free Springer Metadata API for searching within Springer
content
• Provides rich searching for the vast majority of Springer, BioMed Central and
SpringerOpen documents, including all journal content, book chapters and
protocols
• The Springer Book Archives will soon be searchable through this API as well
Q&A
[Q] Do publishers prefer to sign agreements directly with
researchers, or with the libraries that either have an active
subscription or have purchased the corpus to be mined?
[A] So far, Springer has only signed licenses with libraries. We
are currently focused on customers who have an active
subscription with us. TDM access to content is for researchers
have access to through their institutional subscription, and OA
content.
Q&A (cont’d)
[Q] If libraries do sign agreements on behalf of researchers,
does Springer expect libraries to track or monitor researcher
activities, either for compliance to terms of the agreement, or
for reporting purposes?
[A] Springer doesn't expect libraries to directly monitor
researcher TDM activities as separate from regular content
access activities. TDM access is subject to the same restrictions
as any regular content from a library-researcher relationship
Q&A (cont’d)
[Q] What drives publisher decisions to host data vs. send the data to
libraries for hosting? What types of costs are associated with hosting?
How can libraries support an infrastructure for text mining if the data is
sent on drives, and do publishers mind if researchers get copies of this
data (sort of like a dataset that we buy for them?)
[A] This is different per publisher. Since Springer provides content that is
DRM-free, we can host content on our native site SpringerLink, or offline at
the library.
The advantage of SpringerLink is that the library does not have to
constantly receive updated data from us, and doesn’t have to build a GUI or
Useful links
Springer's Text and Data Mining Policy
https://www.springer.com/gp/rights-permissions/springer-s-text-and-data-mining-
policy/29056
Springer / BioMed Central API Portal
https://dev.springer.com/
CrossRef TDM Initiative
Thank You!
patricia.cleary@springer.com

Weitere ähnliche Inhalte

Was ist angesagt?

Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...NASIG
 
The Future of Information Literacy in the Library: An Example of Librarian/Pu...
The Future of Information Literacy in the Library: An Example of Librarian/Pu...The Future of Information Literacy in the Library: An Example of Librarian/Pu...
The Future of Information Literacy in the Library: An Example of Librarian/Pu...NASIG
 
Promoting Open Access and Open Educational Resources to Faculty
Promoting Open Access and Open Educational Resources to FacultyPromoting Open Access and Open Educational Resources to Faculty
Promoting Open Access and Open Educational Resources to FacultyNASIG
 
Research Support Services ECU Library
Research Support Services ECU LibraryResearch Support Services ECU Library
Research Support Services ECU LibraryJulia Gross
 
Managing discovery and linking services
Managing discovery and linking servicesManaging discovery and linking services
Managing discovery and linking servicesNASIG
 
Virtual support_to_research_communities
Virtual  support_to_research_communitiesVirtual  support_to_research_communities
Virtual support_to_research_communitiesСОБДиЮ
 
Getting on with it (research support at an academic library) presented at Uni...
Getting on with it (research support at an academic library) presented at Uni...Getting on with it (research support at an academic library) presented at Uni...
Getting on with it (research support at an academic library) presented at Uni...Reed Elsevier
 
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...UKSG: connecting the knowledge community
 

Was ist angesagt? (20)

Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
 
The Future of Information Literacy in the Library: An Example of Librarian/Pu...
The Future of Information Literacy in the Library: An Example of Librarian/Pu...The Future of Information Literacy in the Library: An Example of Librarian/Pu...
The Future of Information Literacy in the Library: An Example of Librarian/Pu...
 
Promoting Open Access and Open Educational Resources to Faculty
Promoting Open Access and Open Educational Resources to FacultyPromoting Open Access and Open Educational Resources to Faculty
Promoting Open Access and Open Educational Resources to Faculty
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Research Support Services ECU Library
Research Support Services ECU LibraryResearch Support Services ECU Library
Research Support Services ECU Library
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Hansen Metadata for Institutional Repositories
Hansen Metadata for Institutional RepositoriesHansen Metadata for Institutional Repositories
Hansen Metadata for Institutional Repositories
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Nance "Demystifying Resource Sharing"
Nance "Demystifying Resource Sharing"Nance "Demystifying Resource Sharing"
Nance "Demystifying Resource Sharing"
 
Managing discovery and linking services
Managing discovery and linking servicesManaging discovery and linking services
Managing discovery and linking services
 
Rodriguez No Free Lunch Sept 7
Rodriguez No Free Lunch Sept 7Rodriguez No Free Lunch Sept 7
Rodriguez No Free Lunch Sept 7
 
Virtual support_to_research_communities
Virtual  support_to_research_communitiesVirtual  support_to_research_communities
Virtual support_to_research_communities
 
Goldman "Collaboratively Build Data Science Services and Skills"
Goldman "Collaboratively Build Data Science Services and Skills"Goldman "Collaboratively Build Data Science Services and Skills"
Goldman "Collaboratively Build Data Science Services and Skills"
 
Getting on with it (research support at an academic library) presented at Uni...
Getting on with it (research support at an academic library) presented at Uni...Getting on with it (research support at an academic library) presented at Uni...
Getting on with it (research support at an academic library) presented at Uni...
 
NISO/BISG Changing Standards Landscape: EBook Discovery and Requirements for ...
NISO/BISG Changing Standards Landscape: EBook Discovery and Requirements for ...NISO/BISG Changing Standards Landscape: EBook Discovery and Requirements for ...
NISO/BISG Changing Standards Landscape: EBook Discovery and Requirements for ...
 
Liaison panelrdap2016
Liaison panelrdap2016Liaison panelrdap2016
Liaison panelrdap2016
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...
UKSG 2017 Conference Breakout - Take control of your PhD journey: a librarian...
 

Andere mochten auch

Ambient Intelligence
Ambient IntelligenceAmbient Intelligence
Ambient IntelligenceRam Inamdar
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis worksCJ Jenkins
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweetsVasu Jain
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in TwitterAyushi Dalmia
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSumit Raj
 
Virtual Reality-Seminar presentation
Virtual Reality-Seminar  presentationVirtual Reality-Seminar  presentation
Virtual Reality-Seminar presentationShreyansh Vijay Singh
 

Andere mochten auch (11)

Femto_cells
Femto_cellsFemto_cells
Femto_cells
 
smart glasses
smart glassessmart glasses
smart glasses
 
Ambient Intelligence
Ambient IntelligenceAmbient Intelligence
Ambient Intelligence
 
Oculus Rift
Oculus RiftOculus Rift
Oculus Rift
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Virtual Reality-Seminar presentation
Virtual Reality-Seminar  presentationVirtual Reality-Seminar  presentation
Virtual Reality-Seminar presentation
 
Ambient intelligence
Ambient intelligenceAmbient intelligence
Ambient intelligence
 

Ähnlich wie Text Mining 101: What You Should Know

A Pragmatic Approach to Facilitating Text and Data Mining
A Pragmatic Approach to Facilitating Text and Data Mining A Pragmatic Approach to Facilitating Text and Data Mining
A Pragmatic Approach to Facilitating Text and Data Mining Chris Shillum
 
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...NASIG
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findabilityKristian Norling
 
Pushing on the paywalls: Extending licensed resource access to external part...
Pushing on the paywalls:  Extending licensed resource access to external part...Pushing on the paywalls:  Extending licensed resource access to external part...
Pushing on the paywalls: Extending licensed resource access to external part...NASIG
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - IntroductionMaaike Duine
 
Managing eResources at Universities
Managing eResources at UniversitiesManaging eResources at Universities
Managing eResources at UniversitiesPK Mishra
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librariansC. Tobin Magle
 
Crossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarCrossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarVanessa Fairhurst
 
Crossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarCrossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarCrossref
 
SpringerNature and its sharing strategy on ReadCube
SpringerNature and its sharing  strategy on  ReadCubeSpringerNature and its sharing  strategy on  ReadCube
SpringerNature and its sharing strategy on ReadCubeMartijn Roelandse
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
Relationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusRelationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusUCD Library
 
Evaluating Electronic Resources
Evaluating Electronic ResourcesEvaluating Electronic Resources
Evaluating Electronic ResourcesRichard Bernier
 
Magic willmers presentation_30.06.16
Magic willmers presentation_30.06.16Magic willmers presentation_30.06.16
Magic willmers presentation_30.06.16Michelle Willmers
 
Online Presentation
Online PresentationOnline Presentation
Online Presentationnw13
 
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Chris Shillum
 

Ähnlich wie Text Mining 101: What You Should Know (20)

A Pragmatic Approach to Facilitating Text and Data Mining
A Pragmatic Approach to Facilitating Text and Data Mining A Pragmatic Approach to Facilitating Text and Data Mining
A Pragmatic Approach to Facilitating Text and Data Mining
 
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
 
Roy "Accelerating ML/AI Based R&D through Text & Data Mining"
Roy "Accelerating ML/AI Based R&D through Text & Data Mining"Roy "Accelerating ML/AI Based R&D through Text & Data Mining"
Roy "Accelerating ML/AI Based R&D through Text & Data Mining"
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Pushing on the paywalls: Extending licensed resource access to external part...
Pushing on the paywalls:  Extending licensed resource access to external part...Pushing on the paywalls:  Extending licensed resource access to external part...
Pushing on the paywalls: Extending licensed resource access to external part...
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - Introduction
 
Managing eResources at Universities
Managing eResources at UniversitiesManaging eResources at Universities
Managing eResources at Universities
 
NISO Update ODI June 2014 Morse
NISO Update ODI June 2014 MorseNISO Update ODI June 2014 Morse
NISO Update ODI June 2014 Morse
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librarians
 
Crossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarCrossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinar
 
Crossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinarCrossref for Ambassadors - Introductory webinar
Crossref for Ambassadors - Introductory webinar
 
SpringerNature and its sharing strategy on ReadCube
SpringerNature and its sharing  strategy on  ReadCubeSpringerNature and its sharing  strategy on  ReadCube
SpringerNature and its sharing strategy on ReadCube
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Tabloid
TabloidTabloid
Tabloid
 
Relationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusRelationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the Campus
 
Evaluating Electronic Resources
Evaluating Electronic ResourcesEvaluating Electronic Resources
Evaluating Electronic Resources
 
Magic willmers presentation_30.06.16
Magic willmers presentation_30.06.16Magic willmers presentation_30.06.16
Magic willmers presentation_30.06.16
 
Hawkins "Monitoring Usage of Open Access Long-Form Content"
Hawkins "Monitoring Usage of Open Access Long-Form Content"Hawkins "Monitoring Usage of Open Access Long-Form Content"
Hawkins "Monitoring Usage of Open Access Long-Form Content"
 
Online Presentation
Online PresentationOnline Presentation
Online Presentation
 
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
 

Mehr von NASIG

Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...
Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...
Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...NASIG
 
The Serial Cohort: A Confederacy of Catalogers
The Serial Cohort: A Confederacy of CatalogersThe Serial Cohort: A Confederacy of Catalogers
The Serial Cohort: A Confederacy of CatalogersNASIG
 
Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...NASIG
 
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...NASIG
 
Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments NASIG
 
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...NASIG
 
Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials NASIG
 
Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...NASIG
 
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...NASIG
 
Bloomsbury digital resources
Bloomsbury digital resourcesBloomsbury digital resources
Bloomsbury digital resourcesNASIG
 
Web accessibility in the institutional repository crafting user centered sub...
Web accessibility in the institutional repository  crafting user centered sub...Web accessibility in the institutional repository  crafting user centered sub...
Web accessibility in the institutional repository crafting user centered sub...NASIG
 
Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries NASIG
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration NASIG
 
Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?NASIG
 
Mapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeMapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeNASIG
 
When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020NASIG
 
Getting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentGetting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentNASIG
 
A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...NASIG
 
Knowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementKnowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementNASIG
 
Practical approaches to linked data
Practical approaches to linked dataPractical approaches to linked data
Practical approaches to linked dataNASIG
 

Mehr von NASIG (20)

Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...
Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...
Ctrl + Alt + Repeat: Strategies for Regaining Authority Control after a Migra...
 
The Serial Cohort: A Confederacy of Catalogers
The Serial Cohort: A Confederacy of CatalogersThe Serial Cohort: A Confederacy of Catalogers
The Serial Cohort: A Confederacy of Catalogers
 
Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...
 
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
 
Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments
 
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
 
Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials
 
Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...
 
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
 
Bloomsbury digital resources
Bloomsbury digital resourcesBloomsbury digital resources
Bloomsbury digital resources
 
Web accessibility in the institutional repository crafting user centered sub...
Web accessibility in the institutional repository  crafting user centered sub...Web accessibility in the institutional repository  crafting user centered sub...
Web accessibility in the institutional repository crafting user centered sub...
 
Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration
 
Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?
 
Mapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeMapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing Change
 
When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020
 
Getting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentGetting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides Content
 
A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...
 
Knowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementKnowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource Management
 
Practical approaches to linked data
Practical approaches to linked dataPractical approaches to linked data
Practical approaches to linked data
 

Kürzlich hochgeladen

Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Millenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxMillenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxJanEmmanBrigoli
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 

Kürzlich hochgeladen (20)

YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Millenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxMillenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 

Text Mining 101: What You Should Know

  • 1. TEXT MINING 101: WHAT YOU SHOULD KNOW Ethan Pullman (Carnegie Mellon University) Denise Novak (Carnegie Mellon University) Kristen Garlock (Ithaka.org) Patricia Cleary (Springer US) NASIG annual Saturday, June 11, 2016 10:30 am
  • 2. Working with Your Constituents Ethan Pullman Humanities Liaison & Library Instruction Coordinator Carnegie Mellon University
  • 3. Audience Survey On a scale from 1-5, novice to experienced, how familiar are you with text mining? 1 -- 2 -- 3 -- 4 -- 5 What is your role in providing Text Mining (TM) Services? A. We have an expert librarian/service center B. I work directly with my department(s) C. Other: (Please describe) My TM population mainly consists of: Faculty PhDs/TAs Undergrads How many of you have used TM for your own research/project?
  • 4. Text Mining Briefly ● What it is? ● What is its purpose? Photo adapted from Text Mine ‘01 Text mining is the automated processing of large amounts of structured digital texts Purpose: retrieval, analysis, and interpretation of texts. _______________________________________________ Note: Mining non-textual information falls under “data mining”. Although often included with Text Mining as “Text & Data Mining”, data mining is different and requires tools and methodologies that are distinct from text mining.
  • 5. Text Mining Examples Visualization tools build word clouds from words mined from large texts. SDFB mines British early modern texts to trace “social connections” between individuals from that period (read more) A class project that used text mining to analyze case documents and briefs submitted by Authors’ Guild in Authors’ Guild vs. Google. The analysis shed light on the rhetorical strategy used by Authors’ Guild lawyers and informed outcome prediction.It Ain’t About the Money, Money, Money… or is it? Authors’ Guild vs. Google Books: A Rhetorical Analysis
  • 6. The Role of Library Liaisons What is new? Acquiring texts? Providing access? Librarians need to understand: > how texts are used in the digital age > what tools are available > issues impacting acquisition and access
  • 7. How I stay informed …. Stay Informed: Faculty Profiles ● Curriculum Vitae ● Publications ● Syllabi
  • 8. How I stay informed ● Attend departmental lectures ● Visit research showcases ● Read about campus initiatives
  • 9. How I stay informed ... ● Maintaining our Text-Mining Website: ● Professional participation: ● Organizations & Conferences: for example, Text Analytics World; ● Social Networks/Email lists, blogs ● Seek continuing education opportunities ● Collaborate with our acquisition and data services librarians
  • 10. Acquisitions Point of View Denise Novak Acquisitions Librarian Carnegie Mellon University
  • 11.
  • 12.
  • 13.
  • 14. Supporting Text Mining of the JSTOR Digital Library Kristen Garlock Associate Director of Education and Outreach -- JSTOR Ithaka
  • 15. What is Data for Research? Data for Research is a self-service website for generating datasets from the content on JSTOR. http://dfr.jstor.org
  • 16. How it works Service is free, permitted under Terms & Conditions. ● Data for Research: Researcher creates free account on site, defines parameters of dataset, submits request, downloads dataset. ● Full-text datasets: Letter agreement (may be established with individuals or libraries). Datasets not limited by licenses or institutional affiliation.
  • 17. Support for Text Mining Why? ● Supporting new types of scholarship is part of our mission ● Opportunities to build beneficial partnerships ● Increasing value of publications; corpus in and of itself has value as a scholarly tool NOTE: For a bibliography of projects and research that incorporated datasets from JSTOR, please contact Kristen Garlock (kristen.garlock@ithaka.org)<mailto:kristen.garlock@ithaka.org)>.
  • 18. Challenges Biggest challenges: ● Staffing and support ● Keeping up with evolving researcher needs Trends: ● Increasing numbers of requests ● Requests for larger and more complex datasets ● Interest from non-technologists ● Scholars not anticipating/understanding gaps or data issues in datasets ● Desire to combine datasets from multiple sources
  • 19. Springer TDM policy Patricia Cleary Global eProduct Development Manager Springer US
  • 20. Springer TDM Policy Update (June 2016) • This presentation provides an overview of the current Springer TDM policy. • Springer is currently working on a new combined TDM policy for Springer Nature. • The new TDM policy will be announced sometime in the near future. Springer is currently working on a new combined TDM policy for Springer Nature
  • 21. Springer’s TDM policy was introduced in 2014 • The volume of scientific publications is increasing and TDM software tools continue to improve • Springer acknowledges the need for a more formalized process to enable TDM • Strive to make it as simple as possible for researchers Springer grants text- and data-mining rights to subscribed content, provided the purpose is non-commercial research
  • 22. For researchers with subscription access • Individual researchers can download subscription and open access content for TDM purposes directly from the SpringerLink platform • No registration or API key is required • Full-text content can be accessed easily and programmatically at friendly URLs based on the content’s Digital Object Identifier (DOI)
  • 23. For researchers with no subscription access • Researchers who do not have subscription access to SpringerLink can send requests for TDM access to a contact within Springer • These inquiries will be considered on a case by case basis
  • 24. Implementation by academic and government institutions • For subscribers at academic and government institutions, these rights will be included in all new and renewed SpringerLink subscription agreements as an additional TDM clause • Existing subscribers may also add the TDM clause before their agreement is up for renewal
  • 25. Use of text and data mining results and research output • Publications or analyses resulting from TDM of subscribed content may include quotations from the original text of up to 200 characters, or 20 words, or 1 complete sentence • Should cite the original Springer content in the form of a DOI link • Permission to reproduce images may be granted on a case-by-case basis • For Open Access (OA) publications from Springer, BioMed Central and SpringerOpen, TDM is usually allowed without restrictions since the majority of Springer's OA content is licensed under CC-BY
  • 26. Technical guide to downloading content • For TDM researchers interested in cross-publisher automated downloading, the CrossRef TDM initiative may be useful • Springer is actively collaborating with CrossRef on this project and we expect Springer content to be fully supported soon • Guidelines for performing TDM of Springer content are located on the Springer’s text- and data-mining policy page on Springer.com
  • 27. Springer Metadata API • Springer provides the free Springer Metadata API for searching within Springer content • Provides rich searching for the vast majority of Springer, BioMed Central and SpringerOpen documents, including all journal content, book chapters and protocols • The Springer Book Archives will soon be searchable through this API as well
  • 28. Q&A [Q] Do publishers prefer to sign agreements directly with researchers, or with the libraries that either have an active subscription or have purchased the corpus to be mined? [A] So far, Springer has only signed licenses with libraries. We are currently focused on customers who have an active subscription with us. TDM access to content is for researchers have access to through their institutional subscription, and OA content.
  • 29. Q&A (cont’d) [Q] If libraries do sign agreements on behalf of researchers, does Springer expect libraries to track or monitor researcher activities, either for compliance to terms of the agreement, or for reporting purposes? [A] Springer doesn't expect libraries to directly monitor researcher TDM activities as separate from regular content access activities. TDM access is subject to the same restrictions as any regular content from a library-researcher relationship
  • 30. Q&A (cont’d) [Q] What drives publisher decisions to host data vs. send the data to libraries for hosting? What types of costs are associated with hosting? How can libraries support an infrastructure for text mining if the data is sent on drives, and do publishers mind if researchers get copies of this data (sort of like a dataset that we buy for them?) [A] This is different per publisher. Since Springer provides content that is DRM-free, we can host content on our native site SpringerLink, or offline at the library. The advantage of SpringerLink is that the library does not have to constantly receive updated data from us, and doesn’t have to build a GUI or
  • 31. Useful links Springer's Text and Data Mining Policy https://www.springer.com/gp/rights-permissions/springer-s-text-and-data-mining- policy/29056 Springer / BioMed Central API Portal https://dev.springer.com/ CrossRef TDM Initiative