Hosted at The New York Academy of Medicine on November 30, 2016.
Morning Session: Developing Islandora Digital Collections (Panel)
This panel discussion will explore multiple uses and implementations of Islandora, an open source digital repository framework. Panelists will describe their digital projects, how Islandora was utilized and their overall experience.
Afternoon Session: Islandora Demonstration (Hands-on)
Islandora is an OAIS adherent and open source digital repository framework. It combines the Drupal CMS and Fedora Commons repository software, together with additional open source applications, the framework delivers a wide range of functionality out of the box.
This Islandora demonstration will provide users with an overview of how to ingest content, configure the discovery layer and restrict access to content.
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Hello islandora building a digital repository nov 30, 2016 v6
1. Hello Islandora:
Building a Digital Repository
Part One: Panel Discussion
New York Academy of Medicine (The Academy)
November 30, 2016
2. Introductions
Erin Tripp, discoverygarden inc.
Librarian and business development
manager at discoverygarden. Erin has
managed more than 40 digital repository
projects.
❏ Currently enthralled with on the research data management
applications of Islandora
❏ Particular interest in accommodating funder requirements
and large datasets in obscure file formats
❏ Participating in the JISC Research Data Management Shared
Service Framework and collaborating on the University of
JISC testing instance of Islandora
5. The Technology
Presentation & Collaboration
Drupal is the leading open source
content management system with
over 30,000 user contributed
modules from almost 100,000 active
community members.
Drupal serves as the presentation
and collaboration layer in Islandora.
Islandora is a set of Drupal modules
which allow users to manage and
preserve digital assets.
6. The Technology
Search & Discovery
Solr powers some of the most
heavily-trafficked websites and
applications in the world.
Key features include:
Full-text search
Search faceting & filtering
Highly scalable/Fault tolerant
Near real-time indexing
7. The Technology
Storage & Preservation
Fedora Commons is purpose built
for data preservation and long-term
data accessibility.
Key features include:
Auditing & Fixity checks
RDF Support
Scales to millions of objects
Support for virtually any filetype
Files are readily accessible (no
lock-in)
8. Islandora
Open Source Digital Repository
Framework
Organizations can create
robust digital repository
systems tailored to their
specific needs and grow
the system to handle
virtually limitless amounts
of data.
10. Introducing Martha Tenney
Martha Tenney, Barnard College
Martha is the Associate Director of the
Barnard Archives and Special Collections
and oversees digital archives projects.
❏ She manages selection and digitization of archival materials,
works on metadata and design for the digital collections, and
collaborates with Ben Rosner on development and
functionality for the digital collections site.
❏ She holds a Master's in Information Science with a
concentration in Digital Archives, and her undergraduate
degree is in Sociology.
❏ Martha’s research interests include online communities and
Web archiving, artists' archives, and human rights archives.
12. Barnard
A liberal arts college for women.
Affiliated with Columbia
University.
Our collection includes over 125+
years of Barnard’s incredible history.
About 70k individual archival
objects representing: Yearbooks,
Oral Histories, Manuscripts,
Photographs, Special Collections
and Projects.
Involved in the community.
14. Introducing Chris Stanton
Chris Stanton, METRO
Chris is the Digital Projects and Metadata
Librarian at the Metropolitan New York
Library Council (METRO), where he works
on METRO's hosted digital collection
service, Digital Culture of Metropolitan
New York (dcmny.org).
❏ Chris is primarily responsible for all aspects of metadata
creation and editing as well as facilitating the ingest of
partner content into the repository.
❏ At METRO, Chris also facilitates the harvesting and ingestion
of metadata records from libraries, archives and cultural
heritage institutions throughout New York State into the
Empire State Digital Network (ESDN) repository for
contribution to the Digital Public Library of America.
18. Introducing Henry Raine
Henry Raine, NYHS
Henry Raine is the Director of Digital
Projects and Library Technical Services at
the New-York Historical Society, where he
has worked since 1997 managing large-
scale cataloging, archival processing, and
digitization projects.
❏ He recently oversaw the migration of the library’s digital
collections to Islandora and the development of the N-YHS
Islandora site.
❏ Prior to working at the N-YHS, he held positions at the Folger
Shakespeare Library, the National Library of New Zealand,
and the Library of Congress.
❏ Active in several professional organizations, he was Chair of
the Rare Books and Manuscripts Section of the Association
of College and Research Libraries in 2010-2011 and served
on the Board of Directors of the Ephemera Society of
America from 2010 to 2015.
20. Project
Objectives
Build a flexible, customizable
digital repository for the entire
institution, not just the library
Allow institutional branding
Realize the long-term goal of
searching across library and
museum collections
Wean from proprietary software
and become part of a open-
source community
23. Introducing Robin Naughton
Robin Naughton, PhD, The Academy
Robin Naughton, PhD is Head of Digital for
the New York Academy of Medicine
Library. She manages the digital program
for the library, including digitization efforts
and library systems.
❏ Since starting in June 2015, she has worked on migrating the
Library’s digital collections to Islandora and is in the early
phases of development.
❏ Prior to joining the Academy, Dr. Naughton was a Digital
Consultant in educational technology and has worked with
LearningExpress, an EBSCO company, Oxford University
Press, English Language Teaching, and Kaplan Test Prep
and Admissions.
❏ She has managed digital library migrations, interactive
educational products (web & mobile) created for public and
state libraries, the transition of print to eBooks, and
eLearning courses.
28. Question 1
Getting started
Henry,
Tell us about the decision to move
forward with a new digital
repository.
Why did you go ahead with it? What
were the main factors in your
decision?
29. Question 2
Impact
Robin,
Tell us about your experience
moving to an Open Source software.
What were the surprises,
challenges/ successes?
30. Question 3
Community
Martha,
Open Source has a lot to do with
community. Tell us about your
interactions with the Islandora
Community.
What resources were most
valuable?
31. Question 4
Customization
Chris & Martha,
Tell us about your decision to
customize Islandora.
What were the considerations/
implications/ benefits?
34. Question 7
Impact
Martha,
The Barnard College repository has
been in production longer than the
others. Tell us about the impacts of
your repository project overall.
Have there been engagement,
research or other benefits?
37. Top Takeaways
Group Discussion
● What have been the easy
parts?
● What have been the most
difficult tasks?
● What resources did you need
or find helpful?
● What was your AH HA!
moment?
42. Islandora is used by
some of the most
prestigious
organizations
in the world...
43. Islandora
Open Source Digital Repository
Framework
Organizations can create
robust digital repository
systems tailored to their
specific needs and grow
the system to handle
virtually limitless amounts
of data.
44. discoverygarden
Removing Barriers to
Using Open Source
• Service Provider
• Launched in 2010
• 92% of Islandora code is written
by discoverygarden on behalf
of customers
• Uses Drupal Coding Standards,
Scrum Process, QA (Selenium,
Travis CI)
46. Case Study
University of Prince Edward
Island
Research Data Management
Repository to manage the entire
lifecycle of research data.
Project features:
Drupal data management
planning tools
An open source dropbox-like
utility for “working” data called
Pydio
Workflow states for Public,
Embargoed, or Archive content
Minting DataCite DOIs for any
published materials
https://data.upei.ca/
47. Case Study
National Baseball Hall of Fame
and Museum
Special Collections
DAMS with 3,500,000+ Objects
Large-scale architecture
Preservation Strategy
Integration of Library Cataloging
Functions
Museum Functions
Workflow Development
Migration and Consolidation
Integration with Existing Portals and
Apps
Training & Knowledge Transfer
48. Case Study
Boston College
Institutional Repository
Using Islandora as an Institutional
Repository. Launched in 2015. Project
features:
Focused on copyright management
and embargo
Representing metadata in MODS and
embedded ETD-MS
Preserving and exposing assets related
to scholarly publications - research
data
http://dlib.bc.edu/
50. Ingest Methods
Creating, Reading, Updating and
Deleting Content
Content Models act as a template
for adding digital assets to the
repository
E.g. Defines content model, workflow,
metadata entry forms, derivative
generation, viewer, display, etc.
Solution Packs are modules that
apply content models and
manage CRUD operations
Create, Read, Update, Delete
51. Ingest Methods
Over the Web or on
the Command Line
Individual ingest available over
the web
2 GB file limit
Batch Ingest modules over the
web
Zip File Importer
Four Batch Citation Uploaders
2 GB file limit
Scripted ingests on the Command
line for large files or large
batches
53. Discovery
Settings
Search Results,
Facets,
Advanced Search,
Record Display
and more
Basic/ keyword search out of the
box
Easy to configure modules for
collection search, advanced
search, facets, sort, and custom
record display
External discovery modules for
being indexed by Google,
Google Scholar, WorldCat,
Summon and more!
57. Harvesting
Increasing Access to your
Collections
Data Providers are repositories
that expose structured
metadata via OAI-PMH
= Repository
Service Providers then make OAI-
PMH service requests to
harvest that metadata
= Harvester
59. Vocabulary
Request/ Verb/ Service
The action that the service provider (harvester) is requesting from the data
provider (repository)
Response Size
The maximum number of records to issue per response
Resumption Token
When a request returns records greater than the response size a
resumptionToken is issued such that the service provider can resume
harvesting from where it left off
60. Vocabulary… continued
Identify
This request used to retrieve information about a repository. Some of the
information returned is required as part of the OAI-PMH.
Example: YourSite/oai2?verb=Identify
ListMetadataFormats
This request is used to retrieve the metadata formats available from a
repository.
Example: YourSite/oai2?verb=ListMetadataFormats
61. Vocabulary… continued
ListRecords
This request is used to harvest records from a repository. Optional
arguments permit selective harvesting of records based on set membership
and/or datestamp. Example:
YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc
ListSets
This request is used to retrieve the set structure of a repository, useful for
selective harvesting. Specific Collection Example:
YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc&set=ir_citationColl
ection
62. Resources
Want to Learn More
About Islandora?
Islandora Online Discussion
Forums
https://groups.google.com/foru
m/#!forum/islandora
Islandora Documentation
https://wiki.duraspace.org/displ
ay/ISLANDORA715/Start
discoverygarden Case Studies
http://www.discoverygarden.ca/
case-studies/
I’m Erin Tripp, a librarian based in New Brunswick, Canada.
I’ve been involved in the islandora community since 2011 through a company called discoverygarden.
Discoverygarden is an open source software service provider.
Discoverygarden is a partner in the Islandora foundation and the main Islandora code contributor, having built the basis for Islandora 7.
We have served more than a hundred Islandora customers worldwide
So let's talk a little bit about the core components individually.
We’ll start with Drupal.
Many of you are familiar with Drupal, as it is a leading open source content management system with over 30,000 user contributed modules from almost 100,000 active community members or developers. Active community members are vital to any open source project and the same can be said for Drupal. People are invested in Drupal. And that leads to ongoing development and success.
Drupal serves as the presentation layer in Islandora. It’s how we create a website and expose it to the Internet. We also use drupal for collaboration. It has a great framework for users, roles and permissions. We use that for workflow activities and it really does integrate in with a lot of other functionality within Islandora.
Islandora in essence is a suite of Drupal modules that allow us to build repositories on top of Fedora Commons. These modules make it possible to build, populate and configure a digital repository without the aid of a developer.
So next is Apache Solr. Apache Solr is used for discovery within Islandora. Solr is very powerful, flexible and configurable. It’s used on some of the most heavily trafficked websites and applications worldwide.
Key features include:
Full-text search
Search faceting & filtering
It’s Highly scalable and Fault tolerant
And provides Near real-time indexing
For example if we ingest an object into Islandora, it's going to be indexed within Solr almost immediately and will become searchable probably within 30 second after ingest.
The final core component is Fedora Commons. It is the layer that stores and preserves all of our digital content. Fedora commons is purpose built for data preservation and long-term data accessibility.
Key features include:
Auditing & Fixity checks
RDF XML Support
We are able to Scale fedora commons to millions of objects
It has support for virtually any file type
And it’s built for interoperability. You can export or migrate your data in an format so that it can be stored elsewhere.
Islandora is used by hundreds of organizations, worldwide. From academic institutions, museums, nonprofits and other cultural organizations, as well as government departments. Production installations host as little a few hundred objects to many millions of objects and store data as small as a few hundred GB or hundreds of TB. This panel discussion will show some of the diversity in how Islandora is used to build different repositories with various objectives.
I’d now like to introduce you to my companions. Please welcome
Martha Tenney from Barnard College,
Chris Stanton from the Metropolitan New York Library Council. Chris’ colleague Anne couldn’t be here today.
Henry Raine from the New York Historical Society and
Robin Naughton from the New York Academy of Medicine.
A round of applause, please for taking the time to speak to us today.
This next portion of the panel will get these folks up and talking about their individual projects, including
Organization and background
Projects (objective, type of content, coming from X system, new to open source
3-5 minutes maximum.
Martha is the Associate Director of the Barnard Archives and Special Collections and oversees digital archives projects.
She manages selection and digitization of archival materials, works on metadata and design for the digital collections, and collaborates with Ben Rosner on development and functionality for the digital collections site.
Martha
Site launched in 2014 w/ DG
https://barnard.edu/news/librarys-new-digital-collections-showcases-barnards-125-year-history
Ben goes to the site.
Chris is the Digital Projects and Metadata Librarian at the Metropolitan New York Library Council (METRO), where he work on METRO's hosted digital collection service, Digital Culture of Metropolitan New York (dcmny.org). He is primarily responsible for all aspects of metadata creation and editing as well as facilitating the ingest of partner content into the repository.
At METRO, Chris also facilitates the harvesting and ingestion of metadata records from libraries, archives and cultural heritage institutions throughout New York State into the Empire State Digital Network (ESDN) repository for contribution to the Digital Public Library of America.
Prior to joining METRO in 2014, Chris worked for four years as a Collection Description Librarian at the National Library of New Zealand (NLNZ). Prior to joining the NLNZ, he worked for two years at the OCLC Office of Research where he was responsible for supporting the F.A.S.T. (Faceted Application of Subject Terminology) project. Chris holds an M.L.I.S. from Kent State University and a B.A. in History from Miami University.
The Digital Culture of Metropolitan New York site (dcmny.org) is a hosted digital collection service for member institutions of the Metropolitan New York Library Council. We undertook a migration from CONTENTdm to Islandora 3 years ago with the help of Erin and Discovery Garden. METRO has also recently become a member of the Islandora Foundation.
We currently have a number of staff working in various capacities on DCMNY. Anne Karle-Zenith is Digital Services Manager at METRO and managed DCMNY through the migration from CONTENTdm to Islandora. Diego Pino, a core member of the Islandora community, recently joined METRO as a Digital Repository Developer. I spend roughly half of my time on DCMNY, managing metadata, object ingest and other admin tasks (other half of time is spent on ESDN, DPLA service hub for New York State, which harvests metadata from a number of Islandora repositories including DCMNY). Caroline Catchpole our Digitization Specialist, who will be assisting member institutions to digitize collections and make that content available in DCMNY.
We currently host a range of content from a variety of METRO member institutions in New York City and Westchester County. Highlights include fellow “Hello Islandora” panelists the New-York Historical Society and the New York York Academy of Medicine and community archive organizations like the Lesbian Herstory Archive and the LGBT Community Center National History Archive.
Going forward, we plan to provide a range of hosting options and solutions for METRO members looking to add content to DCMNY, including digitization support. Additionally, we hope to continue to contribute to the wider Islandora community through module development, workshops and other collaborative projects.
Chris
Henry Raine | New-York Historical Society
Henry Raine is the Director of Digital Projects and Library Technical Services at the New-York Historical Society, where he has worked since 1997 managing large-scale cataloging, archival processing, and digitization projects. He recently oversaw the migration of the library’s digital collections to Islandora and the development of the N-YHS Islandora site. Prior to working at the N-YHS, he held positions at the Folger Shakespeare Library, the National Library of New Zealand, and the Library of Congress. Active in several professional organizations, he was Chair of the Rare Books and Manuscripts Section of the Association of College and Research Libraries in 2010-2011 and served on the Board of Directors of the Ephemera Society of America from 2010 to 2015. He holds a BA in History from the University of Maryland, an MSLS from the Catholic University of America, and an MA in the History of Design from the Cooper-Hewitt National Design Museum/Parsons School of Design.
Project Objectives
•To build a flexible, customizable digital repository for the entire institution, not just the library
•To allow institutional branding and theming
•To realize the long-term goal of searching across library and museum collections
•To manage all of the institution’s digital assets
•To wean ourselves from proprietary software and become part of a larger open-source development community
Project Objectives
•To build a flexible, customizable digital repository for the entire institution, not just the library
•To allow institutional branding and theming
•To realize the long-term goal of searching across library and museum collections
•To manage all of the institution’s digital assets
•To wean ourselves from proprietary software and become part of a larger open-source development community
CONTENTdm collections
American Manuscripts – 90 compound objects (29,792 images)
Civil War Treasures – 11 compound objects (2,030 images)
Manuscript Collections Relating to Slavery – 1,626 compound objects (21,066 images)
N-YHS Quarterly – 297 compound objects (24,655 images)
Photographs of New York City and Beyond (22,736 images)
Legacy Collections
Revolutionary Era Broadsides – 451 images
Robert Erskine-Simeon DeWitt Maps – 573 images
Papers of William Alexander, Lord Sterling - 1 compound object (632 images)
•Develop capacity to ingest new collections
–Create new top-level collections
–Add to existing collections
•
•Add new content from library collections
–12,000 photographs digitized since migration
–Thousands of pages of manuscripts
–Oral histories (restricted to internal access)
•
•Add new content from museum collections
–Images of paintings, drawings and sculptures soon
–Decorative arts and historical artifacts later
•
•Add other institutional digital assets
Robin Naughton, PhD | The New York Academy of Medicine
Robin Naughton, PhD is Head of Digital for the New York Academy of Medicine Library. She manages the digital program for the library, including digitization efforts and library systems. Since starting in June 2015, she has worked on migrating the Library’s digital collections to Islandora and is in the early phases of development. Prior to joining the Academy, Dr. Naughton was a Digital Consultant in educational technology and has worked with LearningExpress, an EBSCO company, Oxford University Press, English Language Teaching, and Kaplan Test Prep and Admissions. She has managed digital library migrations, interactive educational products (web & mobile) created for public and state libraries, the transition of print to eBooks, and eLearning courses. Dr. Naughton is a user-centered researcher focused on developing interactive systems that are useful, easy to use and enjoyable. She holds a PhD in Information Science from Drexel’s College of Computing and Informatics.
Robin
Robin
Robin
Henry - Tell us about the decision to undertake a new repository project. What was at the heart of the decision to move forward?
Interactions with the community (access to documentation, asking questions, events or training, building or contributing to modules, being part of a release team
Tell us about your decision to customize your repository. What were the considerations/ implications/ benefits?
Data modeling and migration (may include discussion on pre-processing, ingest and potentially post-processing if anyone has anecdotes).
Data modeling and migration (may include discussion on pre-processing, ingest and potentially post-processing if anyone has anecdotes).
Henry - Tell us about the decision to undertake a new repository project. What was at the heart of the decision to move forward?
Top takeaways - challenges/ benefits, sage tidbits you wish you knew when you started.
The result is a process that produces digital repositories that serve users and meet functional requirements.
We have more than a hundred customers worldwide.
We work with the Islandora digital repository framework.
Creating robust digital repository systems for our customers, tailored to their specific needs. The system can grow to handle virtually limitless amounts of data.
Our role in the Islandora ecosystem is that of a service provider.
We remove barriers for organizations who want to use Islandora - and need or want support.
We’ve contributed to over 90% of the Islandora code base on behalf of our many customers.
We use Drupal Coding Standards, Scrum/ Agile Process, Manual and Automated QA (Selenium, Travis CI)
We have been selected to participate in the Research Data Management Shared Service (RDMSS) framework for Lots 1 & 2
Islandora is being used to manage the research data lifecycle from start to finish in the UPEI data repository.
They’ve integrated Drupal data management planning tools, an open source dropbox-like utility for “working” data called Pydio, and minting DataCite DOIs for any published materials.
In essence, they’ve leveraged the existing Islandora tools to create an ecosystem where research data can be more than archived. It is exposed for search, harvest - making it available for re-use.
Different components of the Islandora stack are leveraged for different research data management features.
Research data ingested into the UPEI research data repository can have a status of Public, Embargoed, or Archive. There are different workflow processes per status making data management easier for the researcher.
I’m involved in the JISC research data management shared service framework at the moment and the work developed at UPEI will be adapted to serve organizations in the UK.
Boston College started with our “Jumpstart” installation bundle
It enabled the team at Boston College University Libraries to test the software, define how to modify the theme, receive training and clarify the customizations that would bring their repository to life.
Next, the college undertook a detailed assessment project to outline the approach for migrating data from their DigiTool repository, map existing handles and create new ones, as well as modify the Islandora Video Solution Pack for streaming video.
Boston College launched their new Islandora institutional repository, eScholarship@BC, in February 2015 to showcase and preserve Boston College's scholarly output in digital form and to make it freely accessible globally.
http://dlib.bc.edu/
Case study: http://www.discoverygarden.ca/blog/2015/6/16/boston-college
Next section will be explanations and demonstrations of ingest, discovery and restriction functionality, harvesting.
You can follow along on a sandbox, virtual machine, or simply watch as I go through and explain the steps.
PAUSE TO MAKE SURE FOLKS ARE GOOD TO GO.
At it’s core, Islandora is really a set of Drupal modules that provide a framework for building digital repositories on top of Fedora Commons.
These modules make it possible to build, populate and configure a digital repository without the aid of a developer/
Show Islandora Menu> Utility Modules
http://ondemanddemo.clients.discoverygarden.ca/#overlay=admin/islandora/tools
or Solution Packs
http://ondemanddemo.clients.discoverygarden.ca/#overlay=admin/islandora/solution_pack_config
Individual ingest available over the web
2 GB file limit
Batch Ingest modules over the web
Zip File Importer
Four Batch Citation Uploaders
2 GB file limit
Individual ingest available over the web
2 GB file limit
Batch Ingest modules over the web
Zip File Importer
Four Batch Citation Uploaders
2 GB file limit
Scripted ingests on the Command line for large files or large batches
process will be discussed in detail during Islandora for Developers workshop. But in essence it works like…
GO TO link with Tuque documentation https://github.com/Islandora/islandora/wiki/Working-With-Fedora-Objects-Programmatically-Via-Tuque
Solr for discovery. Flexible, configurable, extensible
Discovery layer could only be configured by a developer four years ago.
Now a series of modules can be configured easily for discovery and record display -- for on site repository.
There have been exciting developments for off site discovery too. They help repo content be discovered through search engines and/ or aggregators
OnSite discovery modules for:
Basic/ Keyword search by default
Advanced search
Facets
Sort
Record display
DEMO
----
No DEMO for the following - but brief overview
OAI
OAI-PMH is a set of protocols to expose structured metadata to harvesters. Allows repositories to be easily harvested by the Digital Public Library of America (DPLA)
Islandora is OAI-PMH compliant and has been approved when submitted to WorldCat (by Nick Ruest at York University) http://www.worldcat.org/search?q=on%3ADGCNT+https%3A%2F%2Fdigital.library.yorku.ca%2Foai2+YOU&qt=results_page
Site map
XML Site map “ tells the search engine about the pages in a site, their relative importance to each other, and how often they are updated. [...] This helps visitors and search engine bots find pages on the site” (Wikipedia, 2014).
Islandora XML Sitemaps creates these maps for your repository so content can be discovered in search engines such as Google, Bing, etc.
Add URLs for Islandora objects to the XML sitemap module's database as custom links. When the XML sitemap module creates its sitemap it will include these custom links.
SEO
Islandora Scholar has a specific SEO (Search engine optimization) setting for adding meta tags to citation objects that can be crawled and aid discovery in Google Scholar.
Add notes from recent webinar
XACML Security Policy Editor provides a graphical user interface to edit XACML policies for objects in a repository or collection. It adds a new tab to each collection called Child Policy and a tab to each item called Item Policy, where permissions can be set on a per User or per Role basis for:
Object Management: Controls who can set XACML policies for an object/collection.
Object Viewing: Controls who can view an object/collection.
Datastreams and MIME types: Controls who can view datastreams by DSID and MIME type.
Scholar embargo
Islandora Embargo is part of Scholar can be used within core Islandora
Utilizes XACML policies to allow for security restrictions that expire on a certain date
Notifies administrators when an object or PDF datastream is a week away from expiring. Also when the embargo is lifted
IP embargo
A Drupal based approach to embargoing content based on IP ranges. This module is not integrated with the Islandora Embargo module.
Resumption Token
This is like a bookmark. When you get to tired to keep reading, you place your bookmark on your page and you can pick up where you left off next time. Resumption Tokens are the same thing.
For example, when a request returns records greater than the response size (discussed on the previous slide) a resumptionToken is issued such that a service provider can resume harvesting from where it left off
IdentifyThis request (verb/ service) used to retrieve information about a repository. Some of the information returned is required as part of the OAI-PMH. Repositories may also employ the Identify verb to return additional descriptive information. YourSite/oai2?verb=Identify
Resumption Token
This is like a bookmark. When you get to tired to keep reading, you place your bookmark on your page and you can pick up where you left off next time. Resumption Tokens are the same thing.
For example, when a request returns records greater than the response size (discussed on the previous slide) a resumptionToken is issued such that a service provider can resume harvesting from where it left off
IdentifyThis request (verb/ service) used to retrieve information about a repository. Some of the information returned is required as part of the OAI-PMH. Repositories may also employ the Identify verb to return additional descriptive information. YourSite/oai2?verb=Identify
ListMetadataFormats
This request (verb/ service) is used to retrieve the metadata formats available from a repository. There’s an optional argument to restrict the request to the formats available for a specific item.YourSite/oai2?verb=ListMetadataFormats
ListRecordsThis request (verb/ service) is used to harvest records from a repository. Optional arguments permit selective harvesting of records based on set membership and/or datestamp. YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc
ListSetsThis request is used to retrieve the set structure of a repository, useful for selective harvesting YourSite/oai2?verb=ListSets (all collections). YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc&set=ir_citationCollection (all objects in a specific collection)