SlideShare ist ein Scribd-Unternehmen logo
1 von 21
OAI & OAI-PMH Albulena Bruncaj LIS 882  Metadata for Internet Resources
What is OAI? The Open Archives Initiative OAI is a framework that deals with interoperability standards for digital resources Traces its roots to the open access and institutional repository movements (e-prints) It is “explicitly in transition” Their goal is to define a “low-barrier” framework for cross-repository interoperability
OAI-PMH – A Brief History The Open Archives Initiative Protocol for Metadata Harvesting The framework provides a technical mechanism for harvesting metadata from repositories Santa Fe meeting (1999) Cross-searching multiple archives based on Z39.50 or harvesting metadata into one or more "central" services in a bulk move of data? OAI-PMH 1.0/1.1 followed by OAI-PMH 2.0
History (continued)
OAI-PMH
Why use OAI-PMH? Easily and rapidly deployed because it is XML and HTTP based Interoperability Visibility Can use a single search interface
Who Participates? Two classes: Data Providers Service Providers 1357 registered OAI-PMH repositories — http://www.openarchives.org/Register/BrowseSites
Multiple data & service providers
Aggregators
Repositories As defined by OAI-PMH, a repository is a network-accessible server that exposes metadata to harvesters Three entities related to this accessible metadata: Resource Item Record
Records header (mandatory) identifier  Datestamp (YYYY-MM-DD) setSpec elements status attribute for deleted item metadata (mandatory) XML encoded metadata with root tag, namespaces repositories must support Dublin Core, may support other formats  about (optional) rights statements provenance statements
setSpec Sets are optional Enable a logical partitioning of repositories, but they are not necessarily hierarchical Not necessarily exhaustive of content Helps facilitate selective harvesting, however Publication type Document type Content sets
The Fun Part! OAIster http://oaister.worldcat.org/
Requests & Reponses Both use HTTP Request types (also called the “verbs”) Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord
Organizational Decisions
DSpace OAI-compliant Free, out-of-the-box software that enables access to digital content Examples: http://timea.rice.edu/browse.jsp http://modiya.nyu.edu/
CONTENTdm CONTENTdm is a software used for the management of digital collections Widely used Metadata in CONTENTdm can be harvested through OAI-PMH Allows collection curators to extend basic Dublin Core schema to include locally defined fields Example: http://content.lib.washington.edu/
CONTENTdm – Tension  At minimum, simple Dublin Core is required CONTENTdm offers an easy way to make their metadata (and thus, digital collections) available while simultaneously providing curators with the means to create local, non-standardized metadata
Issues with OAI-PMH Libraries face the challenge of creating metadata that both meets the requirements of local practices in granularity (e.g., putting certain content in fields not harvestable by others) and wanting to share their digital collections widely Is there any solution? Or are we all left to figure out our library’s balance on our own?
Questions?
Bibliography An overview of OAI OAI-PMH. 12 Nov. 2010. Retrieved from http://www.slideworld.org/viewslides.aspx/An-Overview-of-OAI--OAI-PMH-ppt-2369922 Han, M., Cho, C., Cole, T.W., and Jackson, A.S. “Metadata for special collections in CONTENTdm: How to improve interoperability of unique fields through OAI-PMH.” Journal of library metadata, 9(3/4), 213-238. OAI. 12 Nov. 2010. Retrieved from http://www.openarchives.org/ "OAI-PMH online tutorial." Open Archives Forum.  12 Nov. 2010. Retrieved from http://www.oaforum.org/tutorial/ Zeng, M.L., and Qin, J. (2008). "Metadata repositories." In Metadata (224-232). New York: Neal-Schuman.

Weitere ähnliche Inhalte

Was ist angesagt?

International federation of library associations and institutions
International federation of library associations and institutionsInternational federation of library associations and institutions
International federation of library associations and institutions
Zainuddin Ibrahim
 
Controlled Vocabulary
Controlled VocabularyControlled Vocabulary
Controlled Vocabulary
guest118a9a
 
Institutional repositories
Institutional repositoriesInstitutional repositories
Institutional repositories
Smita Chandra
 

Was ist angesagt? (20)

Soul
Soul Soul
Soul
 
Desidoc
DesidocDesidoc
Desidoc
 
ISO 2709
ISO 2709ISO 2709
ISO 2709
 
Dublin core Presentation
Dublin core PresentationDublin core Presentation
Dublin core Presentation
 
Unisist ppt
Unisist pptUnisist ppt
Unisist ppt
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
 
International federation of library associations and institutions
International federation of library associations and institutionsInternational federation of library associations and institutions
International federation of library associations and institutions
 
Controlled Vocabulary
Controlled VocabularyControlled Vocabulary
Controlled Vocabulary
 
CAS & SDI service
CAS & SDI serviceCAS & SDI service
CAS & SDI service
 
Information Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overviewInformation Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overview
 
ISBD
ISBDISBD
ISBD
 
NISCAIR.pptx
NISCAIR.pptxNISCAIR.pptx
NISCAIR.pptx
 
citation analysis
citation analysiscitation analysis
citation analysis
 
DELNET.pptx
DELNET.pptxDELNET.pptx
DELNET.pptx
 
BIBLIOMETRICS LAWS
BIBLIOMETRICS LAWSBIBLIOMETRICS LAWS
BIBLIOMETRICS LAWS
 
Reference services in Libraries
Reference services in LibrariesReference services in Libraries
Reference services in Libraries
 
Institutional repositories
Institutional repositoriesInstitutional repositories
Institutional repositories
 
alerting services.pptx
alerting services.pptxalerting services.pptx
alerting services.pptx
 
Dspace software
Dspace softwareDspace software
Dspace software
 
Information products
Information products Information products
Information products
 

Ähnlich wie OAI and OAI-PMH

Towards an Infrastructure for Mining Scientific Publications
Towards an Infrastructure for Mining Scientific PublicationsTowards an Infrastructure for Mining Scientific Publications
Towards an Infrastructure for Mining Scientific Publications
petrknoth
 
Hub Distributed Model 2009
Hub Distributed Model 2009Hub Distributed Model 2009
Hub Distributed Model 2009
Jane Stevenson
 

Ähnlich wie OAI and OAI-PMH (20)

The Open Archives Initiative Protocol for Metadata Harvesting
The Open Archives Initiative Protocol for Metadata HarvestingThe Open Archives Initiative Protocol for Metadata Harvesting
The Open Archives Initiative Protocol for Metadata Harvesting
 
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UKThe Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
 
Open for Business Open Archives, OpenURL, RSS and the Dublin Core
Open for Business  Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business  Open Archives, OpenURL, RSS and the Dublin Core
Open for Business Open Archives, OpenURL, RSS and the Dublin Core
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
Digitisation and institutional repositories 3
Digitisation and institutional repositories 3Digitisation and institutional repositories 3
Digitisation and institutional repositories 3
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Harvesting&Metadata Enrich Project EVA 2009
Harvesting&Metadata Enrich Project   EVA 2009Harvesting&Metadata Enrich Project   EVA 2009
Harvesting&Metadata Enrich Project EVA 2009
 
Towards an Infrastructure for Mining Scientific Publications
Towards an Infrastructure for Mining Scientific PublicationsTowards an Infrastructure for Mining Scientific Publications
Towards an Infrastructure for Mining Scientific Publications
 
Hub Distributed Model 2009
Hub Distributed Model 2009Hub Distributed Model 2009
Hub Distributed Model 2009
 
Sharing with the Open Archives Initiative
Sharing with the Open Archives InitiativeSharing with the Open Archives Initiative
Sharing with the Open Archives Initiative
 
Networked digital library through harvesting
Networked digital library through harvestingNetworked digital library through harvesting
Networked digital library through harvesting
 
Metadata april 8 2013
Metadata april 8 2013Metadata april 8 2013
Metadata april 8 2013
 
New ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipNew ICT Trends and Issues of Librarianship
New ICT Trends and Issues of Librarianship
 
From Provider to Portal - a chain of interoperability
From Provider to Portal - a chain of interoperabilityFrom Provider to Portal - a chain of interoperability
From Provider to Portal - a chain of interoperability
 
EuroSakai CLIF project presentation
EuroSakai CLIF project presentationEuroSakai CLIF project presentation
EuroSakai CLIF project presentation
 
OAI-PMH
OAI-PMHOAI-PMH
OAI-PMH
 
The JISC Information Environment and collection description
The JISC Information Environment and collection descriptionThe JISC Information Environment and collection description
The JISC Information Environment and collection description
 
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 

Kürzlich hochgeladen

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

OAI and OAI-PMH

  • 1. OAI & OAI-PMH Albulena Bruncaj LIS 882 Metadata for Internet Resources
  • 2. What is OAI? The Open Archives Initiative OAI is a framework that deals with interoperability standards for digital resources Traces its roots to the open access and institutional repository movements (e-prints) It is “explicitly in transition” Their goal is to define a “low-barrier” framework for cross-repository interoperability
  • 3. OAI-PMH – A Brief History The Open Archives Initiative Protocol for Metadata Harvesting The framework provides a technical mechanism for harvesting metadata from repositories Santa Fe meeting (1999) Cross-searching multiple archives based on Z39.50 or harvesting metadata into one or more "central" services in a bulk move of data? OAI-PMH 1.0/1.1 followed by OAI-PMH 2.0
  • 6. Why use OAI-PMH? Easily and rapidly deployed because it is XML and HTTP based Interoperability Visibility Can use a single search interface
  • 7. Who Participates? Two classes: Data Providers Service Providers 1357 registered OAI-PMH repositories — http://www.openarchives.org/Register/BrowseSites
  • 8. Multiple data & service providers
  • 10. Repositories As defined by OAI-PMH, a repository is a network-accessible server that exposes metadata to harvesters Three entities related to this accessible metadata: Resource Item Record
  • 11. Records header (mandatory) identifier Datestamp (YYYY-MM-DD) setSpec elements status attribute for deleted item metadata (mandatory) XML encoded metadata with root tag, namespaces repositories must support Dublin Core, may support other formats about (optional) rights statements provenance statements
  • 12. setSpec Sets are optional Enable a logical partitioning of repositories, but they are not necessarily hierarchical Not necessarily exhaustive of content Helps facilitate selective harvesting, however Publication type Document type Content sets
  • 13. The Fun Part! OAIster http://oaister.worldcat.org/
  • 14. Requests & Reponses Both use HTTP Request types (also called the “verbs”) Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord
  • 16. DSpace OAI-compliant Free, out-of-the-box software that enables access to digital content Examples: http://timea.rice.edu/browse.jsp http://modiya.nyu.edu/
  • 17. CONTENTdm CONTENTdm is a software used for the management of digital collections Widely used Metadata in CONTENTdm can be harvested through OAI-PMH Allows collection curators to extend basic Dublin Core schema to include locally defined fields Example: http://content.lib.washington.edu/
  • 18. CONTENTdm – Tension At minimum, simple Dublin Core is required CONTENTdm offers an easy way to make their metadata (and thus, digital collections) available while simultaneously providing curators with the means to create local, non-standardized metadata
  • 19. Issues with OAI-PMH Libraries face the challenge of creating metadata that both meets the requirements of local practices in granularity (e.g., putting certain content in fields not harvestable by others) and wanting to share their digital collections widely Is there any solution? Or are we all left to figure out our library’s balance on our own?
  • 21. Bibliography An overview of OAI OAI-PMH. 12 Nov. 2010. Retrieved from http://www.slideworld.org/viewslides.aspx/An-Overview-of-OAI--OAI-PMH-ppt-2369922 Han, M., Cho, C., Cole, T.W., and Jackson, A.S. “Metadata for special collections in CONTENTdm: How to improve interoperability of unique fields through OAI-PMH.” Journal of library metadata, 9(3/4), 213-238. OAI. 12 Nov. 2010. Retrieved from http://www.openarchives.org/ "OAI-PMH online tutorial." Open Archives Forum. 12 Nov. 2010. Retrieved from http://www.oaforum.org/tutorial/ Zeng, M.L., and Qin, J. (2008). "Metadata repositories." In Metadata (224-232). New York: Neal-Schuman.

Hinweis der Redaktion

  1. Bullet 1: Don’t be deceived by the word “Archive” in the title—this isn’t what archivists, as we know them, work with. Bullet 2: They “aim to facilitate the efficient dissemination of content.”Bullet 3: OAIshares the objective of supporting access to scholarly materials. Instead of traditional archival materials, OAI serves as a networked repository of scholarly papers. When created, it served as a way to make e-prints, or grey literature, available.Bullet 4: This is their way of saying that they don’t really have technological mechanisms and economic assistance to offer; rather, they offer their framework and their assurance that once the proper technology and standards are created, their organization will adapt its mission and organization to better suit whatever may happen.Bullet 5: “Low-barrier” means that lots of people can use it—it applies to many different institutions and situations. They “believe that exposing metadata is plausible route to such a goal.”
  2. Bullet 3: Technical experts met to discuss two major interoperability problems: end users were faced with multiple search interfaces making resource discovery harder, and there was no machine-based way of sharing the metadata. They wanted, ideally, to create a universal service for author-archived scholarlyresouces, but decided that the first step would need to be to identify or create interoperable technologies and frameworks for the dissemination of e-prints. They came up with an architecture they called UPS, but quickly changed the name because they didn’t want to be mistaken for the United Parcel Service. With some changes, it became OAI-PMH 1.0.Bullet 4: 1.0 was released in January 2001 and an update (1.1) was released just a few months later, in June 2001. There were some minor updates dealing with the XML. 2.0 was released in 2002 and is not compatible with the earlier versions. Instead of using oai_marc schema, they switched to MARCXML.
  3. A helpful image from oaforum.org’s tutorial. XML was their standard from the beginning, but notice the changes in the “ABOUT” row. This shows, succinctly, the evolution of what the creators thought OAI-PMH would be used for. As we know, institutional repositories have not received the response that was hoped for, even though some (invested) people still shower it with praise.
  4. Bullet 1: Not to be understated. In an environment of limited funding and experts, this is invaluable.Bullet 2: Our favorite! Interoperability makes your resources visible to others and thus, metadata from different places can searchable on a single interface. (We’ll play with a single interface soon.) But don’t mistake “CAN” for “ABSOLUTELY DOES.”
  5. Bullet 1: Data Providers administer systems that support the OAI-PMH as a means of exposing metadata; they do not necessarily offer access to full-text.Service Providers use metadata harvested via the OAI-PMH as a basis for building value-added services; there are no live searches, they only search the metadata that is already there.Bullet 2: Remember, libraries using OAI-PMH are not actually required to register, so this list is wildly incomplete.
  6. An OAI aggregator is both a Service Provider and a Data Provider. It is a service that gathers metadata records from multiple Data Providers and then makes those records available for gathering by others using the OAI-PMH.
  7. Bullet 2: Resource is outside the scope of OAI-PMH. It is whatever the metadata is about. Item is “a constituent of a repository from which metadata about a resource can be disseminated. That metadata may be disseminated on-the-fly from the associated resource, cross-walked from some canonical form, actually stored in the repository, etc.” Record is metadata that is in a specific metadata schema.
  8. Bullet 1: Both identifier and datestamp are required, and nonrepeatable. setSpec is optional and repeatable.
  9. Bullet 1: There are no recommendations for the implementation of Sets.
  10. Bullet 2: Harvesters are not required to use each request type, but the repository must allow for each request type. The purpose and parameters of each of these request types are available online, but for the sake of time, I will not explain them individually.
  11. Bullet 2: It is one of the most used digital resource management tools used in the library world because of the following two advantages.Bullet 3: Which can increase the use of a collection, something that all libraries desire.Bullet 4: And those field names do not need to conform to any standard
  12. OAI offers guidelines in an online document called “OAI-PMH Implementation Guidelines,” but it hasn’t been updated since 2005.
  13. *(URI) - URI is the acronym for Universal Resource Identifier. URIs are strings that identify things on the Web. URIs are sometimes informally called URLs (Uniform Resource Locators), although URLs are more limited than URIs. URIs are used in a number of schemes, including the HTTP and FTP URI schemes. Related to Semantic Web.