SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Crosswalks

 March 25, 2013
 Richard Sapon-White

                       1
Overview
 Crosswalk   definition and description
 Issues




                                           2
Interoperability

Search interoperability
 The ability to perform a search over
  diverse sets of metadata records to
  obtain meaningful results
Today’s session focuses on sets of
  records using different metadata
  schemes

                                         3
Definition
 An  authoritative mapping from the
  metadata elements of one scheme to
  the elements of another
 Example:

  Dublin Core to MARC Crosswalk



                                       4
Reciprocal Crosswalks
 Two  crosswalks are needed to map
  from metadata scheme A to scheme B
AND
  from scheme B to scheme A
 With two crosswalks, “round-trip”
  mapping results in loss or distortion of
  information
                                             5
More Examples
 Library
      of Congress has crosswalks for
 MARC21 to/from
  – DC (Dublin Core)
  – FGDC Content Standards for Geospatial
    Metadata (Federal Geographic Data
    Committee)
  – GILS (Global Information Locator Service)
  – ONIX ((ONline Information eXchange)
                                                6
Uses of Crosswalks
 Record   exchange
 Union catalogs
 Metadata harvesting
 Search engines: query fields with
  similar content in different databases
 Aid to understanding unfamiliar
  schemes
                                           7
Complexities of Crosswalk
Creation
   No standard format for metadata schemes
    – Different properties of elements are specified
    – Same properties may employ different terms
   Some elements may map to multiple
    elements in a second scheme, or vice versa
   Elements may be repeatable in one scheme,
    non-repeatable in another


                                                       8
Complexities of Crosswalk
Creation (cont.)
 Source  scheme may specify an element
  for which there is no comparable
  element in the target scheme
 Differences in content rules (e.g., use of
  a controlled vocabulary) or data
  representation (e.g., Michał Kowalski
  vs. Kowalski, Michał)

                                           9
Issues in Crosswalking Content
Metadata Standards
Barriers to creating crosswalks
1. Lack of common terminology between
   metadata schemes
2. Metadata standards are not organized in
   the same way

Margaret St. Pierre and William LaPlant
http://www.niso.org/publications/white_papers/crosswalk/ (1998)



                                                                  10
St. Pierre and LaPlant (cont.)
Barriers to mapping
 One-to-many mapping: source field contains
  multiple keywords while target field is
  repeatable with one keyword per field
 Many-to-one mapping: results in loss of
  information
 Source element does not map to any element
  in target
 Mandatory element in target without any
  element in source
                                           11
Example
 Dublin Core element “Creator” – an
  uncontrolled name
 Creator did not map to MARC
 MARC name fields defined as main or
  added entries (1xx, 7xx) - content
  defined by AACR2
 To develop a crosswalk, a new 720 field
  was added to MARC
                                       12
Mapping DC Subject to MARC
   DC Subject
    – the topic addressed by the work
    – Can be qualified by the scheme (e.g., LCSH)
   MARC fields 600, 630, 650, 651, 653
    – 600, 630, 650, 651 are controlled vocabulary with
      indicator for the scheme used
    – 653 is uncontrolled vocabulary
   If map to 653, then lose identification of
    controlled vocabulary
                                                      13
Mapping DC Subject to MARC
(cont.)
   Cannot map to other subject fields since DC
    doesn’t distinguish between them
   Suggestion: create new MARC field for
    generic subject field (not done)
    Unqualified:
    653 ##$a (Index Term--Uncontrolled)
    Qualified:
    Scheme=LCSH: 650 #0$a (Subject added entry--Topical term)
    Scheme=MeSH: 650 #2$a (Subject added entry--Topical term)
    Scheme=LCC: 050 ##$a (Library of Congress Call Number/Classification
       number)
    Scheme=DDC: 082 ##$a (Dewey Decimal Call Number/Classification
       number)
    Scheme=UDC: 080 ##$a (Universal Decimal Classification Number)
    Scheme=(other): 650 #7$a with $2=code from MARC Code List for       14
       Relators, Sources, Description Conventions
Mapping DC Title to MARC
 DC   Title does not distinguish between
 title (245 $a) and subtitle (245 $b) or
 any other kinds of titles
 Unqualified:
 – 245 00$a (Title Statement/Title proper)
 – If repeated, all titles after the first: 246 33$a (Varying Form
   of Title/Title proper)

 Qualified:
 – Alternative: 246 33$a (Varying Form of Title/Title proper)

                                                                     15
Mapping DC Publisher to MARC
 One-to-one  relationship between DC
  Publisher and MARC 260 $b
 EASY!




                                        16
Mapping DC Date to MARC
   Publication date in DC element Date best
    maps to MARC21 260 $c
   Other dates exist in MARC21:
    – 008/07-10: date in standardized form
    – 260 $c can also include copyright or printing dates
Unqualified:
 260 ##$c (Date of publication, distribution,
  etc.)


                                                       17
Mapping DC Date to MARC
(cont.)
Qualified DC:
Available: 307 ##$a (Hours, Etc.)
Created: 260 ##$g (Date of manufacture)
Issued:      260 ##$c (Date of publication,
  distribution, etc.)
Modified: 583 ##$d with $a=modified
Valid:       518 ##$a (Date/Time and Place of
  an Event Note). Text may be generated in $3
  to include qualifier name.
                                            18
Mapping DC Identifier to MARC
 DC Identifier is any string or number
  used to uniquely identify an object
 Could be ISBN, ISSN, LCCN, URL
  – Each coded differently in MARC21
 MARC   024 (other standard identifier)
 could be used if type of identifier not
 specified

                                           19
Mapping DC Identifier to MARC
(cont.)
Unqualified:
 024 8#$a (Other Standard Identifier/Standard number or code)


Qualified:
 Scheme=URI: 856 40$u (Electronic Location and
  Access/Uniform Resource Locator)
 Scheme=ISBN: 020 ##$a (International Standard Book
  Number)

   Scheme=ISSN: 022 ##$a (International Standard Serial
    Number)

   Scheme=(other): 024 8#$a (Other Standard Identifier/Standard
    number or code) with $2=scheme value
                                                               20
Resolving Difficulties in
Crosswalk Creation: A Summary
 Create  a new field in MARC
 Use qualifiers (Qualified DC) to map to
  specific MARC fields
 If using unqualified DC, then map to
  closest matching field (with loss of
  some information)
  – Some information maps to a “wrong” field
  – Map to an “other” or “uncontrolled” field

                                                21
Introduction to MarcEdit, from first run to philosophy



 Terry Reese
 Gray Family Chair for Innovative
 Library Services
 Oregon State University
 Email: terry.reese@oregonstate.edu
Getting Started
1.       Sample Data Files
     –       Sample MARC records need to be downloaded.
     –       Get them from:
             http://oregonstate.edu/~reeset/marcedit/examples/session_
             data.zip (~5 MB)
     –       Unzip the data to the Desktop
         •       Right click, Extract all to Desktop.
     –       Worksheet File
         •       Includes the examples that I’ll be working from:
             –      http://oregonstate.edu/~reeset/marcedit/examples/marc_worksheet.docx
     –       When you start MarcEdit for the first time, it will ask you to
             update. Don’t. Tell it no – then we’ll turn off the automated
             update checker.
     –       We’ll use this information later.
Keypoints
   What is MarcEdit?
    – Background
    – System Requirements
   Installation Notes
    – First Run
   Understanding the Application Settings
    – Editor Settings
    – Language settings
   Accessing Application Data
   MarcEdit Infrastructure
   Getting Help
   Questions
What is MarcEdit?
 Started   development in 1999
  – Originally coded in 3 programming
    languages: Assembler (libraries), Visual
    Basic (UI) and Delphi (COM).
  – Initially designed as a replacement for LC’s
    DOS-based MARCBreakr/MARCMakr
    software
What is MarcEdit?
 Today:
  – Written in C#
  – Continues to be freely available
  – Supports both UTF/MARC8 charactersets
  – MARC Neutral
  – XML aware
Installing MarcEdit
 Windows:
  – Installing from the Windows Installer
     • 32-bit version:
       http://people.oregonstate.edu/~reeset/marcedit/
       software/development/MarcEdit_Setup.msi
     • 64-bit version:
       http://people.oregonstate.edu/~reeset/marcedit/
       software/development/MarcEdit_Setup64.msi


  – Installing using a Zip file:
     • http://oregonstate.edu/~reeset/marcedit/softwar
Setting up MarcEdit
 Onfirst run, MarcEdit will ask you to
 confirm some settings. These are
 broken down into 5 areas
  – MarcEditor
  – Language
  – Export
  – MARCEngine
  – Other
MarcEdit Export Properties
 Defines   MARC
  import
 Can capture port
  output from record
  input (much in the
  same way OCLC’s
  Connexion can)
MARC Conversions
MarcEdit: crosswalking design

   MarcEdit   model:
    – So long as a schema has been
      mapped to MARCXML, any
      metadata combination could be
      utilized. This means that no more
      than two tranformations will ever
      take place. Example: MODS 
      MARCXML  EAD
MarcEdit: crosswalking design

    MarcEdit   Crosswalk model
     – Pro
       • Crosswalks need not be directly related
         to each other
       • Requires crosswalker to know specific
         knowledge of only one schema
     – Con
       • each known crosswalk must be mapped
         to MARCXML.
MarcEdit Crosswalking model


                       EAD




     Dublin Core                        FGDC

                     MARC21XML




              MARC               MODS
MarcEdit: Crosswalks for everyone
MarcEdit: Crosswalks for everyone


    Example   Crosswalks:
     – MODS => MARC
     – MODS => FGDC
     – MODS => Dublin Core
     – EAD => MODS
     – EAD=>HTML
MarcEdit: Crosswalks for everyone

     What’s MarcEdit doing?
      – Facilitates the crosswalk by:
        1. Performing character translations
           (MARC8-UTF8)
        2. Facilitates interaction between binary
           and XML formats.
Examples
 Project
       Gutenburg RDF => MARC
 EAD=>MARC
MarcEdit Demo
 http://people.oregonstate.edu/~reeset/m
 arcedit/html/index.php




                                       38

Weitere ähnliche Inhalte

Was ist angesagt?

Bibliographic description an overview
Bibliographic description an overviewBibliographic description an overview
Bibliographic description an overviewDr. Utpal Das
 
Library congress subject headings
Library congress subject headings Library congress subject headings
Library congress subject headings MahendraAdhikari7
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataJenn Riley
 
RDA: Resource Description and Access
RDA: Resource Description and AccessRDA: Resource Description and Access
RDA: Resource Description and AccessRieta Drinkwine
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionDataStax
 
Common communication format
Common communication formatCommon communication format
Common communication formatavid
 
Metadata harvesting
Metadata harvestingMetadata harvesting
Metadata harvestingAndrewLIS688
 
AACR2 to RDA: Using the RDA Toolkit
AACR2 to RDA: Using the RDA ToolkitAACR2 to RDA: Using the RDA Toolkit
AACR2 to RDA: Using the RDA ToolkitTroy Linker
 
Year wise development of library and information science in the world and india
Year wise development of library and information science in the world and indiaYear wise development of library and information science in the world and india
Year wise development of library and information science in the world and indiaSIFS India Forensic Lab
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptSUNILKUMARSINGH
 
Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 

Was ist angesagt? (20)

OAI and OAI-PMH
OAI and OAI-PMHOAI and OAI-PMH
OAI and OAI-PMH
 
Bibliographic description an overview
Bibliographic description an overviewBibliographic description an overview
Bibliographic description an overview
 
MARC21
MARC21MARC21
MARC21
 
Library congress subject headings
Library congress subject headings Library congress subject headings
Library congress subject headings
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
Interoperability in Digital Libraries
Interoperability in Digital LibrariesInteroperability in Digital Libraries
Interoperability in Digital Libraries
 
Marc format
Marc formatMarc format
Marc format
 
Dublin core Presentation
Dublin core PresentationDublin core Presentation
Dublin core Presentation
 
Metadata harvesting Tools
Metadata harvesting ToolsMetadata harvesting Tools
Metadata harvesting Tools
 
RDA: Resource Description and Access
RDA: Resource Description and AccessRDA: Resource Description and Access
RDA: Resource Description and Access
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
 
Common communication format
Common communication formatCommon communication format
Common communication format
 
Metadata harvesting
Metadata harvestingMetadata harvesting
Metadata harvesting
 
Unisist ppt
Unisist pptUnisist ppt
Unisist ppt
 
AACR2 to RDA: Using the RDA Toolkit
AACR2 to RDA: Using the RDA ToolkitAACR2 to RDA: Using the RDA Toolkit
AACR2 to RDA: Using the RDA Toolkit
 
UNISIST
UNISISTUNISIST
UNISIST
 
Canons of library classification
Canons of library classificationCanons of library classification
Canons of library classification
 
Year wise development of library and information science in the world and india
Year wise development of library and information science in the world and indiaYear wise development of library and information science in the world and india
Year wise development of library and information science in the world and india
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 

Ähnlich wie Metadata crosswalks

Working with the MarcEditor
Working with the MarcEditorWorking with the MarcEditor
Working with the MarcEditorTerry Reese
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In PracticeMarcia Zeng
 
Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21vty
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...Vyacheslav Tykhonov
 
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...Kiruthikak14
 
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...Kiruthikak14
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talkbenosteen
 
Apdm 101 Arc Gis Pipeline Data Model (1)
Apdm 101 Arc Gis Pipeline Data Model  (1)Apdm 101 Arc Gis Pipeline Data Model  (1)
Apdm 101 Arc Gis Pipeline Data Model (1)David Nichter, GISP
 
How Apache Spark fits in the Big Data landscape
How Apache Spark fits in the Big Data landscapeHow Apache Spark fits in the Big Data landscape
How Apache Spark fits in the Big Data landscapePaco Nathan
 
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSAlphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSJenn Riley
 
CLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesCLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesVyacheslav Tykhonov
 
Building Robust ETL Pipelines with Apache Spark
Building Robust ETL Pipelines with Apache SparkBuilding Robust ETL Pipelines with Apache Spark
Building Robust ETL Pipelines with Apache SparkDatabricks
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
Metadata for your Digital Collections
Metadata for your Digital CollectionsMetadata for your Digital Collections
Metadata for your Digital CollectionsJenn Riley
 
CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes vty
 
Optimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the webOptimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the webMahdi Atawneh
 
Building_a_Geodatabase_ArcGIS_9.pdf
Building_a_Geodatabase_ArcGIS_9.pdfBuilding_a_Geodatabase_ArcGIS_9.pdf
Building_a_Geodatabase_ArcGIS_9.pdfkovan azeez
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordMark Wilkinson
 

Ähnlich wie Metadata crosswalks (20)

Working with the MarcEditor
Working with the MarcEditorWorking with the MarcEditor
Working with the MarcEditor
 
Dublin Core In Practice
Dublin Core In PracticeDublin Core In Practice
Dublin Core In Practice
 
Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...
 
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...
Bigdata analytics K.kiruthika 2nd M.Sc.,computer science Bon secoures college...
 
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...
Big data analytics K.Kiruthika II-M.Sc.,Computer Science Bonsecours college f...
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talk
 
Apdm 101 Arc Gis Pipeline Data Model (1)
Apdm 101 Arc Gis Pipeline Data Model  (1)Apdm 101 Arc Gis Pipeline Data Model  (1)
Apdm 101 Arc Gis Pipeline Data Model (1)
 
How Apache Spark fits in the Big Data landscape
How Apache Spark fits in the Big Data landscapeHow Apache Spark fits in the Big Data landscape
How Apache Spark fits in the Big Data landscape
 
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSAlphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
 
CLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesCLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemes
 
Building Robust ETL Pipelines with Apache Spark
Building Robust ETL Pipelines with Apache SparkBuilding Robust ETL Pipelines with Apache Spark
Building Robust ETL Pipelines with Apache Spark
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
Metadata for your Digital Collections
Metadata for your Digital CollectionsMetadata for your Digital Collections
Metadata for your Digital Collections
 
CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes
 
Optimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the webOptimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the web
 
Building_a_Geodatabase_ArcGIS_9.pdf
Building_a_Geodatabase_ArcGIS_9.pdfBuilding_a_Geodatabase_ArcGIS_9.pdf
Building_a_Geodatabase_ArcGIS_9.pdf
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
 
Computer science
Computer scienceComputer science
Computer science
 

Mehr von Richard.Sapon-White

Rda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalecRda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalecRichard.Sapon-White
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRichard.Sapon-White
 
Continuing Education for Metadata Creation and Management
Continuing Education for Metadata Creation and ManagementContinuing Education for Metadata Creation and Management
Continuing Education for Metadata Creation and ManagementRichard.Sapon-White
 
RDA as an international standard
RDA as an international standardRDA as an international standard
RDA as an international standardRichard.Sapon-White
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesRichard.Sapon-White
 
Introduction to metadata, part 2
Introduction to metadata, part 2Introduction to metadata, part 2
Introduction to metadata, part 2Richard.Sapon-White
 
Course syllabus metadata systems for warsaw
Course syllabus metadata systems for warsawCourse syllabus metadata systems for warsaw
Course syllabus metadata systems for warsawRichard.Sapon-White
 
Preparing your presentation.pptx [repaired]
Preparing your presentation.pptx [repaired]Preparing your presentation.pptx [repaired]
Preparing your presentation.pptx [repaired]Richard.Sapon-White
 
E books in public libraries. vendors in poland and usa
E books in public libraries. vendors in poland and usaE books in public libraries. vendors in poland and usa
E books in public libraries. vendors in poland and usaRichard.Sapon-White
 
Accessibility issues with ebooks
Accessibility issues with ebooksAccessibility issues with ebooks
Accessibility issues with ebooksRichard.Sapon-White
 

Mehr von Richard.Sapon-White (20)

Rda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalecRda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalec
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna Pszenicyn
 
Continuing Education for Metadata Creation and Management
Continuing Education for Metadata Creation and ManagementContinuing Education for Metadata Creation and Management
Continuing Education for Metadata Creation and Management
 
VRA Core 4.0
VRA Core 4.0VRA Core 4.0
VRA Core 4.0
 
Sgml and xml
Sgml and xmlSgml and xml
Sgml and xml
 
RDA as an international standard
RDA as an international standardRDA as an international standard
RDA as an international standard
 
Metadata april 8 2013
Metadata april 8 2013Metadata april 8 2013
Metadata april 8 2013
 
Metadata and the web
Metadata and the webMetadata and the web
Metadata and the web
 
Metadata lecture 5 part 2
Metadata lecture 5 part 2Metadata lecture 5 part 2
Metadata lecture 5 part 2
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemes
 
Rda class, lecture 2
Rda class, lecture 2Rda class, lecture 2
Rda class, lecture 2
 
Rda class, lecture 2
Rda class, lecture 2Rda class, lecture 2
Rda class, lecture 2
 
Introduction to metadata, part 2
Introduction to metadata, part 2Introduction to metadata, part 2
Introduction to metadata, part 2
 
Course syllabus metadata systems for warsaw
Course syllabus metadata systems for warsawCourse syllabus metadata systems for warsaw
Course syllabus metadata systems for warsaw
 
Rda seminar syllabus
Rda seminar syllabusRda seminar syllabus
Rda seminar syllabus
 
Preparing your presentation.pptx [repaired]
Preparing your presentation.pptx [repaired]Preparing your presentation.pptx [repaired]
Preparing your presentation.pptx [repaired]
 
Rda class, lecture 1
Rda class, lecture 1Rda class, lecture 1
Rda class, lecture 1
 
Metadata lecture 1, intro
Metadata lecture 1, introMetadata lecture 1, intro
Metadata lecture 1, intro
 
E books in public libraries. vendors in poland and usa
E books in public libraries. vendors in poland and usaE books in public libraries. vendors in poland and usa
E books in public libraries. vendors in poland and usa
 
Accessibility issues with ebooks
Accessibility issues with ebooksAccessibility issues with ebooks
Accessibility issues with ebooks
 

Kürzlich hochgeladen

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 

Kürzlich hochgeladen (20)

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 

Metadata crosswalks

  • 1. Crosswalks March 25, 2013 Richard Sapon-White 1
  • 2. Overview  Crosswalk definition and description  Issues 2
  • 3. Interoperability Search interoperability  The ability to perform a search over diverse sets of metadata records to obtain meaningful results Today’s session focuses on sets of records using different metadata schemes 3
  • 4. Definition  An authoritative mapping from the metadata elements of one scheme to the elements of another  Example: Dublin Core to MARC Crosswalk 4
  • 5. Reciprocal Crosswalks  Two crosswalks are needed to map from metadata scheme A to scheme B AND from scheme B to scheme A  With two crosswalks, “round-trip” mapping results in loss or distortion of information 5
  • 6. More Examples  Library of Congress has crosswalks for MARC21 to/from – DC (Dublin Core) – FGDC Content Standards for Geospatial Metadata (Federal Geographic Data Committee) – GILS (Global Information Locator Service) – ONIX ((ONline Information eXchange) 6
  • 7. Uses of Crosswalks  Record exchange  Union catalogs  Metadata harvesting  Search engines: query fields with similar content in different databases  Aid to understanding unfamiliar schemes 7
  • 8. Complexities of Crosswalk Creation  No standard format for metadata schemes – Different properties of elements are specified – Same properties may employ different terms  Some elements may map to multiple elements in a second scheme, or vice versa  Elements may be repeatable in one scheme, non-repeatable in another 8
  • 9. Complexities of Crosswalk Creation (cont.)  Source scheme may specify an element for which there is no comparable element in the target scheme  Differences in content rules (e.g., use of a controlled vocabulary) or data representation (e.g., Michał Kowalski vs. Kowalski, Michał) 9
  • 10. Issues in Crosswalking Content Metadata Standards Barriers to creating crosswalks 1. Lack of common terminology between metadata schemes 2. Metadata standards are not organized in the same way Margaret St. Pierre and William LaPlant http://www.niso.org/publications/white_papers/crosswalk/ (1998) 10
  • 11. St. Pierre and LaPlant (cont.) Barriers to mapping  One-to-many mapping: source field contains multiple keywords while target field is repeatable with one keyword per field  Many-to-one mapping: results in loss of information  Source element does not map to any element in target  Mandatory element in target without any element in source 11
  • 12. Example  Dublin Core element “Creator” – an uncontrolled name  Creator did not map to MARC  MARC name fields defined as main or added entries (1xx, 7xx) - content defined by AACR2  To develop a crosswalk, a new 720 field was added to MARC 12
  • 13. Mapping DC Subject to MARC  DC Subject – the topic addressed by the work – Can be qualified by the scheme (e.g., LCSH)  MARC fields 600, 630, 650, 651, 653 – 600, 630, 650, 651 are controlled vocabulary with indicator for the scheme used – 653 is uncontrolled vocabulary  If map to 653, then lose identification of controlled vocabulary 13
  • 14. Mapping DC Subject to MARC (cont.)  Cannot map to other subject fields since DC doesn’t distinguish between them  Suggestion: create new MARC field for generic subject field (not done) Unqualified: 653 ##$a (Index Term--Uncontrolled) Qualified: Scheme=LCSH: 650 #0$a (Subject added entry--Topical term) Scheme=MeSH: 650 #2$a (Subject added entry--Topical term) Scheme=LCC: 050 ##$a (Library of Congress Call Number/Classification number) Scheme=DDC: 082 ##$a (Dewey Decimal Call Number/Classification number) Scheme=UDC: 080 ##$a (Universal Decimal Classification Number) Scheme=(other): 650 #7$a with $2=code from MARC Code List for 14 Relators, Sources, Description Conventions
  • 15. Mapping DC Title to MARC  DC Title does not distinguish between title (245 $a) and subtitle (245 $b) or any other kinds of titles Unqualified: – 245 00$a (Title Statement/Title proper) – If repeated, all titles after the first: 246 33$a (Varying Form of Title/Title proper) Qualified: – Alternative: 246 33$a (Varying Form of Title/Title proper) 15
  • 16. Mapping DC Publisher to MARC  One-to-one relationship between DC Publisher and MARC 260 $b  EASY! 16
  • 17. Mapping DC Date to MARC  Publication date in DC element Date best maps to MARC21 260 $c  Other dates exist in MARC21: – 008/07-10: date in standardized form – 260 $c can also include copyright or printing dates Unqualified:  260 ##$c (Date of publication, distribution, etc.) 17
  • 18. Mapping DC Date to MARC (cont.) Qualified DC: Available: 307 ##$a (Hours, Etc.) Created: 260 ##$g (Date of manufacture) Issued: 260 ##$c (Date of publication, distribution, etc.) Modified: 583 ##$d with $a=modified Valid: 518 ##$a (Date/Time and Place of an Event Note). Text may be generated in $3 to include qualifier name. 18
  • 19. Mapping DC Identifier to MARC  DC Identifier is any string or number used to uniquely identify an object  Could be ISBN, ISSN, LCCN, URL – Each coded differently in MARC21  MARC 024 (other standard identifier) could be used if type of identifier not specified 19
  • 20. Mapping DC Identifier to MARC (cont.) Unqualified:  024 8#$a (Other Standard Identifier/Standard number or code) Qualified:  Scheme=URI: 856 40$u (Electronic Location and Access/Uniform Resource Locator)  Scheme=ISBN: 020 ##$a (International Standard Book Number)  Scheme=ISSN: 022 ##$a (International Standard Serial Number)  Scheme=(other): 024 8#$a (Other Standard Identifier/Standard number or code) with $2=scheme value 20
  • 21. Resolving Difficulties in Crosswalk Creation: A Summary  Create a new field in MARC  Use qualifiers (Qualified DC) to map to specific MARC fields  If using unqualified DC, then map to closest matching field (with loss of some information) – Some information maps to a “wrong” field – Map to an “other” or “uncontrolled” field 21
  • 22. Introduction to MarcEdit, from first run to philosophy Terry Reese Gray Family Chair for Innovative Library Services Oregon State University Email: terry.reese@oregonstate.edu
  • 23. Getting Started 1. Sample Data Files – Sample MARC records need to be downloaded. – Get them from: http://oregonstate.edu/~reeset/marcedit/examples/session_ data.zip (~5 MB) – Unzip the data to the Desktop • Right click, Extract all to Desktop. – Worksheet File • Includes the examples that I’ll be working from: – http://oregonstate.edu/~reeset/marcedit/examples/marc_worksheet.docx – When you start MarcEdit for the first time, it will ask you to update. Don’t. Tell it no – then we’ll turn off the automated update checker. – We’ll use this information later.
  • 24. Keypoints  What is MarcEdit? – Background – System Requirements  Installation Notes – First Run  Understanding the Application Settings – Editor Settings – Language settings  Accessing Application Data  MarcEdit Infrastructure  Getting Help  Questions
  • 25. What is MarcEdit?  Started development in 1999 – Originally coded in 3 programming languages: Assembler (libraries), Visual Basic (UI) and Delphi (COM). – Initially designed as a replacement for LC’s DOS-based MARCBreakr/MARCMakr software
  • 26. What is MarcEdit?  Today: – Written in C# – Continues to be freely available – Supports both UTF/MARC8 charactersets – MARC Neutral – XML aware
  • 27. Installing MarcEdit  Windows: – Installing from the Windows Installer • 32-bit version: http://people.oregonstate.edu/~reeset/marcedit/ software/development/MarcEdit_Setup.msi • 64-bit version: http://people.oregonstate.edu/~reeset/marcedit/ software/development/MarcEdit_Setup64.msi – Installing using a Zip file: • http://oregonstate.edu/~reeset/marcedit/softwar
  • 28. Setting up MarcEdit  Onfirst run, MarcEdit will ask you to confirm some settings. These are broken down into 5 areas – MarcEditor – Language – Export – MARCEngine – Other
  • 29. MarcEdit Export Properties  Defines MARC import  Can capture port output from record input (much in the same way OCLC’s Connexion can)
  • 31. MarcEdit: crosswalking design  MarcEdit model: – So long as a schema has been mapped to MARCXML, any metadata combination could be utilized. This means that no more than two tranformations will ever take place. Example: MODS  MARCXML  EAD
  • 32. MarcEdit: crosswalking design  MarcEdit Crosswalk model – Pro • Crosswalks need not be directly related to each other • Requires crosswalker to know specific knowledge of only one schema – Con • each known crosswalk must be mapped to MARCXML.
  • 33. MarcEdit Crosswalking model EAD Dublin Core FGDC MARC21XML MARC MODS
  • 35. MarcEdit: Crosswalks for everyone  Example Crosswalks: – MODS => MARC – MODS => FGDC – MODS => Dublin Core – EAD => MODS – EAD=>HTML
  • 36. MarcEdit: Crosswalks for everyone  What’s MarcEdit doing? – Facilitates the crosswalk by: 1. Performing character translations (MARC8-UTF8) 2. Facilitates interaction between binary and XML formats.
  • 37. Examples  Project Gutenburg RDF => MARC  EAD=>MARC

Hinweis der Redaktion

  1. Would like to now consider Caplan and Guenther’s paper describing the DC to MARC crosswalk mapping at its beginnings in 1996. What follows are specific fields, the problems raised by C&G, and how they were resolved in the current crosswalk. Will then try to summarize how these issues in crosswalks were resolved
  2. So there is loss of information – lose the distinction between title and subtitle – an imperfect conversion
  3. Why did I need to develop a replacement to the DOS-based utility? I’ve always done a lot of consulting work and the DOS-based tools was always my favorite tools. But as I moved to an NT-based system, I started to have more trouble with all DOS software so I decided to develop a windows alternative. Originally, I’d planned on just creating MarcEdit for my own use, but in June 2000, OSU needed to do a large call number flipping project and when I showed MarcEdit to a collegie, Kyle Banerjee, he convinced me that I should make this program available to the public.
  4. This is really the heart of MarcEdit All utilities and functions interact with the MARCEngine in some fashion.