Fedora migration considerations
Juliet L. Hardesty
Metadata Analyst, Indiana University
Open Repositories, June 15, 2016
Fedora 3 diagram key
Repository object
Structural metadata datastream
Descriptive metadata datastream
Other metadata datastream
Indiana University President's Office records, 1937-1962.
Subject files, 1937-1962. Aeons, Board of, 1939-1940.
Fedora 3 - documents
[Diagram: three linked repository objects]
Folder: VAA8877-06169 (datastreams: DC, RELS-EXT, METS); isMemberOfCollection
Document in folder: VAA8877-U-03624 (datastreams: DC, RELS-EXT, PDF); isPartOf the folder
Page in document: VAA8877-U-03624-001 (datastreams: DC, RELS-EXT, MASTER, Derivatives, MASTER-MIX); isPartOf the document
Portable soft drink stand at Bowling Green
Fedora 3 - images
[Diagram: one repository object]
Image: P02668 (datastream labels: DC, RELS-EXT, METS, PURL Redirect, DC, MODS); isMemberOfCollection
[Program, 2013-2014, no. 117]
Fedora 3 – time-based media
[Diagram: three linked repository objects]
Item: avalon:22187 (datastreams: DC, RELS-EXT, MODS, RIGHTS, SECTIONS, TECH/DISPLAY, WORKFLOW); isMemberOfCollection
CD 1 and CD 2: avalon:22189 and avalon:22191 (datastreams on each: DC, RELS-EXT, STRUCTURE, MATTERHORN, TECH/DISPLAY); each isPartOf the item
Focusing on Fedora 3 structure
• RELS-EXT defines structure going up
• METS or special datastreams required to
define structure going down
• All in XML as datastreams on objects that
connect together down to the digital file
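The "structure going up" point can be illustrated with a minimal sketch that reads the upward links out of a RELS-EXT datastream. The XML sample and the `example:` PIDs are hypothetical (the slides do not show datastream content), and `upward_links` is an illustrative helper, not part of any Fedora tooling.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
REL_NS = "info:fedora/fedora-system:def/relations-external#"

# Hypothetical RELS-EXT content for a page object; PIDs are placeholders.
RELS_EXT = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rel="info:fedora/fedora-system:def/relations-external#">
  <rdf:Description rdf:about="info:fedora/example:page">
    <rel:isPartOf rdf:resource="info:fedora/example:document"/>
  </rdf:Description>
</rdf:RDF>"""


def upward_links(rels_ext_xml):
    """Return (subject, relationship, target) tuples found in a RELS-EXT document."""
    root = ET.fromstring(rels_ext_xml)
    links = []
    for desc in root.findall(f"{{{RDF_NS}}}Description"):
        subject = desc.get(f"{{{RDF_NS}}}about")
        for rel in desc:
            # Keep only statements from the Fedora external-relations namespace.
            if rel.tag.startswith(f"{{{REL_NS}}}"):
                name = rel.tag.split("}", 1)[1]
                links.append((subject, name, rel.get(f"{{{RDF_NS}}}resource")))
    return links


print(upward_links(RELS_EXT))
```

Each object only knows its parent here; finding the children of a folder or document means consulting METS or another structure datastream, which is the "going down" half of the slide.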
Portland Common Data Model (PCDM)
https://github.com/duraspace/pcdm/wiki
Document example in PCDM
Wilcox, David and Andrew Woods. “Hands-On: Seeing Fedora 4 Firsthand.” Fedora 4 Training
Workshop. Open Repositories 2015.
METS: fileSec
METS: structMap
Fedora 4 structure diagram
[Diagram: VAA8877-06169 pcdm:hasMember VAA8877-U-03626, which pcdm:hasMember VAA8877-U-03626-page1]
Containers: documents, pages, files
Proxies: VAA8877-U-03626Proxy, VAA8877-U-03626-page1Proxy
Files: thumb.jpg, screen.jpg, large.jpg
http://localhost:8080/fcrepo/rest/VAA8877-U-03625/pages/VAA8877-U-03625-page1Proxy
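METS as PCDM in Fedora – SPARQL query
# Prefix declarations added so the query is self-contained;
# they use the published namespace URI for each vocabulary.
PREFIX pcdm: <http://pcdm.org/models#>
PREFIX ore: <http://www.openarchives.org/ore/terms/>
PREFIX iana: <http://www.iana.org/assignments/relation/>
PREFIX ebucore: <http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT DISTINCT ?document ?firstPage ?nextPage WHERE {
  {
    # Each document in the folder and its first page
    <http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember ?document .
    ?document iana:first ?firstPageProxy .
    ?firstPageProxy ore:proxyFor ?firstPage .
  }
  UNION
  {
    # Every JPEG page under the folder and the page that follows it
    <http://localhost:8080/fcrepo/rest/VAA8877-06169> pcdm:hasMember+ ?page .
    ?page pcdm:hasFile ?f .
    ?f ebucore:hasMimeType "image/jpeg"^^xsd:string .
    ?pageProxy ore:proxyFor ?page .
    ?pageProxy iana:next ?nextPageProxy .
    ?nextPageProxy ore:proxyFor ?nextPage .
  }
}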
SPARQL query results for structure of folder VAA8877-06169
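The ordering that the query reads from the proxies can be pictured with a small in-memory sketch. The dictionaries below are hypothetical stand-ins for the `iana:first`, `iana:next`, and `ore:proxyFor` statements; they are not data from the repository, and `ordered_pages` is an illustrative helper.

```python
# Hypothetical proxy statements, shaped like the ones the SPARQL query traverses.
FIRST = {"document": "page1Proxy"}                                  # iana:first
NEXT = {"page1Proxy": "page2Proxy", "page2Proxy": "page3Proxy"}     # iana:next
PROXY_FOR = {"page1Proxy": "page1", "page2Proxy": "page2",
             "page3Proxy": "page3"}                                 # ore:proxyFor


def ordered_pages(document):
    """Follow iana:first, then iana:next, resolving each proxy to its page."""
    pages = []
    seen = set()  # guards against a cycle in malformed data
    proxy = FIRST.get(document)
    while proxy is not None and proxy not in seen:
        seen.add(proxy)
        pages.append(PROXY_FOR[proxy])
        proxy = NEXT.get(proxy)
    return pages


print(ordered_pages("document"))
```

The pages themselves carry no order; the sequence lives entirely in the proxy chain, which is why the query has to resolve every proxy back to a page with `ore:proxyFor`.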
Transforming METS to PCDM
• Consider structure for collections using METS
• fileSec and structMap
– @GROUPID, @ID, @FILEID (unique identifiers
from both sections) in combination with <div>
structure within structMap can point to grouping
and ordering structure for objects
• Migrating easier if identifiers brought over
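One way such a transformation might look is sketched below, assuming a METS file whose structMap nests page `<div>` elements inside a document `<div>` and uses `@ORDER` for sequence. The sample XML, the identifiers, and the `Proxy` naming are illustrative assumptions modeled on the diagram labels, not IU's actual mapping.

```python
import xml.etree.ElementTree as ET

METS = "{http://www.loc.gov/METS/}"

# Hypothetical METS sample; real files carry far more detail.
SAMPLE = """<mets:mets xmlns:mets="http://www.loc.gov/METS/">
  <mets:fileSec>
    <mets:fileGrp>
      <mets:file ID="F1" GROUPID="G1"/>
      <mets:file ID="F2" GROUPID="G2"/>
    </mets:fileGrp>
  </mets:fileSec>
  <mets:structMap>
    <mets:div ID="doc">
      <mets:div ID="page2" ORDER="2"><mets:fptr FILEID="F2"/></mets:div>
      <mets:div ID="page1" ORDER="1"><mets:fptr FILEID="F1"/></mets:div>
    </mets:div>
  </mets:structMap>
</mets:mets>"""


def mets_to_pcdm(mets_xml):
    """Turn structMap nesting and ORDER into PCDM-style membership and ordering triples."""
    root = ET.fromstring(mets_xml)
    # File IDs declared in fileSec; fptr pointers are checked against these.
    file_ids = {f.get("ID") for f in root.iter(f"{METS}file")}
    triples = []
    for div in root.iter(f"{METS}div"):
        div_id = div.get("ID")
        children = sorted(div.findall(f"{METS}div"),
                          key=lambda d: int(d.get("ORDER", "0")))
        ids = [c.get("ID") for c in children]
        for child_id in ids:
            triples.append((div_id, "pcdm:hasMember", child_id))
        if ids:
            triples.append((div_id, "iana:first", f"{ids[0]}Proxy"))
        for prev_id, next_id in zip(ids, ids[1:]):
            triples.append((f"{prev_id}Proxy", "iana:next", f"{next_id}Proxy"))
        for fptr in div.findall(f"{METS}fptr"):
            if fptr.get("FILEID") in file_ids:
                triples.append((div_id, "pcdm:hasFile", fptr.get("FILEID")))
    return triples


for triple in mets_to_pcdm(SAMPLE):
    print(triple)
```

Reusing the METS identifiers as the names of the new resources is what keeps the mapping traceable, in line with the point that migration is easier when identifiers are brought over.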
Considerations for structural metadata
• RELS-EXT in Fedora 3 migrated to Fedora 4
• Other structure datastreams are on you to
map
• METS – possible to map programmatically but
maybe not in a standard way
Descriptive Metadata - IU
MODS
Fedora 4 options for descriptive
metadata: Option 1
• Migration tools
– migration-utils
– fedora-migrate gem
Cons: Neither really takes advantage of Fedora 4 or an external triplestore unless the original is already RDF; fedora-migrate requires Hydra.
Pros: Available now; keeps all metadata, so nothing is lost; can have RDF statements on the object if the metadata is already in RDF (fedora-migrate).
Descriptive metadata to Fedora 4:
Option 2
• Map only simple statements to RDF
– Minimal descriptive metadata (title, date) or
descriptive metadata indexed for discovery (title,
date, creator, type, subject, genre, language)
– Use ontologies that allow for simple statements
Cons: Not all metadata is in RDF; changes in ontologies/standards might not be a 1:1 match with the original.
Pros: Creates RDF statements in Fedora 4/external triplestore; a great option if the metadata is DC or another non-hierarchical standard.
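A minimal sketch of Option 2, assuming the flat fields are mapped to Dublin Core Terms properties and serialized as N-Triples. The resource URI, the record values, and the choice of DC Terms are illustrative assumptions; genre is left out because DC Terms has no matching property, so it would need another ontology.

```python
DCTERMS = "http://purl.org/dc/terms/"

# Fields from the slide that have a same-named DC Terms property.
SIMPLE_FIELDS = ("title", "date", "creator", "type", "subject", "language")


def literal(value):
    """Quote a string as an N-Triples literal (escapes backslash, quote, newline)."""
    escaped = (value.replace("\\", "\\\\")
                    .replace('"', '\\"')
                    .replace("\n", "\\n"))
    return f'"{escaped}"'


def simple_statements(subject_uri, record):
    """Emit one N-Triples line per value of each simple field; skip everything else."""
    lines = []
    for field in SIMPLE_FIELDS:
        for value in record.get(field, []):
            lines.append(f"<{subject_uri}> <{DCTERMS}{field}> {literal(value)} .")
    return lines


# Hypothetical record: "note" is not a simple field, so it is not mapped.
record = {"title": ["Example title"], "date": ["1940"], "note": ["Unmapped detail"]}
for line in simple_statements("http://localhost:8080/fcrepo/rest/example", record):
    print(line)
```

The unmapped `note` field shows the trade-off named above: anything outside the simple set stays out of RDF unless the original record is kept alongside.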
Descriptive metadata to Fedora 4:
Option 3
• Map complex/hierarchical information into
external triplestore
– All RDF statements (simple and complex) go into
external triplestore but only simple statements
are on Fedora 4 object
Cushman photograph – Fedora 4
Cushman photograph – external triplestore
Descriptive metadata to Fedora 4:
Option 3
Cons: Separates metadata between repository and triplestore; problematic if the original metadata is not kept in the repository.
Pros: More or all metadata is available as RDF; updates are easier to manage through triplestore/Fedora 4 functionality.
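The split described for Option 3 can be sketched as a filter over triples, assuming "simple" means a statement made directly about the object whose value is not a blank node. The `ex:` predicates, the sample triples, and that definition of simple are assumptions for illustration only.

```python
OBJECT_URI = "http://localhost:8080/fcrepo/rest/example"  # hypothetical resource

# Hypothetical statements: one flat title plus one nested (blank-node) name.
TRIPLES = [
    (OBJECT_URI, "dcterms:title", '"Example photograph"'),
    (OBJECT_URI, "ex:name", "_:b0"),
    ("_:b0", "ex:namePart", '"Example, Person"'),
]


def split_statements(triples, object_uri):
    """Return (statements for the Fedora 4 object, statements for the triplestore)."""
    on_object = [
        (s, p, o) for s, p, o in triples
        # Simple: about the object itself and not pointing into a nested structure.
        if s == object_uri and not o.startswith("_:")
    ]
    # The external triplestore receives everything, simple and complex alike.
    return on_object, list(triples)


on_object, in_triplestore = split_statements(TRIPLES, OBJECT_URI)
print(len(on_object), len(in_triplestore))
```

Because the nested name exists only in the triplestore copy, the repository alone cannot reproduce the full description, which is the concern raised when the original metadata is not kept.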
Descriptive metadata to Fedora 4:
Almost Option 4
• MODS “unofficial standard” mapping to RDF
– MODS and RDF Group’s mapping/transformation
scenario (http://mods2rdf.xyz/)
– Available as conversion code to see MODS mapped
into Fedora 4
Cons: Work in progress (not always up, transforms only a few elements); you might not agree with the suggested mappings.
Pros: Available now to try; might offer a standard way to transform MODS similarly across institutions.
Considerations for descriptive
metadata
• Current state of descriptive metadata
• Management needs of Fedora 4
• Transform to RDF statements or also to Linked
Data
• Decision: keep original descriptive metadata
or not?
– At IU, keep original descriptive metadata (for full
item view)
Thank you!
• Julie Hardesty
• jlhardes@iu.edu
• @jlhardes


Editor's notes

  1. HyBox idea: https://github.com/projecthydra-labs/hybox-ideas/issues/19
     Ordering question on fedora-tech: https://groups.google.com/forum/#!topic/fedora-tech/1dlfy9Nx76Q
  2. Can add a MODSRDF namespace and add properties using that namespace. Tried using mods:genre since it is a flat, non-hierarchical element in XML; even brought in a URI for the genre "Documentary films" from the LC Genre/Form Terms vocabulary. The problem: this is not MODSRDF. MODSRDF is often complex (nested) RDF. How does it work to add a complex MODS field to an object?