Turning the Page on Digital Content
David Wilcox (dgi) & Kirsta Stapelfeldt (Islandora)
Open Repositories 2013
Outline
•  Content Models
•  Role of Metadata
•  Preparing Content for Ingest
•  Derivative Creation
•  Display
Content Models
•  Books/Monographs/Journals/Periodicals/Newspapers
•  Formats: TIFF, JPEG, JPEG 2000, PDF (PDF/A)
o  A PDF is stored as a single continuous object
o  Books and periodicals are stored atomistically (one object per page)
[Diagram: RDF statements connecting Page Objects to the Book Object]
Book Object: Datastreams
•  RELS-EXT: RDF statements connecting book to collection
•  MODS: MODS metadata
•  DC: Dublin Core metadata
•  TN: Display thumbnail
•  PDF (optional): a PDF of all pages can be generated and stored at the book level
Page Object: Datastreams
•  RELS-EXT: RDF statements connecting pages to book and declaring the order of pages
•  MODS: MODS metadata
•  DC: Dublin Core metadata
•  TN: Display thumbnail
•  OBJ: TIFF representing page
•  JP2: JPEG 2000
•  JPG: Display JPEG (for reader)
•  OCR: Text (generated or uploaded)
•  HOCR: Coordinate data, for generated text only
•  PDF (optional): a PDF for a single page can be generated and stored with the object
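As a rough illustration of how RELS-EXT statements can both tie pages to a book and declare their order, the sketch below models each statement as a (subject, predicate, object) triple and recovers the page order from it. The PIDs and the predicate names `isPageOf` and `isSequenceNumber` are placeholders chosen for this example; the slides do not name the actual predicates.

```python
from typing import List, NamedTuple


class Statement(NamedTuple):
    """One RDF-style statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


def page_statements(page_pid: str, book_pid: str, sequence: int) -> List[Statement]:
    """Statements for one page: which book it belongs to, and its position.

    Predicate names are illustrative assumptions, not taken from the slides.
    """
    return [
        Statement(page_pid, "isPageOf", book_pid),
        Statement(page_pid, "isSequenceNumber", str(sequence)),
    ]


def ordered_pages(statements: List[Statement], book_pid: str) -> List[str]:
    """Return the page PIDs of one book, sorted by declared sequence number."""
    members = {
        s.subject
        for s in statements
        if s.predicate == "isPageOf" and s.obj == book_pid
    }
    sequence = {
        s.subject: int(s.obj)
        for s in statements
        if s.predicate == "isSequenceNumber" and s.subject in members
    }
    return sorted(sequence, key=sequence.get)


# Hypothetical PIDs, deliberately listed out of order.
statements = (
    page_statements("demo:page-b", "demo:book", 2)
    + page_statements("demo:page-a", "demo:book", 1)
)
print(ordered_pages(statements, "demo:book"))
```

Because the order lives in the statements rather than in the page content, a page can change position without its image or text datastreams being touched.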
Management functions for book pages
•  Reordering, deletion, replacement (of object or derivatives)
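A minimal sketch of what reordering and deletion amount to if page order is kept as a list of page identifiers: the list is changed, then sequence numbers are reassigned from 1 so no gaps remain. The function names and PIDs are hypothetical and do not describe Islandora's own code.

```python
from typing import Dict, List


def move_page(order: List[str], pid: str, new_index: int) -> List[str]:
    """Return a new order with `pid` moved to `new_index` (0-based)."""
    if pid not in order:
        raise ValueError(f"{pid} is not in this book")
    remaining = [p for p in order if p != pid]
    remaining.insert(new_index, pid)
    return remaining


def delete_page(order: List[str], pid: str) -> List[str]:
    """Return a new order without `pid`."""
    if pid not in order:
        raise ValueError(f"{pid} is not in this book")
    return [p for p in order if p != pid]


def sequence_numbers(order: List[str]) -> Dict[str, int]:
    """Assign gap-free sequence numbers, starting at 1."""
    return {pid: n for n, pid in enumerate(order, start=1)}


# Hypothetical page identifiers.
pages = ["demo:p1", "demo:p2", "demo:p3"]
print(move_page(pages, "demo:p3", 0))
print(sequence_numbers(delete_page(pages, "demo:p2")))
```

Both functions return new lists instead of modifying their input, so the previous order stays available if a change has to be rolled back.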
Approaches to Metadata
•  Default is MODS and DC
•  Ability to add different metadata at Book & Page level
•  Ability to add encoded text stream (TEI and HOCR)
o  Syncing issues
o  TEI schema
•  Next: How is content created and managed? (Interface Tour)
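For readers unfamiliar with the HOCR stream, the sketch below shows the kind of coordinate data it carries. hOCR marks up recognized words in HTML and records each word's bounding box in the `title` attribute. The sample fragment is invented for illustration; real OCR output is a complete XHTML page with more properties.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # x0, y0, x1, y1 in image pixels

# Made-up fragment in hOCR style; not taken from a real page.
SAMPLE_HOCR = """<div class="ocr_page" title="bbox 0 0 600 800">
  <span class="ocrx_word" title="bbox 36 92 96 116; x_wconf 95">Turning</span>
  <span class="ocrx_word" title="bbox 104 92 140 116; x_wconf 91">the</span>
</div>"""


def word_boxes(hocr: str) -> List[Tuple[str, Box]]:
    """Return (word, bounding box) pairs for every word-level element."""
    words = []
    # iter() visits every element, so this also works for namespaced XHTML.
    for element in ET.fromstring(hocr).iter():
        if element.get("class") != "ocrx_word":
            continue
        # The title holds ';'-separated properties such as "bbox x0 y0 x1 y1".
        for prop in element.get("title", "").split(";"):
            parts = prop.split()
            if parts and parts[0] == "bbox":
                x0, y0, x1, y1 = (int(v) for v in parts[1:5])
                text = "".join(element.itertext()).strip()
                words.append((text, (x0, y0, x1, y1)))
    return words


for word, box in word_boxes(SAMPLE_HOCR):
    print(word, box)
```

Coordinates like these are what allow a viewer to highlight a search hit on the page image.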
Single Page Ingest
Simple Batch Ingest
Advanced Batch Ingest
Derivative Generation
•  Kakadu > JP2
•  ImageMagick > JPG
•  Ghostscript > PDF
•  Tesseract > OCR/hOCR
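The tool-to-derivative mapping above can be sketched as a set of command lines built from one source TIFF. This is an assumption-laden outline, not the commands Islandora itself runs: it uses only basic invocations of `kdu_compress`, ImageMagick's `convert`, `tesseract`, and `gs`, and it assumes Ghostscript is used to merge existing per-page PDFs into a book-level PDF. File names are hypothetical. The commands are built but not executed, so the sketch runs without the tools installed.

```python
from pathlib import Path
from typing import Dict, List, Sequence


def derivative_commands(tiff: Path, out_dir: Path) -> Dict[str, List[str]]:
    """Build one command per page-level derivative of a source TIFF."""
    stem = out_dir / tiff.stem  # output path without extension
    return {
        # Kakadu: TIFF -> JPEG 2000
        "JP2": ["kdu_compress", "-i", str(tiff), "-o", f"{stem}.jp2"],
        # ImageMagick: TIFF -> display JPEG
        "JPG": ["convert", str(tiff), f"{stem}.jpg"],
        # Tesseract: plain text; the tool appends the extension itself
        "OCR": ["tesseract", str(tiff), str(stem)],
        # Tesseract with the "hocr" config: text plus word coordinates
        "HOCR": ["tesseract", str(tiff), str(stem), "hocr"],
    }


def combine_pdf_command(page_pdfs: Sequence[Path], book_pdf: Path) -> List[str]:
    """Build a Ghostscript command that merges page PDFs into one PDF."""
    return [
        "gs", "-dBATCH", "-dNOPAUSE", "-q",
        "-sDEVICE=pdfwrite",
        f"-sOutputFile={book_pdf}",
        *[str(p) for p in page_pdfs],
    ]


commands = derivative_commands(Path("scans/page-001.tif"), Path("out"))
for datastream, command in commands.items():
    print(datastream, " ".join(command))
```

Each list can be passed to `subprocess.run(command, check=True)` once the corresponding tool is installed. Keeping command construction separate from execution makes the mapping easy to inspect and test.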
Displaying Content: Changes in Islandora 7
•  Greater generalization
•  Deprecation of the Google reader viewer and IIV
•  Viewers packaged as separate modules
Sample Projects (discoverygarden)
University of Manitoba
http://digitalcollections.lib.umanitoba.ca
CalTech
http://caltech.discoverygarden.ca
Williams College
http://unbound.williams.edu
Sample Projects (UPEI)
The Island Magazine
http://vre2.upei.ca/islandmag
PEI Legislative Documents Online
http://peildo.ca/
Prince Edward Island Magazine
http://vre2.upei.ca/peimagazine/
The Charlottetown Guardian
http://newspapers.vre.upei.ca/
Contact Us
David Wilcox
david@discoverygarden.ca
Kirsta Stapelfeldt
kstapelfeldt@upei.ca
