SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Downloaden Sie, um offline zu lesen
Rapid Digitization of Latin American Ephemera with Hydra 
Project History and Goals 
Princeton University Library began to collect and build an archive of Latin American ephemera and gray 
literature in the mid 1970s to document the activities of political and social organizations and movements, as 
well as the broader political, socioeconomic and cultural developments of the region. Access to the material 
was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and 
microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed 
and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable 
during the past decade and microfilming was halted in 2008. 
Hydra breathes new life into this project by providing us with a 
framework for creating an end-to-end application that will facilitate rapid 
digitization, cataloging, and access to this important collection. Since 
the system went into production in April of 2014, nearly 1500 items 
have been cataloged, with the throughput rate ultimately accelerating to 
over 300 items per month in August. 
Princeton has several projects and workflows similar to “LAE”, and we 
expect many of the components of this applications to reusable in future 
projects. 
Item received and placed in 
Folder w/ barcode 
Preliminary metadata 
recorded, Folder placed in 
Box* w/ barcode 
Full Box sent offsite for 
imaging and OCR 
Hard drives returned to library, 
boxes to off site storage 
Images qc’d and ingested 
(barcode associates with 
metadata) 
Metadata completed from 
images 
Final quality control 
Production 
Returned drive is 
organized by 
box/folder barcodes. It 
also contains a 
JHOVE audit with 
checksums 
Generously sponsored by the Council on Library and Information Resources (CLIR; http://www.clir.org/) 
and Latin Americanist Research Resources Project (LARRP; http://www.crl.edu/grn/larrp) 
Image QC includes: 
● Checksums verified 
● Color profiles confirmed 
● Confirm (1) OCR for (1) file 
Process returns a file of arguments which: 
● Instantiating Page (image) and Folder 
objects 
● Creating the appropriate associations 
● This process also creates a JP2 that is 
copied to our image server (Loris). 
A Folder is instantiated in repo. Folder 
barcode, box barcode (via scanner), title, 
country, and genre MUST be included 
before it can be saved. 
State is set to “Has Preliminary 
Metadata” 
Folders move through their 
remaining states: 
● “Has Core Metadata" 
● “Needs QC" 
● "In Production" 
Item Cataloging and Imaging Workflow 
* Boxes have a similar workflow. States 
are: 
1. "New" 
2. "Ready to Ship" 
3. "Shipped" 
4. "Received" 
5. "All in Production" 
@prefix dc: <http://purl.org/dc/terms/> . 
@prefix isolang: <http://id.loc.gov/vocabulary/iso639-2/> . 
@prefix lcco: <http://id.loc.gov/vocabulary/countries/> . 
@prefix lcga: <http://id.loc.gov/vocabulary/geographicAreas/> . 
@prefix lcsh: <http://id.loc.gov/authorities/subjects/> . 
@prefix marcrel: <"http://id.loc.gov/vocabulary/relators/> . 
@prefix puls: <http://pul-store.princeton.edu/> . 
@prefix pulterms: <http://princeton.edu/pulstore/terms/> . 
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . 
@prefix tgm: <http://id.loc.gov/vocabulary/graphicMaterials/> . 
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> . 
puls:00b84 dc:title "Memoria 2003."@es; 
marcrel:mfp lcga:cl; 
pulterms:heightInCM "21"^^xsd:int; 
pulterms:isPartOfProject puls:0005p; 
pulterms:pageCount "15"^^xsd:int; 
pulterms:widthInCM "15"^^xsd:int; 
dc:coverage lcga:cl; 
dc:created "2003"^^xsd:gYear; 
dc:format tgm:tgm007415; 
dc:language isolang:spa; 
dc:publisher "Asociación para la Cooperación en el Sur (ACSUR)-Las Segovias"; 
dc:rights "This digital reproduction is intended to support research, teaching, 
and private study. Users are responsible for determining any copyright questions." 
@en; 
dc:subject lcsh:sh85040810, 
"Economic policy--Social aspects"@en, 
"Economics--Political aspects"@en, 
lcsh:sh85067385 . 
puls:004kr dc:title "The Workforce of Puerto Rico: Fringe Benefits."@en; 
marcrel:mfp lcco:pr; 
pulterms:heightInCM "22"^^xsd:int; 
pulterms:isPartOfProject puls:0005p; 
pulterms:pageCount "17"^^xsd:int; 
[...] 
Rich Metadata Available as RDF/Linked Data

Weitere ähnliche Inhalte

Was ist angesagt?

2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Rdf Overview Presentation
Rdf Overview PresentationRdf Overview Presentation
Rdf Overview PresentationKen Varnum
 
2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jkJohannes Keizer
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables祺傑 林
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLJane Frazier
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesEmir Muñoz
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italianiDiego Valerio Camarda
 
Federated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFedFederated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFedMuhammad Saleem
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialMuhammad Saleem
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector BuilderMark Wilkinson
 
1 bioline & t space or2013 final
1 bioline & t space or2013 final1 bioline & t space or2013 final
1 bioline & t space or2013 finalKellliBee
 
Harnessing The Semantic Web
Harnessing The Semantic WebHarnessing The Semantic Web
Harnessing The Semantic Webwilliam_greenly
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-introDave Reynolds
 
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panicoDiego Valerio Camarda
 
Grails And The Semantic Web
Grails And The Semantic WebGrails And The Semantic Web
Grails And The Semantic Webwilliam_greenly
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introductionGraphity
 

Was ist angesagt? (20)

2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Rdf Overview Presentation
Rdf Overview PresentationRdf Overview Presentation
Rdf Overview Presentation
 
2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
 
Converting GHO to RDF
Converting GHO to RDFConverting GHO to RDF
Converting GHO to RDF
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
 
Keynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C eventKeynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C event
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
 
Federated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFedFederated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFed
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
 
RDF data model
RDF data modelRDF data model
RDF data model
 
1 bioline & t space or2013 final
1 bioline & t space or2013 final1 bioline & t space or2013 final
1 bioline & t space or2013 final
 
Harnessing The Semantic Web
Harnessing The Semantic WebHarnessing The Semantic Web
Harnessing The Semantic Web
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-intro
 
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
 
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOCLinked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
 
Grails And The Semantic Web
Grails And The Semantic WebGrails And The Semantic Web
Grails And The Semantic Web
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
 

Ähnlich wie Rapid Digitization of Latin American Ephemera with Hydra

Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010Claire Stewart
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collectionslljohnston
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesOCLC
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...Trevor Owens
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyCornelius Puschmann
 
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...rhonabwy
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Marcus Smith
 
Cooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAPCooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAPCathal McCauley
 
LOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data ApproachesLOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data ApproachesAdrian Stevenson
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18guest61f1b7d
 
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012lljohnston
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenMarcus Smith
 
Digital Library Project Proposal
Digital Library Project ProposalDigital Library Project Proposal
Digital Library Project ProposalMicah Vandegrift
 
160606 data lifecycle project outline
160606 data lifecycle project outline160606 data lifecycle project outline
160606 data lifecycle project outlineIan Duncan
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Adrian Stevenson
 

Ähnlich wie Rapid Digitization of Latin American Ephemera with Hydra (20)

Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
 
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
 
Cooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAPCooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAP
 
LOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data ApproachesLOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data Approaches
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18
 
BatIg
BatIgBatIg
BatIg
 
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
A Guide for Reproducible Research
A Guide for Reproducible ResearchA Guide for Reproducible Research
A Guide for Reproducible Research
 
The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from Sweden
 
Digital Library Project Proposal
Digital Library Project ProposalDigital Library Project Proposal
Digital Library Project Proposal
 
160606 data lifecycle project outline
160606 data lifecycle project outline160606 data lifecycle project outline
160606 data lifecycle project outline
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?
 

Mehr von Jon Stroop

A more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDMA more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDMJon Stroop
 
Introduction to the IIIF Image API
Introduction to the IIIF Image APIIntroduction to the IIIF Image API
Introduction to the IIIF Image APIJon Stroop
 
IIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, COIIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, COJon Stroop
 
IIIF API Specifications Overview
IIIF API Specifications OverviewIIIF API Specifications Overview
IIIF API Specifications OverviewJon Stroop
 
Meet Loris and OpenSeadragon
Meet Loris and OpenSeadragonMeet Loris and OpenSeadragon
Meet Loris and OpenSeadragonJon Stroop
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian ArtJon Stroop
 

Mehr von Jon Stroop (6)

A more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDMA more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDM
 
Introduction to the IIIF Image API
Introduction to the IIIF Image APIIntroduction to the IIIF Image API
Introduction to the IIIF Image API
 
IIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, COIIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, CO
 
IIIF API Specifications Overview
IIIF API Specifications OverviewIIIF API Specifications Overview
IIIF API Specifications Overview
 
Meet Loris and OpenSeadragon
Meet Loris and OpenSeadragonMeet Loris and OpenSeadragon
Meet Loris and OpenSeadragon
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
 

Kürzlich hochgeladen

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 

Rapid Digitization of Latin American Ephemera with Hydra

  • 1. Rapid Digitization of Latin American Ephemera with Hydra Project History and Goals Princeton University Library began to collect and build an archive of Latin American ephemera and gray literature in the mid 1970s to document the activities of political and social organizations and movements, as well as the broader political, socioeconomic and cultural developments of the region. Access to the material was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable during the past decade and microfilming was halted in 2008. Hydra breathes new life into this project by providing us with a framework for creating an end-to-end application that will facilitate rapid digitization, cataloging, and access to this important collection. Since the system went into production in April of 2014, nearly 1500 items have been cataloged, with the throughput rate ultimately accelerating to over 300 items per month in August. Princeton has several projects and workflows similar to “LAE”, and we expect many of the components of this applications to reusable in future projects. Item received and placed in Folder w/ barcode Preliminary metadata recorded, Folder placed in Box* w/ barcode Full Box sent offsite for imaging and OCR Hard drives returned to library, boxes to off site storage Images qc’d and ingested (barcode associates with metadata) Metadata completed from images Final quality control Production Returned drive is organized by box/folder barcodes. It also contains a JHOVE audit with checksums Generously sponsored by the Council on Library and Information Resources (CLIR; http://www.clir.org/) and Latin Americanist Research Resources Project (LARRP; http://www.crl.edu/grn/larrp) Image QC includes: ● Checksums verified ● Color profiles confirmed ● Confirm (1) OCR for (1) file Process returns a file of arguments which: ● Instantiating Page (image) and Folder objects ● Creating the appropriate associations ● This process also creates a JP2 that is copied to our image server (Loris). A Folder is instantiated in repo. Folder barcode, box barcode (via scanner), title, country, and genre MUST be included before it can be saved. State is set to “Has Preliminary Metadata” Folders move through their remaining states: ● “Has Core Metadata" ● “Needs QC" ● "In Production" Item Cataloging and Imaging Workflow * Boxes have a similar workflow. States are: 1. "New" 2. "Ready to Ship" 3. "Shipped" 4. "Received" 5. "All in Production" @prefix dc: <http://purl.org/dc/terms/> . @prefix isolang: <http://id.loc.gov/vocabulary/iso639-2/> . @prefix lcco: <http://id.loc.gov/vocabulary/countries/> . @prefix lcga: <http://id.loc.gov/vocabulary/geographicAreas/> . @prefix lcsh: <http://id.loc.gov/authorities/subjects/> . @prefix marcrel: <"http://id.loc.gov/vocabulary/relators/> . @prefix puls: <http://pul-store.princeton.edu/> . @prefix pulterms: <http://princeton.edu/pulstore/terms/> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix tgm: <http://id.loc.gov/vocabulary/graphicMaterials/> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . puls:00b84 dc:title "Memoria 2003."@es; marcrel:mfp lcga:cl; pulterms:heightInCM "21"^^xsd:int; pulterms:isPartOfProject puls:0005p; pulterms:pageCount "15"^^xsd:int; pulterms:widthInCM "15"^^xsd:int; dc:coverage lcga:cl; dc:created "2003"^^xsd:gYear; dc:format tgm:tgm007415; dc:language isolang:spa; dc:publisher "Asociación para la Cooperación en el Sur (ACSUR)-Las Segovias"; dc:rights "This digital reproduction is intended to support research, teaching, and private study. Users are responsible for determining any copyright questions." @en; dc:subject lcsh:sh85040810, "Economic policy--Social aspects"@en, "Economics--Political aspects"@en, lcsh:sh85067385 . puls:004kr dc:title "The Workforce of Puerto Rico: Fringe Benefits."@en; marcrel:mfp lcco:pr; pulterms:heightInCM "22"^^xsd:int; pulterms:isPartOfProject puls:0005p; pulterms:pageCount "17"^^xsd:int; [...] Rich Metadata Available as RDF/Linked Data