Isabella Stewart Gardner Museum Orientation
Sept 17, 2015
Welcome to the Cooperative!
Preservation as a Process
MetaArchive and Distributed Digital Preservation
Sam Meister
Deanna Ulvestad
OhioDIG Meeting
March 9, 2016
MetaArchive History
● Founded 2004
● Distributed digital preservation cooperative
● Preservation aims: prevent loss and
corruption from human malice/error or from a
disaster
● First (known) preservation network to
preserve special collections/unique materials
Hallmarks
● Distributed digital preservation
● Institutions maintain control over their own content
● Preservation as a process, not a push-button exercise
● Simplicity in ingest, management
Membership
● Auburn University
● Boston College
● Cal Poly San Luis Obispo
● Consorci de Biblioteques Universitaris de Catalunya
● Florida State University
● Isabella Stewart Gardner Museum
● Greene County Public Library
● HBCU Library Alliance
● Indiana State University
● Oregon State University
● Penn State University
● Pontificia Universidade Catolica do Rio de Janeiro
● Purdue University
● Rockefeller Archive Center
● University of Louisville
● University of North Texas
● University of South Carolina
● Virginia Tech University
MetaArchive Practices
● Basic processes
○ “Producer” (OAIS) determines curation practices;
brings SIPs to MetaArchive
○ Multiple copies of AIPs dispersed across
geographical, political, and environmental lines
○ Checks and repairs automated across network
○ Deaccession cycle versus data deletion
MetaArchive Pricing
● Three membership levels
○ Collaborative members: $2.5K/year
○ Preservation members: $3K/year
○ Sustaining members: $5.5K/year
● Server cost: <$5K/term
● Storage cost: $585/TB/year
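As a rough worked example of the pricing figures on this slide, the sketch below adds a membership fee to a storage charge. The function name `annual_fees` and the decision to leave out the per-term server cost are assumptions made for illustration, not a MetaArchive billing formula.

```python
# Figures taken from the pricing slide (US dollars per year).
MEMBERSHIP_FEES = {
    "collaborative": 2500,
    "preservation": 3000,
    "sustaining": 5500,
}
STORAGE_PER_TB = 585  # $/TB/year


def annual_fees(level: str, storage_tb: float) -> float:
    """Membership fee plus storage charge for one year.

    Hypothetical helper: the server cost (<$5K/term) is not included
    because it is charged per term rather than per year.
    """
    return MEMBERSHIP_FEES[level] + STORAGE_PER_TB * storage_tb
```

For instance, `annual_fees("preservation", 2)` combines the $3K membership with 2 TB of storage at $585/TB.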
Membership Responsibilities
● Undertake a 3-year membership term
● Take responsibility for content preparation,
evaluation, staging, and ingest testing
● Monitor collections to ensure accurate
long-term preservation
● Host and maintain a MetaArchive cache
(server) or pay a technology support fee
● Consider contributing to Committees!
Cooperative Preservation
● MetaArchive is a cooperative, not a vendor:
○ All hardware and software assets are owned by members
○ Membership fees and storage fees go to a central pool of support for members’ co-op activities
Philosophy in Practice
● Compatible with any repository system
○ E.g., DSpace, Fedora, Archivalware, ETDb, CONTENTdm, BePress, Digital Commons, etc.
● Member institutions determine their own curatorial practices
● MetaArchive is a community of support to help them make informed decisions
Ingest Demo / Overview
Prepare SIP
Stage Collection
● Collections consist of one or more Archival Units (AUs)
● Archival Units contain content and metadata
● Collections are organized so that they can be restored later
● Include documentation on restoration procedures
● Make the collection web accessible at a URL
http://metaarchive-staging.lib.calpoly.edu
Create Collection
● Create collection-level metadata in the Conspectus management tool
○ Title
○ Archive
○ Description
○ Base URL
Create Manifest Page
● Simple HTML page with basic collection description information and links to collection content for LOCKSS crawlers
● LOCKSS crawlers MUST find a permission statement to be able to harvest content
● Place the Manifest page on the same host as the content
http://metaarchive-staging.lib.calpoly.edu/mabagitmanifest.html
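A minimal sketch of what generating such a manifest page could look like. The function name `build_manifest`, the page layout, and the exact permission wording are assumptions made for illustration; check the permission statement against current LOCKSS documentation before relying on it.

```python
from html import escape

# Assumed wording of the LOCKSS permission statement (verify before use).
PERMISSION = (
    "LOCKSS system has permission to collect, preserve, "
    "and serve this Archival Unit"
)


def build_manifest(title: str, description: str, au_urls: list[str]) -> str:
    """Return a simple HTML manifest page (hypothetical layout).

    The page carries a short collection description, one link per
    Archival Unit for the crawler to follow, and the permission statement.
    """
    links = "\n".join(
        f'    <li><a href="{escape(url, quote=True)}">{escape(url)}</a></li>'
        for url in au_urls
    )
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        f"<head><title>{escape(title)} Manifest</title></head>\n"
        "<body>\n"
        f"  <h1>{escape(title)}</h1>\n"
        f"  <p>{escape(description)}</p>\n"
        "  <ul>\n"
        f"{links}\n"
        "  </ul>\n"
        f"  <p>{PERMISSION}</p>\n"
        "</body>\n"
        "</html>\n"
    )
```

The resulting file would be saved on the same host as the staged content, so that the crawler finds the permission statement and the content links together.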
Develop Collection Plugin
● Plugins tell member caches where to find a designated Manifest page and how far to follow the links to harvest collection content
[Diagram: a member cache uses plugin edu.calp.bagitplugin2 to harvest AUs from http://metaarchive-staging.lib.calpoly.edu]
● Member creates new plugin via Conspectus based on existing plugin, or uploads custom plugin
● Member gives plugin a unique name
● Member defines plugin rules to determine which files will be harvested
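To illustrate the idea of plugin rules (a start page, a link depth, and include/exclude rules), here is a simplified sketch. It is not the LOCKSS plugin format: the names `make_crawl_filter` and `plan_harvest`, the regex-based rules, and the in-memory link graph are all hypothetical stand-ins for what a real plugin and crawler do.

```python
import re
from collections import deque
from urllib.parse import urlparse


def make_crawl_filter(base_url, include_patterns, exclude_patterns=()):
    """Build a predicate that decides whether a URL should be harvested."""
    base = urlparse(base_url)
    includes = [re.compile(p) for p in include_patterns]
    excludes = [re.compile(p) for p in exclude_patterns]

    def should_harvest(url):
        parsed = urlparse(url)
        # Stay on the host that serves the manifest page.
        if (parsed.scheme, parsed.netloc) != (base.scheme, base.netloc):
            return False
        if any(p.search(parsed.path) for p in excludes):
            return False
        return any(p.search(parsed.path) for p in includes)

    return should_harvest


def plan_harvest(manifest_url, link_graph, should_harvest, max_depth):
    """Breadth-first walk from the manifest, following links up to max_depth.

    link_graph maps a URL to the links found on that page; a real crawler
    would fetch and parse each page instead.
    """
    harvested, seen = [], {manifest_url}
    queue = deque([(manifest_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth == max_depth:
            continue
        for link in link_graph.get(url, []):
            if link in seen or not should_harvest(link):
                continue
            seen.add(link)
            harvested.append(link)
            queue.append((link, depth + 1))
    return harvested
```

Keeping the rules as a separate predicate mirrors the slide's split between where to start (the Manifest page), how far to follow links, and which files qualify.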
Test & Review
Test Collection Plugin
● Member tests plugin locally and makes changes as needed
● Member defines Plugin name and Archival Units in Conspectus
Review Plugin & Test Ingest
● Member requests plugin review and test by MetaArchive staff
● MetaArchive staff ingests collection to test network
[Diagram: the plugin directs three test caches to harvest the collection's AUs]
● MetaArchive staff sends member test ingest report to review
SIP to AIP
Commit Plugin
● If test ingest is successful, MetaArchive staff commits the plugin to the production plugin repository
Complete Collection Metadata
Make collection available to network
● MetaArchive staff regenerates the LOCKSS Title Database to expose the collection to the production network
● MetaArchive staff assigns six geographically distributed caches to crawl and harvest the collection
Replicate collection
[Diagram: the collection's AUs replicated across six member caches]
Auditing Control
Voting and Polling
[Diagram: caches holding copies of the same AU compare checksums of its files]
cf2304e9b416e4c6e4d7a1bb22bf95e4 data/ua-sel_00000268-M.wav
046763d382e557359731edc1d5a8b821 data/dm-ua-sel_bag_002_MODSmetadata.xml
cf9beab2c63082d0d0b40ce9a8faa0a6 data/ua-sel_00000265_001-M.wav
733298738956be7ff4d9ed6b5d021e56 data/ua-sel_00000259-M.wav
Damage and Repair
[Diagram: two copies of the same AU; the checksum of the last file differs]
Copy 1:
cf2304e9b416e4c6e4d7a1bb22bf95e4 data/ua-sel_00000268-M.wav
046763d382e557359731edc1d5a8b821 data/dm-ua-sel_bag_002_MODSmetadata.xml
cf9beab2c63082d0d0b40ce9a8faa0a6 data/ua-sel_00000265_001-M.wav
733298738956be7ff4d9ed6b5d021e56 data/ua-sel_00000259-M.wav
Copy 2:
cf2304e9b416e4c6e4d7a1bb22bf95e4 data/ua-sel_00000268-M.wav
046763d382e557359731edc1d5a8b821 data/dm-ua-sel_bag_002_MODSmetadata.xml
cf9beab2c63082d0d0b40ce9a8faa0a6 data/ua-sel_00000265_001-M.wav
733298738956be7ff4d9ed6b5d021e57 data/ua-sel_00000259-M.wav
[Diagram: two caches holding the AU exchange messages]
“Hi there. Remind me, have we talked before?”
“Yep. We go way back.”
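The voting, damage detection, and repair steps can be sketched as a plain majority comparison of per-file checksums. This is a deliberate simplification and not the LOCKSS polling protocol itself; the cache names and the functions `run_poll`, `find_damage`, and `repair` are hypothetical, while the checksums are the ones shown on the slides.

```python
from collections import Counter

# Checksums copied from the slides.
GOOD = {
    "data/ua-sel_00000268-M.wav": "cf2304e9b416e4c6e4d7a1bb22bf95e4",
    "data/dm-ua-sel_bag_002_MODSmetadata.xml": "046763d382e557359731edc1d5a8b821",
    "data/ua-sel_00000265_001-M.wav": "cf9beab2c63082d0d0b40ce9a8faa0a6",
    "data/ua-sel_00000259-M.wav": "733298738956be7ff4d9ed6b5d021e56",
}
# The damaged copy differs in the last file only.
DAMAGED = dict(GOOD)
DAMAGED["data/ua-sel_00000259-M.wav"] = "733298738956be7ff4d9ed6b5d021e57"


def run_poll(copies):
    """Return {path: checksum} for every file where a strict majority agrees."""
    consensus = {}
    paths = set().union(*(files.keys() for files in copies.values()))
    for path in sorted(paths):
        votes = Counter(files.get(path) for files in copies.values())
        checksum, count = votes.most_common(1)[0]
        if checksum is not None and count * 2 > len(copies):
            consensus[path] = checksum
    return consensus


def find_damage(copies, consensus):
    """List (cache, path) pairs whose checksum disagrees with the consensus."""
    return [
        (cache, path)
        for cache, files in sorted(copies.items())
        for path, checksum in sorted(consensus.items())
        if files.get(path) != checksum
    ]


def repair(copies, damage, consensus):
    """Stand-in for repair: a real cache would re-fetch the file from a peer."""
    for cache, path in damage:
        copies[cache][path] = consensus[path]
```

A usage sketch: build `copies = {"cache_a": dict(GOOD), "cache_b": dict(DAMAGED), "cache_c": dict(GOOD)}`, call `run_poll(copies)`, pass the result to `find_damage`, and then to `repair`; a second `find_damage` call should then report nothing.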
Thanks!
metaarchive.org
sam@educopia.org
@samalanmeister
Getting Started
● November 2010: Attended 5-day workshop “Digital Preservation Management,” University of Michigan
● August 2011: Compared digital preservation repository options
● April 2012: Joined MetaArchive as a Preservation Member
● January 2013: Started ingesting collections
Greene County Public Library was housed in the Carnegie building from 1906 – 1978. Xenia, Ohio.
Why MetaArchive
◼ Transparent
◼ Affordable
◼ Community-based
◼ Supportive
◼ Diverse
First bookmobile used by the Greene County Public Library from 1948 – 1958. Xenia, Ohio.
Modified Ingest: GCPL
[Diagram 1: CONTENTdm Server; GCPL Archive Server; one MetaArchive Server ingests GCPL Archive Units; further MetaArchive Servers shown]
[Diagram 2: same layout; MetaArchive Servers replicate GCPL Archive Units]
Cost of a Digital Time Capsule…
Library Paid in 2015
Preservation Membership                 $3,000
Technology Fee                          $1,000
Storage ($0.50 per GB x 3,600 GB)       $1,800
Total MetaArchive Fees 2015             $4,800
Greene County Courthouse Time Capsule of 1901 opened in 2001. Xenia, Ohio.
