Herbert Van de Sompel
OCLC ESR, Washington, DC, December 10 2014
Archiving the Evolving Scholarly Record: A Perspective
Herbert Van de Sompel
@hvdsomp
Los Alamos National Laboratory
Acknowledgments: Andrew Treloar, @atreloar , ANDS
In This Talk
1. Functions of scholarly communication
2. Pointers to the future
3. Characterizing the future
4. Archiving the future
Functions of Scholarly Communication
• Registration: Allows claims of precedence for a scholarly finding
• Certification: Establishes validity of the claim
• Awareness: Allows actors in the system to remain aware of new claims
• Archiving: Preserves the scholarly record over time
Roosendaal, H, Geurts, C. (1997) Forces and functions in scientific communication
http://www.physik.uni-oldenburg.de/conferences/crisp97/roosendaal.html
System of Journals, Paper Version
• Registration: Manuscript submission
• Certification: Peer review
• Awareness: alerts, library shelf surfing
• Archiving: Journals in library stacks
System of Journals, Digital Version
• Registration: Manuscript submission
• Certification: Peer review
• Awareness: Various web discovery services
• Archiving: Special purpose archives (e.g. Portico), publishers
In This Talk
1. Functions of scholarly communication
2. Pointers to the future
3. Characterizing the future
4. Archiving the future
Pointers to the Future
“The future is already here – it’s just not
very evenly distributed”
William Gibson
Gibson, W. (1999) The Science in Science Fiction, NPR Interview
http://www.npr.org/templates/story/story.php?storyId=1067220
Registration - BioRxiv
http://biorxiv.org
Registration - GitHub
http://github.com
Registration – slideshare
http://www.slideshare.net/hvdsomp/presentations
Registration - WikiPathways
http://wikipathways.org/index.php/WikiPathways
Registration - Neurolex
http://neurolex.org/wiki/Category:Olfactory_cortex_horizontal_cell
Registration – Research Objects
http://researchobject.org/
Registration - Observations
• Registration of wide variety of objects
• dynamic, compound, inter-related, distributed across the web
• Decoupling registration from certification
• Time stamping, versioning
Certification – PubMed Commons
http://www.ncbi.nlm.nih.gov/pubmedcommons/
Certification – The Open Journal
http://theoj.org
Certification – slideshare
http://www.slideshare.net/hvdsomp/presentations
Certification – Project FeederWatch
http://feederwatch.org
Certification - Observations
• Certification decoupled from registration
• Certification of various types of objects
• Social interactions validating
• Machines validating
Awareness – Twitter
http://twitter.com
Awareness – myexperiment
http://myexperiment.org/
Awareness – NARCIS
http://narcis.nl/
Awareness – eLabNoteBook RSS Feeds
http://malaria.ourexperiment.org/feeds
Awareness - Observations
• Awareness for various types of objects
• Real time awareness
• Awareness through social media
Archiving – CLOCKSS
http://www.clockss.org/
Archiving – DANS Easy
http://easy.dans.knaw.nl/
Archiving – Australian Antarctic Data Centre
http://data.aad.gov.au/
Archiving – perma.cc
http://perma.cc
Archiving – EU Trusted Digital Repositories
http://trusteddigitalrepository.eu/Site/Welcome.html
Archiving - Observations
• Archiving/Archives for various types of objects
• Distributed archives
• Archival consortia
• Audit for trustworthiness
In This Talk
1. Functions of scholarly communication
2. Pointers to the future
3. Characterizing the future
4. Archiving the future
The Future
• Registration
o Wide variety of objects
o Versions of objects
o Interrelated, interdependent objects
• Certification
o Variety of certification mechanisms
o Decoupled from / Overlaid upon Registration
• Awareness
o Real-time
o Social
o Variety of objects
• Archiving …
Characterizing the Future – Scholarly Communication
Characterizing the Future – Communicated Objects
In This Talk
1. Functions of scholarly communication
2. Pointers to the future
3. Characterizing the future
4. Archiving the future
The Future – Core Observations
• The research process, not just its outcome, is becoming visible … on the web
• Massive extension of the scholarly record with an enormous variety of novel objects
• The objects are heterogeneous, dynamic, compound, inter-related and distributed across the web
• The objects are often hosted on common web platforms that are not dedicated to scholarship
The archival paradigm must take these characteristics into account
Considerations about Archiving
• On the right track?
• Capturing paradigms
• Pockets of persistence
• Recording versus Archiving
• A perspective on scholarly infrastructure
Web-Based Journal System – Links to Articles
• Special-purpose archival solutions for articles
• Rosenthal finds that what is archived is too little, too healthy, too easy
• Attempts with the Keepers Registry to map out what is archived
• Based on [ISSN, volume, issue], not on DOI or HTTP URI
David Rosenthal (2013) Patio Perspectives at ANADP II: Preserving the Other Half
http://blog.dshr.org/2013/11/patio-perspectives-at-anadp-ii.html
Web-Based Journal System – Links to Articles
Peter Burnhill (2014) Ensuring access to digital back copy
http://www.cni.org/topics/digital-preservation/ensuring-access-to-digital-back-copy/
Web-Based Journal System – Links to Web at Large Resources
• Web archives contain snapshots, the result of incidental archiving
• The Hiberlink project finds that for the large majority of these “Web at Large” resources, no temporally appropriate archived versions exist
• Memento infrastructure allows auditing what is globally archived based on HTTP URI
http://hiberlink.org
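The Memento protocol (RFC 7089) audits archival coverage by sending a resource's HTTP URI to a TimeGate together with an `Accept-Datetime` header. The following is a minimal sketch of building such a request with the Python standard library. The aggregator prefix `TIMEGATE_PREFIX` and the helper name `build_timegate_request` are assumptions made for illustration and are not taken from the slides. The request is only constructed here, not sent.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from urllib.request import Request

# Assumption: a public Memento aggregator TimeGate. Any RFC 7089 TimeGate
# that accepts the original URI appended to its base URL could be used.
TIMEGATE_PREFIX = "http://timetravel.mementoweb.org/timegate/"


def build_timegate_request(original_uri: str, when: datetime) -> Request:
    """Build (but do not send) a datetime-negotiation request for a URI."""
    when_utc = when.astimezone(timezone.utc)
    # Accept-Datetime uses the HTTP date format, always expressed in GMT.
    headers = {"Accept-Datetime": format_datetime(when_utc, usegmt=True)}
    return Request(TIMEGATE_PREFIX + original_uri, headers=headers)


req = build_timegate_request(
    "http://hiberlink.org",
    datetime(2014, 12, 10, tzinfo=timezone.utc),
)
print(req.full_url)
print(req.get_header("Accept-datetime"))
```

Passing `req` to `urllib.request.urlopen` would follow the TimeGate's redirect to an archived copy, if one exists. The response's `Memento-Datetime` header then reports when that copy was captured.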
Links Abstracted to Top Level Domain Targets
Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found
To appear in PLoS ONE on December 26 2014
Loss of Current Context – Link Rot
Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found
To appear in PLoS ONE on December 26 2014
Loss of Past Context – Archival Status (14 day window)
Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found
To appear in PLoS ONE on December 26 2014
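One way to read the "14 day window" in the slide title is as a test of whether any archived copy of a referenced resource was captured close enough to the date of the reference. The sketch below illustrates that reading only. The exact criterion used in the cited study is not given in the slides, and the function name and sample dates are hypothetical.

```python
from datetime import datetime, timedelta
from typing import Iterable


def has_temporally_appropriate_memento(
    reference_dt: datetime,
    memento_dts: Iterable[datetime],
    window_days: int = 14,
) -> bool:
    """Return True if any capture lies within the window around the reference date."""
    window = timedelta(days=window_days)
    return any(abs(m - reference_dt) <= window for m in memento_dts)


# Hypothetical data: an article published on 10 December 2014 that links to
# a page with two archived captures.
published = datetime(2014, 12, 10)
captures = [datetime(2014, 11, 1), datetime(2014, 12, 20)]
print(has_temporally_appropriate_memento(published, captures))
```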
Considerations about Archiving
• On the right track?
• Capturing paradigms
• Pockets of persistence
• Recording versus Archiving
• A perspective on scholarly infrastructure
Perspective on “Repository” Capture Paradigm
• Atomic object
• Finalized object
• Removal of context
• Perspective on object: file in a file system
• Capture request by owner of object
• Capture time decided by owner of object
Perspective on “Web” Capture Paradigm
• Compound object (context essential)
• Constituents of compound object in flux
• Perspective on constituents: resources with URIs on the web
• Capture request by user of the constituents, whether owned by self or by third parties
• Capture time decided by user of the constituents
Considerations about Archiving
• On the right track?
• Capturing paradigms
• Pockets of persistence
• Recording versus Archiving
• A perspective on scholarly infrastructure
Creating Pockets of Persistence
How to achieve the ability to:
• Persistently
• Precisely
• Seamlessly
revisit the Scholarly Web of the Past and of the Now at some point in the Future
This challenge exists for the entire web, but some communities actually care about addressing it:
• scholarly communication,
• legal publications,
• journalism,
• Wikipedia,
• …
Pro-Active Capture for a Seed Collection
• Seed Collection - Starting point for capture is a seed collection of interest to communities that care, e.g.
o Scholarly literature
o Legal documents
o Online journalism
o Wikipedia articles
• Lifecycle Events – Intervene at critical moments in the lifecycle of items in these collections to pro-actively capture
o Collection items – some solutions in place
o Web resources referenced in collection items
Pro-Active Capture for a Seed Collection
• Request by user of A to capture A, B, C, D, E
• Request for capture may result in
• In-situ or remote capture
• Creation of snapshot or creation of trace
• Archival URI, capture datetime
• Interoperability for on-demand capture
• Orchestration of capture process
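The capture request described above can be sketched as a small orchestration routine: a user of item A asks for A and the resources it references to be captured, and each capture yields an archival URI and a capture datetime. This is an illustrative sketch, not the interface of any existing service. `CaptureRecord`, `capture_item_and_references`, `fake_archive`, and the `example` URIs are all hypothetical names, and the archive is injected as a plain function so that an in-situ or a remote capture mechanism could be substituted.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Callable, List


@dataclass(frozen=True)
class CaptureRecord:
    original_uri: str
    archival_uri: str
    captured_at: datetime


def capture_item_and_references(
    item_uri: str,
    referenced_uris: List[str],
    archive: Callable[[str], str],
) -> List[CaptureRecord]:
    """Capture an item and the resources it references, once each."""
    records: List[CaptureRecord] = []
    seen = set()
    for uri in [item_uri, *referenced_uris]:
        if uri in seen:
            continue  # skip duplicate references
        seen.add(uri)
        archival_uri = archive(uri)  # hypothetical archive call
        records.append(
            CaptureRecord(uri, archival_uri, datetime.now(timezone.utc))
        )
    return records


def fake_archive(uri: str) -> str:
    """Stand-in for a real archive; returns a made-up archival URI."""
    return "https://archive.example/" + uri


records = capture_item_and_references(
    "http://example.org/A",
    ["http://example.org/B", "http://example.org/C", "http://example.org/B"],
    fake_archive,
)
for record in records:
    print(record.original_uri, "->", record.archival_uri)
```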
Pro-Active Capture for Seed Collection
• Which lifecycle events are crucial may depend on the collection type
Wikipedia
• Creation of new article
• Creation of new version of article
• Creation of substantially new version of article
• Addition of external reference to article
• References to article exceed a certain threshold
Scholarly Literature
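The Wikipedia lifecycle events listed above could be modelled as an enumeration, with a per-collection policy deciding which events trigger a capture. This is a hypothetical sketch: the event names, the policy set, and the threshold value are assumptions made for illustration, and the slides do not prescribe any of them.

```python
from enum import Enum, auto
from typing import Set


class WikiEvent(Enum):
    ARTICLE_CREATED = auto()
    VERSION_CREATED = auto()
    SUBSTANTIAL_VERSION_CREATED = auto()
    EXTERNAL_REFERENCE_ADDED = auto()
    INBOUND_REFERENCES_OVER_THRESHOLD = auto()


def should_capture(event: WikiEvent, policy: Set[WikiEvent]) -> bool:
    """A collection-specific policy decides which events trigger capture."""
    return event in policy


def crossed_threshold(previous_count: int, new_count: int, threshold: int) -> bool:
    """True only at the moment the reference count first reaches the threshold."""
    return previous_count < threshold <= new_count


# Hypothetical policy: capture on substantial edits and newly added references,
# but not on every minor revision.
policy = {
    WikiEvent.ARTICLE_CREATED,
    WikiEvent.SUBSTANTIAL_VERSION_CREATED,
    WikiEvent.EXTERNAL_REFERENCE_ADDED,
}
print(should_capture(WikiEvent.VERSION_CREATED, policy))
print(crossed_threshold(previous_count=9, new_count=10, threshold=10))
```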
Scholarly Literature: Experimental Zotero Extension
Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero
https://www.youtube.com/v/ZYmi_Ydr65M%26vq
Scholarly Literature: Experimental HiberActive Service
Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references
Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive
Considerations about Archiving
• On the right track?
• Capturing paradigms
• Pockets of persistence
• Recording versus Archiving
• A perspective on scholarly infrastructure
Web Platforms for Scholarship
• Increasingly, common web platforms are used for scholarship
• GitHub, Wikis, Wordpress, etc.
• Many of these platforms have desirable characteristics
• Versioning
• Time stamping
• Social embedding
• But, these platforms record rather than archive
Recording is not Archiving
“GitHub reserves the right at any time and from time to time to
modify or discontinue, temporarily or permanently, the Service (or
any part thereof) with or without notice.”
“GitHub does not warrant that (i) the service will meet your specific
requirements, (ii) the service will be uninterrupted, timely, secure, or
error-free, (iii) the results that may be obtained from the use of the
service will be accurate or reliable, (iv) the quality of any products,
services, information, or other material purchased or obtained by
you through the service will meet your expectations, and (v) any
errors in the Service will be corrected.”
GitHub Terms of Service
http://help.github.com/articles/github-terms-of-service
Recording versus Archiving
Recording               | Archiving
Short-term              | Longer-term
No guarantees provided  | Attempt to provide guarantees
Write many/read many    | Write once/read many
Scholarly process       | Scholarly record
Considerations about Archiving
• On the right track?
• Capturing paradigms
• Recording versus Archiving
• A perspective on scholarly infrastructure
Infrastructure Considerations
• Various incentives to move objects from Private to Recording:
• Share with self or team, comply with funder requirements
• Objects in Recording are network accessible and in the global (HTTP) namespace
• Within reach of web-scale processes aimed at selectively moving them from Recording to Archiving
• Core aspects of these processes include
• Ability to snapshot the state of interlinked objects at specific moments in their lifecycle
• Transfer of snapshots from Recording platforms to appropriate, distributed Archive platforms (interoperability)
• Curatorial decisions regarding what should be captured
Curatorial Considerations
• What are the criteria involved in deciding (which states of) which objects get captured/archived?
• What triggers transition from Recording to Archiving?
• On-demand in lifecycle, social status of the object, reference made to object, deliberate randomness for serendipity, …
• What to archive?
• Snapshot of object or trace of object (metadata, provenance, …)?
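The distinction between a snapshot and a trace can be illustrated with two record shapes: a snapshot keeps the object's bytes (here with a digest for later integrity checks), while a trace keeps only descriptive information about the object. The field names and helper functions below are hypothetical and chosen for illustration. The slides do not define a record format.

```python
import hashlib
from datetime import datetime, timezone


def make_snapshot(uri: str, content: bytes, captured_at: datetime) -> dict:
    """Snapshot: the state of the object itself, plus a fixity digest."""
    return {
        "kind": "snapshot",
        "uri": uri,
        "captured_at": captured_at.isoformat(),
        "sha256": hashlib.sha256(content).hexdigest(),
        "content": content,
    }


def make_trace(uri: str, metadata: dict, captured_at: datetime) -> dict:
    """Trace: metadata and provenance about the object, without its content."""
    return {
        "kind": "trace",
        "uri": uri,
        "captured_at": captured_at.isoformat(),
        "metadata": dict(metadata),
    }


when = datetime(2014, 12, 10, tzinfo=timezone.utc)
snapshot = make_snapshot("http://example.org/object", b"object state", when)
trace = make_trace("http://example.org/object", {"creator": "unknown"}, when)
print(snapshot["kind"], snapshot["sha256"][:12])
print(trace["kind"], trace["metadata"])
```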
Final Considerations
• Need organizational, technical, and curatorial interfaces between Recording and Archiving platforms
• Need organizational and technical interfaces across Archiving platforms
 

Kürzlich hochgeladen

Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
SofiyaSharma5
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Sheetaleventcompany
 

Kürzlich hochgeladen (20)

Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
 
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
 
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
 
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
 
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
Russian Call Girls in %(+971524965298  )#  Call Girls in DubaiRussian Call Girls in %(+971524965298  )#  Call Girls in Dubai
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 

A Perspective on Archiving the Scholarly Record

  • 1. Herbert Van de Sompel OCLC ESR, Washington, DC, December 10 2014 Archiving the Evolving Scholarly Record: A Perspective Herbert Van de Sompel @hvdsomp Los Alamos National Laboratory Acknowledgments: Andrew Treloar, @atreloar , ANDS
  • 2. In This Talk 1. Functions of scholarly communication 2. Pointers to the future 3. Characterizing the future 4. Archiving the future
  • 3. Functions of Scholarly Communication • Registration: Allows claims of precedence for a scholarly finding • Certification: Establishes validity of the claim • Awareness: Allows actors in the system to remain aware of new claims • Archiving: Preserves the scholarly record over time Roosendaal, H, Geurts, C. (1997) Forces and functions in scientific communication http://www.physik.uni-oldenburg.de/conferences/crisp97/roosendaal.html
  • 4. System of Journals, Paper Version • Registration: Manuscript submission • Certification: Peer review • Awareness: Alerts, library shelf surfing • Archiving: Journals in library stacks
  • 5. System of Journals, Digital Version • Registration: Manuscript submission • Certification: Peer review • Awareness: Various web discovery services • Archiving: Special purpose archives (e.g. Portico), publishers
  • 6. In This Talk 1. Functions of scholarly communication 2. Pointers to the future 3. Characterizing the future 4. Archiving the future
  • 7. Pointers to the Future “The future is already here – it’s just not very evenly distributed” William Gibson Gibson, W. (1999) The Science in Science Fiction, NPR Interview http://www.npr.org/templates/story/story.php?storyId=1067220
  • 8. Registration - BioRxiv http://biorxiv.org
  • 9. Registration - GitHub http://github.com
  • 10. Registration – slideshare http://www.slideshare.net/hvdsomp/presentations
  • 11. Registration - WikiPathways http://wikipathways.org/index.php/WikiPathways
  • 12. Registration - Neurolex http://neurolex.org/wiki/Category:Olfactory_cortex_horizontal_cell
  • 13. Registration – Research Objects http://researchobject.org/
  • 14. Registration - Observations • Registration of wide variety of objects • dynamic, compound, inter-related, distributed across the web • Decoupling registration from certification • Time stamping, versioning
  • 15. Certification – PubMed Commons http://www.ncbi.nlm.nih.gov/pubmedcommons/
  • 16. Certification – The Open Journal http://theoj.org
  • 17. Certification – slideshare http://www.slideshare.net/hvdsomp/presentations
  • 18. Certification – Project FeederWatch http://feederwatch.org
  • 19. Certification - Observations • Certification decoupled from registration • Certification of various types of objects • Social interactions validating • Machines validating
  • 20. Awareness – Twitter http://twitter.com
  • 21. Awareness – myexperiment http://myexperiment.org/
  • 22. Awareness – NARCIS http://narcis.nl/
  • 23. Awareness – eLabNoteBook RSS Feeds http://malaria.ourexperiment.org/feeds
  • 24. Awareness - Observations • Awareness for various types of objects • Real time awareness • Awareness through social media
  • 25. Archiving – CLOCKSS http://www.clockss.org/
  • 26. Archiving – DANS Easy http://easy.dans.knaw.nl/
  • 27. Archiving – Australian Antarctic Data Centre http://data.aad.gov.au/
  • 28. Archiving – perma.cc http://perma.cc
  • 29. Archiving – EU Trusted Digital Repositories http://trusteddigitalrepository.eu/Site/Welcome.html
  • 30. Archiving - Observations • Archiving/Archives for various types of objects • Distributed archives • Archival consortia • Audit for trustworthiness
  • 31. In This Talk 1. Functions of scholarly communication 2. Pointers to the future 3. Characterizing the future 4. Archiving the future
  • 32. The Future • Registration: wide variety of objects; versions of objects; interrelated, interdependent objects • Certification: variety of certification mechanisms; decoupled from / overlaid upon Registration • Awareness: real-time; social; variety of objects • Archiving …
  • 33. Characterizing the Future – Scholarly Communication
  • 34. Characterizing the Future – Communicated Objects
  • 35. In This Talk 1. Functions of scholarly communication 2. Pointers to the future 3. Characterizing the future 4. Archiving the future
  • 36. The Future – Core Observations • The research process, not just its outcome, is becoming visible … on the web • Massive extension of the scholarly record with an enormous variety of novel objects • The objects are heterogeneous, dynamic, compound, inter-related and distributed across the web • The objects are often hosted on common web platforms that are not dedicated to scholarship. The archival paradigm must take these characteristics into account.
  • 37. Considerations about Archiving • On the right track? • Capturing paradigms • Pockets of persistence • Recording versus Archiving • A perspective on scholarly infrastructure
  • 38. Considerations about Archiving • On the right track? • Capturing paradigms • Pockets of persistence • Recording versus Archiving • A perspective on scholarly infrastructure
  • 39. Web-Based Journal System – Links to Articles • Special-purpose archival solutions for articles • Rosenthal finds that what is archived is too few, too healthy, too easy • Attempts with the Keepers Registry to map out what is archived • Based on [ISSN, volume, issue], not on DOI, HTTP URI David Rosenthal (2013) Patio Perspectives at ANADP II: Preserving the Other Half http://blog.dshr.org/2013/11/patio-perspectives-at-anadp-ii.html
  • 40. Web-Based Journal System – Links to Articles Peter Burnhill (2014) Ensuring access to digital back copy http://www.cni.org/topics/digital-preservation/ensuring-access-to-digital-back-copy/
  • 41. Web-Based Journal System – Links to Web at Large Resources • Web archives contain snapshots, the result of incidental archiving • The Hiberlink project finds that for the large majority of these “Web at Large” resources, no temporally appropriate archived versions exist • Memento infrastructure allows auditing what is globally archived based on HTTP URI http://hiberlink.org
  • 42. Links Abstracted to Top Level Domain Targets Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found To appear in PLoS ONE on December 26 2014
  • 43. Loss of Current Context – Link Rot Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found To appear in PLoS ONE on December 26 2014
  • 44. Loss of Past Context – Archival Status (14 day window) Martin Klein, Herbert Van de Sompel et al. (2014) Scholarly context not found To appear in PLoS ONE on December 26 2014
  • 45. Considerations about Archiving • On the right track? • Capturing paradigms • Pockets of persistence • Recording versus Archiving • A perspective on scholarly infrastructure
  • 46. Perspective on “Repository” Capture Paradigm • Atomic object • Finalized object • Removal of context • Perspective on object: file in a file system • Capture request by owner of object • Capture time decided by owner of object
  • 47. Perspective on “Web” Capture Paradigm • Compound object (context essential) • Constituents of compound object in flux • Perspective on constituents: resources with URIs on the web • Capture request by user of the constituents, owned by self, owned by 3rd parties • Capture time decided by user of the constituents
  • 48. Considerations about Archiving • On the right track? • Capturing paradigms • Pockets of persistence • Recording versus Archiving • A perspective on scholarly infrastructure
  • 49. Creating Pockets of Persistence How to achieve the ability to persistently, precisely, and seamlessly revisit the Scholarly Web of the Past and of the Now at some point in the Future
  • 50. Creating Pockets of Persistence How to achieve the ability to persistently, precisely, and seamlessly revisit the Scholarly Web of the Past and of the Now at some point in the Future. This challenge exists for the entire web, but some communities actually care about addressing it: • scholarly communication, • legal publications, • journalism, • Wikipedia, • …
  • 51. Pro-Active Capture for a Seed Collection • Seed Collection - Starting point for capture is a seed collection of interest to communities that care, e.g. scholarly literature, legal documents, on-line journalism, Wikipedia articles • Lifecycle Events – Intervene at critical moments in the lifecycle of items in these collections to pro-actively capture (a) collection items, for which some solutions are in place, and (b) web resources referenced in collection items
  • 52. Pro-Active Capture for a Seed Collection • Request by user of A to capture A, B, C, D, E • Request for capture may result in • In-situ or remote capture • Creation of snapshot or creation of trace • Archival URI, capture datetime • Interoperability for on-demand capture • Orchestration of capture process
  • 53. Pro-Active Capture for Seed Collection • What those crucial lifecycle events are may depend on the collection type Wikipedia • Creation of new article • Creation of new version of article • Creation of substantially new version of article • Addition of external reference to article • References to article exceed a certain threshold Scholarly Literature
  • 54. Scholarly Literature: Experimental Zotero Extension Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero https://www.youtube.com/v/ZYmi_Ydr65M%26vq
  • 55. Scholarly Literature: Experimental HiberActive Service Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive
  • 56. Considerations about Archiving • On the right track? • Capturing paradigms • Pockets of persistence • Recording versus Archiving • A perspective on scholarly infrastructure
  • 57. Web Platforms for Scholarship • Increasingly, common web platforms are used for scholarship • GitHub, Wikis, Wordpress, etc. • Many of these platforms have desirable characteristics • Versioning • Time stamping • Social embedding • But, these platforms record rather than archive
  • 58. Recording is not Archiving “GitHub reserves the right at any time and from time to time to modify or discontinue, temporarily or permanently, the Service (or any part thereof) with or without notice.” “GitHub does not warrant that (i) the service will meet your specific requirements, (ii) the service will be uninterrupted, timely, secure, or error-free, (iii) the results that may be obtained from the use of the service will be accurate or reliable, (iv) the quality of any products, services, information, or other material purchased or obtained by you through the service will meet your expectations, and (v) any errors in the Service will be corrected.” GitHub Terms of Service http://help.github.com/articles/github-terms-of-service
  • 59. Recording versus Archiving • Recording: short-term; no guarantees provided; write many/read many; scholarly process • Archiving: longer-term; attempt to provide guarantees; write once/read many; scholarly record
  • 60. Considerations about Archiving • On the right track? • Capturing paradigms • Recording versus Archiving • A perspective on scholarly infrastructure
  • 61. (Slide without extracted text)
  • 62. Infrastructure Considerations • Various incentives to move objects from Private to Recording: • Share with self, team, comply with funder requirements • Objects in Recording are network accessible and in global (HTTP) namespace • Within reach of web-scale processes aimed at selectively moving them from Recording to Archiving • Core aspects of these processes include • Ability to snapshot the state of interlinked objects at specific moments in their lifecycle • Transfer of snapshots from Recording platforms to appropriate, distributed Archive platforms (interoperability) • Curatorial decisions regarding what should be captured
  • 63. Curatorial Considerations • What are the criteria involved in deciding (which states of) which objects get captured/archived? • What triggers transition from Recording to Archiving? • On-demand in lifecycle, social status of the object, reference made to object, deliberate randomness for serendipity, … • What to archive? • Snapshot of object or trace of object (metadata, provenance, …) ?
  • 64. Final Considerations • Need organizational, technical, and curatorial interfaces between Recording and Archiving platforms • Need organizational and technical interfaces across Archiving platforms
  • 65. Archiving the Evolving Scholarly Record: A Perspective Herbert Van de Sompel @hvdsomp Los Alamos National Laboratory Acknowledgments: Andrew Treloar, @atreloar, ANDS
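
Slides 41 and 44 refer to auditing, per HTTP URI, whether a "temporally appropriate" archived version exists within a 14-day window. The check can be sketched roughly as follows. This is an illustration under stated assumptions, not the Hiberlink implementation: the function names are hypothetical, the window is assumed to be symmetric around the reference date (the slides do not say), and the archived-version datetimes are passed in as a plain list rather than fetched from a Memento TimeMap. The `Accept-Datetime` value follows the HTTP-date format that RFC 7089 (Memento) uses for datetime negotiation.

```python
from datetime import datetime, timedelta, timezone


def accept_datetime(dt):
    """Format a timezone-aware datetime as an Accept-Datetime header value (HTTP-date in GMT)."""
    return dt.astimezone(timezone.utc).strftime("%a, %d %b %Y %H:%M:%S GMT")


def closest_memento(reference_dt, memento_dts):
    """Return the archived-version datetime nearest to reference_dt, or None if there are none."""
    return min(memento_dts, key=lambda m: abs(m - reference_dt), default=None)


def has_temporally_appropriate_memento(reference_dt, memento_dts, window_days=14):
    """True if some archived version lies within +/- window_days of reference_dt.

    Assumption: the window is symmetric around the reference date.
    """
    closest = closest_memento(reference_dt, memento_dts)
    if closest is None:
        return False
    return abs(closest - reference_dt) <= timedelta(days=window_days)


if __name__ == "__main__":
    # Hypothetical data: a reference made on the date of this talk,
    # and two snapshots of the referenced resource.
    referenced_on = datetime(2014, 12, 10, tzinfo=timezone.utc)
    snapshots = [
        datetime(2014, 11, 1, tzinfo=timezone.utc),
        datetime(2014, 12, 20, tzinfo=timezone.utc),
    ]
    print(accept_datetime(referenced_on))
    print(has_temporally_appropriate_memento(referenced_on, snapshots))
```

In practice the list of snapshot datetimes would come from the archives themselves, and the reference date from the citing article's publication metadata; both are stubbed here so the sketch runs without network access.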