ResourceSync - An Introduction



                                                    Todd Carpenter
                                           Executive Director, NISO
                      ALCTS Continuing Resources Standards Forum
                                            Sunday, June 24, 2012
 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
@TAC_NISO Twitter Highlights
•   Presenting this morning on the ResourceSync project at ALCTS Continuing Resources Standards
    Forum #ALCTSCRS #ala12

•   I’m pre-tweeting my slides during #rsync presentation. Slides here:  _________ #ala12

•   NISO mission is to develop and maintain technical standards related to information, documentation,
    discovery and distribution of content #ala12

•   Standards are all around us, even if we don't notice them, especially in books.  Things like page
    numbers, paper, binding, even spelling is standardized. #NISO #ala12

•   Machines don’t talk like people do.  Then again some people don’t talk like other people do,
    particularly teenagers #ala12

•   So where did the ResourceSync project start?  #NISO approached OAI about updating the PMH
    protocol. #ala12

•   The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P.
    Sloan Foundation.  Thank you!  #ala12

•   What is RSync trying to solve: Source Server has resources that change.  Destination servers want to
    leverage some or all of Source on regular ongoing basis in near-real-time & at web scale. #ala12

•   Synchronization can be good enough or perfect, and it can be fast or fast enough.  #ala12

•   RSync is studying a number of existing protocols to determine which protocol (or combination of
    protocols) can best meet needs.  We have a bias against developing a new spec from scratch. #ala12

•   There are several models for synchronizing content: pull, push, conditional pull, mediated feed and pull,
    and a mix of feed/push/pull/service models. #ala12

•   The goal of ResourceSync is to find the model that most efficiently distributes the content, while
    limiting the tax on the source system. #ala12

•   This is very early days in the process of standards development. We’re still in the incubation stage.
    Consensus and adoption phases will come in 2013 and beyond. #ala12

•   We hope to have a beta specification of ResourceSync available by the end of 2012 #ala12
About
Non-profit industry trade association
accredited by ANSI
Mission of developing and maintaining technical
standards related to information,
documentation, discovery and distribution of
published materials and media
Volunteer driven organization: 400+ spread out
across the world
Standards are familiar, even if you don’t notice




June 23, 2012              ALCTS CRS Standards IG - ALA Annual 2012
Machines don’t talk like people do




Machines talk like this




How did we get here?
• OAI-PMH Protocol
    – Developed in 200X
    – Developed by Herbert van de Sompel, Carl Lagoze and the OAI team
    – Fairly wide adoption in scholarly community
• In spring 2011, NISO approached OAI to discuss updating PMH Protocol
• Response was “Let’s try something else more in line with more modern technology”
A partnership is born
•        Agreement to launch RSync as a
          NISO standards process
•        Partnership on grant application
•        OAI team comprised core
          technology team
Special thanks are due to...




What are we trying to solve?
   Consideration:
   Source (server) A has resources that change over time: they get
      created, modified, deleted, moved, …
   Destination (servers) X, Y, and Z leverage (some) resources of
      Source A.
Problem:
   Destinations want to keep in step with the resource changes at
      Source A: resource synchronization.

Task of ResourceSync effort:
   Design an approach for resource synchronization aligned with the
       Web Architecture that has a fair chance of adoption by different
       communities.
        The approach must scale better than recurrent HTTP HEAD/GET
          on resources.

Use cases differ
    How good is the synchronization? Perfect or good enough.
    How fast is the synchronization? Fast or fast enough.


3 distinct needs regarding resource synchronization

       Baseline matching: An approach to allow a Destination that wants
       to start synchronizing with a Source to perform an initial catch up
       – Dump.

       Incremental resource synchronization: An approach to allow a
       Destination to remain up-to-date regarding changes at the
       Source.

       Audit: An approach to allow checking whether a Destination is in
       sync with a Source – Inventory.

       => All 3 are considered in scope for the effort
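
The three needs can be illustrated with a toy in-memory model. This is only a sketch: representing a Source and a Destination as dictionaries that map a URI to its content, and the helper names `inventory`, `baseline_sync`, and `audit`, are assumptions made for illustration and are not taken from any ResourceSync specification.

```python
import hashlib


def inventory(resources):
    """Map each URI to a digest of its content (a hypothetical 'Inventory')."""
    return {uri: hashlib.sha256(body).hexdigest() for uri, body in resources.items()}


def baseline_sync(source):
    """Initial catch-up: copy everything from the Source (a hypothetical 'Dump')."""
    return dict(source)


def audit(source, destination):
    """Compare inventories and report what differs at the Destination."""
    src, dst = inventory(source), inventory(destination)
    return {
        "missing": sorted(set(src) - set(dst)),
        "stale": sorted(u for u in src.keys() & dst.keys() if src[u] != dst[u]),
        "extra": sorted(set(dst) - set(src)),
    }


source = {"/a": b"alpha", "/b": b"beta"}
destination = baseline_sync(source)

# The Source changes after the baseline copy was taken.
source["/b"] = b"beta v2"
source["/c"] = b"gamma"
del source["/a"]

report = audit(source, destination)
print(report)
```

Incremental synchronization is what would keep `report` empty between audits; here it is deliberately left out so the audit has something to find.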


ResourceSync Working Group
Herbert Van de Sompel (Chair), Los Alamos National Laboratory
Todd Carpenter (Co-Chair), National Information Standards Organization (NISO)
Nettie Lagace, National Information Standards Organization (NISO)
Manuel Bernhardt, Delving B.V.
Kevin Ford, Library of Congress
Bernhard Haslhofer, Cornell University
Richard Jones, Joint Information Systems Committee (JISC)
Martin Klein, Los Alamos National Laboratory
Graham Klyne, Joint Information Systems Committee (JISC)
Carl Lagoze, Cornell University
Stuart Lewis, Joint Information Systems Committee (JISC)
Peter Murray, Lyrasis
Michael Nelson, Old Dominion University
David Rosenthal, Stanford University
Christian Sadilek, Red Hat
Shlomo Sanders, Ex Libris, Inc.
Robert Sanderson, Los Alamos National Laboratory
Sjoerd Siebinga, Delving B.V.
Ed Summers, Library of Congress
Simeon Warner, Cornell University
Jeff Young, OCLC Online Computer Library Center
http://imgs.xkcd.com/comics/standards.png/

8/23/11              Data Attribution and Citation Workshop
Change Notification - Protocols
             Atom PubSubHubbub (PuSH)
          XMPP
             PubSub extension
             BoSH (XMPP over HTTP)
          Comet / HTTP Streaming
             Open an HTTP connection and keep reading from it
             Bayeux Protocol
          Long Polling
             Keep HTTP connection open until a message, then reopen
             BoSH, Bayeux option
          WebSockets
             NullMQ / ZeroMQ
             XMPP over WebSockets?



Incremental Synchronization

          Change Notification (CN)
                 Alert that something happened (create, update, delete)

          Content Transfer (CT)
                 Transfer of just the change or the full resource
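
The split between notification and transfer can be sketched as a small data structure plus a function that applies it. Everything here is hypothetical: the `ChangeNotification` shape, the `apply_change` helper, and the choice to transfer the full resource rather than just the change are assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ChangeNotification:
    """Alert that something happened to a resource (hypothetical shape)."""
    uri: str
    kind: str  # "create", "update", or "delete"


def apply_change(destination: dict, note: ChangeNotification,
                 content: Optional[bytes] = None) -> None:
    """Apply one notification; `content` is the transferred full resource."""
    if note.kind == "delete":
        destination.pop(note.uri, None)
    elif note.kind in ("create", "update"):
        if content is None:
            raise ValueError("create/update needs a content transfer")
        destination[note.uri] = content
    else:
        raise ValueError(f"unknown change kind: {note.kind}")


destination = {"/a": b"old"}
apply_change(destination, ChangeNotification("/a", "update"), b"new")
apply_change(destination, ChangeNotification("/b", "create"), b"hello")
apply_change(destination, ChangeNotification("/a", "delete"))
print(destination)
```

Keeping the notification free of content means the same alert could be paired with either kind of transfer, a delta or the whole resource.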


Trivial versus Optimal Approaches
• Trivial Approach - Retrieve & Compare

• Optimal Approach - push only the change to only the
       destinations monitoring the resource
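
A rough sketch of why Retrieve & Compare is costly, assuming a Source and a Destination modelled as in-memory dictionaries (the function name and the request counter are illustrative only): every known resource is fetched on every pass, however few have changed.

```python
def retrieve_and_compare(source, destination):
    """Fetch every known resource and overwrite the local copy if it differs.

    Returns (requests_made, resources_updated).
    """
    requests_made = updated = 0
    for uri in list(destination):
        requests_made += 1  # one simulated GET per resource, changed or not
        body = source.get(uri)
        if body is None:
            del destination[uri]  # resource disappeared at the Source
            updated += 1
        elif body != destination[uri]:
            destination[uri] = body
            updated += 1
    return requests_made, updated


source = {f"/r{i}": b"v1" for i in range(100)}
destination = dict(source)
source["/r7"] = b"v2"  # a single change at the Source

stats = retrieve_and_compare(source, destination)
print(stats)
```

In this toy run one change costs a hundred requests; the optimal approach would have sent a single message to the destinations that monitor `/r7`.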




More advanced options
• Trivial Approach plus Conditional GET:
    – Retrieve every resource if it has changed
    – Essentially this is a Change Notification Pull
    – Not scalable, strain on Source Systems, no way to
      know of new resources
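
The Conditional GET variant can be sketched without a network by simulating the ETag check that HTTP servers perform for `If-None-Match`. The `Source` class and `poll` function are hypothetical stand-ins, not a real client or server API.

```python
import hashlib


class Source:
    """Toy stand-in for an HTTP server that supports ETag validation."""

    def __init__(self, resources):
        self.resources = resources

    def get(self, uri, if_none_match=None):
        """Return (status, etag, body), mimicking a conditional GET."""
        body = self.resources.get(uri)
        if body is None:
            return 404, None, None
        etag = hashlib.sha256(body).hexdigest()
        if if_none_match == etag:
            return 304, etag, None  # Not Modified: no body is sent
        return 200, etag, body


def poll(source, cache):
    """Re-check every cached URI; `cache` maps uri -> (etag, body)."""
    bodies_transferred = 0
    for uri, (etag, _) in list(cache.items()):
        status, new_etag, body = source.get(uri, if_none_match=etag)
        if status == 200:
            cache[uri] = (new_etag, body)
            bodies_transferred += 1
        elif status == 404:
            del cache[uri]
    return bodies_transferred


src = Source({"/a": b"one", "/b": b"two"})
cache = {uri: src.get(uri)[1:] for uri in src.resources}

src.resources["/b"] = b"two, revised"
src.resources["/c"] = b"three"  # never discovered by polling known URIs

transferred = poll(src, cache)
print(transferred, sorted(cache))
```

Only the changed body is transferred, but the Source still answers one request per cached URI on every pass, and `/c` stays invisible to the Destination.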




More advanced options
      Simplest Workable Model:
         Introduce a Feed of change notifications for all
         resources
         Atom, RSS, OAI-PMH, SiteMaps, etc
         => Still not efficient, no way to know when to pull
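
A minimal sketch of the feed model, assuming a feed is an ordered list of entries with a sequence number, a URI, and a change kind (this entry layout and the `pull` helper are invented for illustration and do not follow Atom, RSS, or any other listed format):

```python
def changes_since(feed, checkpoint):
    """Return feed entries with a sequence number above `checkpoint`."""
    return [entry for entry in feed if entry["seq"] > checkpoint]


def pull(feed, source, destination, checkpoint):
    """Apply unseen feed entries in order; return the new checkpoint."""
    for entry in changes_since(feed, checkpoint):
        uri = entry["uri"]
        if entry["kind"] == "delete" or uri not in source:
            destination.pop(uri, None)
        else:  # "create" or "update": fetch the current representation
            destination[uri] = source[uri]
        checkpoint = entry["seq"]
    return checkpoint


source = {"/a": b"a2", "/c": b"c1"}
feed = [
    {"seq": 1, "uri": "/a", "kind": "update"},
    {"seq": 2, "uri": "/b", "kind": "delete"},
    {"seq": 3, "uri": "/c", "kind": "create"},
]
destination = {"/a": b"a1", "/b": b"b1"}

checkpoint = pull(feed, source, destination, checkpoint=0)
print(checkpoint, destination)
```

The feed tells the Destination what changed, including new resources, but nothing tells it when to call `pull`, so it still has to poll on a schedule.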




More advanced options
      Feed Extension Solution:
      Continue the Feed paradigm, but introduce
      aggregating service and ping notification to re-pull
      (simulated push)
      Only advantageous if Source already supports a Feed
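
The ping-then-re-pull pattern can be sketched with an in-process hub. The `Hub` and `Destination` classes, the callback mechanism, and the `source.example` URL are all assumptions for illustration; a real aggregating service would deliver pings over the network.

```python
class Hub:
    """Toy aggregating service: relays 'feed changed' pings to subscribers."""

    def __init__(self):
        self.subscribers = {}  # feed URL -> list of callbacks

    def subscribe(self, feed_url, callback):
        self.subscribers.setdefault(feed_url, []).append(callback)

    def ping(self, feed_url):
        """Called by the Source after it updates its feed."""
        for callback in self.subscribers.get(feed_url, []):
            callback(feed_url)


class Destination:
    def __init__(self, feeds):
        self.feeds = feeds  # shared dict standing in for HTTP access
        self.seen = []      # entries pulled so far
        self.pulls = 0

    def on_ping(self, feed_url):
        """Re-pull the feed only when told it changed."""
        self.pulls += 1
        self.seen = list(self.feeds[feed_url])


FEED = "http://source.example/changes"  # hypothetical feed URL
feeds = {FEED: []}
hub = Hub()
dest = Destination(feeds)
hub.subscribe(FEED, dest.on_ping)

feeds[FEED].append("update /a")
hub.ping(FEED)
print(dest.pulls, dest.seen)
```

The Destination pulls exactly once per ping instead of polling on a timer, which is the "simulated push" the slide refers to.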




The lifecycle of standards

                 You are here




Ongoing Research
• Change Notification - XMPP & XMPP PubSub & bleeps
    – LANL
    – Ongoing Experiment with Live DBPedia
• Change Notification - Comet / HTTP Streaming & bleeps
    – ODU
    – Bayeux Protocol via Faye Implementation
• Change Notification - Change Simulator
    – Cornell U
    – Generate configurable change notifications
    – Use as standardized input to different systems for testing
• Baseline Matching & Audit
    – Cornell U

Timeline
• Project Launch = November 2011
• Approved work item = December 2011
• Working Group formed = February 2012
• Webinar on project = March 2012
• JCDL meeting, Washington DC = June 2012
• Alpha = ?? September 2012
• Beta/Draft for trial use = ?? December 2012
• Comment period = ?? December 2012 - March 2013
• Training = ?? May - July 2013
• Approval = ?? December 2013

Thank you!



                                      Todd Carpenter, Executive Director
                                                   tcarpenter@niso.org



          National Information Standards Organization (NISO)
          One North Charles Street, Suite 1905
          Baltimore, MD 21201 USA
          +1 (301) 654-2512                     NOTE => NISO IS MOVING IN JULY 2012 <=
          www.niso.org



 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 

ResourceSync - An Introduction

  • 1. ResourceSync - An Introduction Todd Carpenter Executive Director, NISO ALCTS Continuing Resources Standards Forum Sunday, June 24, 2012 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
  • 2. @TAC_NISO Twitter Highlights • Presenting this morning on the ResourceSync project at ALCTS Continuing Resources Standards Forum #ALCTSCRS #ala12 • I’m pre-tweeting my slides during #rsync presentation. Slides here: _________ #ala12 • NISO's mission is to develop and maintain technical standards related to information, documentation, discovery and distribution of content #ala12 • Standards are all around us, even if we don't notice them, especially in books. Things like page numbers, paper, binding, even spelling are standardized. #NISO #ala12 • Machines don’t talk like people do. Then again, some people don’t talk like other people do, particularly teenagers #ala12 • So where did the ResourceSync project start? #NISO approached OAI about updating the PMH protocol. #ala12 • The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P. Sloan Foundation. Thank you! #ala12 • What is RSync trying to solve: Source Server has resources that change. Destination servers want to leverage some or all of Source on a regular, ongoing basis in near-real-time & at web scale. #ala12 • Synchronization can be good enough or perfect, and synchronization can be fast or fast enough. #ala12 • RSync is studying a number of existing protocols to determine which protocol (or combination of protocols) can best meet needs. We have a bias against developing a new spec from scratch. #ala12 • There are several models for synchronizing content: pull, push, conditional pull, mediated feed and pull, and a mix of feed/push/pull/service models. #ala12 • The goal of ResourceSync is to find the model that most efficiently distributes the content, while limiting the tax on the source system. #ala12 • This is very early days in the process of standards development. We’re still in the incubation stage. Consensus and adoption phases will come in 2013 and beyond. #ala12 • We hope to have a beta specification of ResourceSync available by the end of 2012 #ala12
  • 3. About Non-profit industry trade association accredited by ANSI Mission of developing and maintaining technical standards related to information, documentation, discovery and distribution of published materials and media Volunteer driven organization: 400+ spread out across the world
  • 4. Standards are familiar, even if you don’t notice June 23, 2012 ALCTS CRS Standards IG - ALA Annual 2012
  • 5. Machines don’t talk like people do
  • 6. Machines talk like this
  • 7. How did we get here? • OAI-PMH Protocol – Developed in 200X – Developed by Herbert van de Sompel, Carl Lagoze and the OAI team – Fairly wide adoption in scholarly community • In spring 2011, NISO approached OAI to discuss updating PMH Protocol • Response was “Let’s try something else more in line with more modern technology”
  • 8. A partnership is born • Agreement to launch RSync as a NISO standards process • Partnership on grant application • OAI team comprised core technology team
  • 9. Special thanks are due to...
  • 10. What are we trying to solve? Consideration: Source (server) A has resources that change over time: they get created, modified, deleted, moved, … Destination (servers) X, Y, and Z leverage (some) resources of Source A. Problem: Destinations want to keep in step with the resource changes at Source A: resource synchronization. Task of ResourceSync effort: Design an approach for resource synchronization aligned with the Web Architecture that has a fair chance of adoption by different communities. The approach must scale better than recurrent HTTP HEAD/GET on resources.
  • 11. Use cases differ • How good is the synchronization? Perfect or good enough • How fast is the synchronization? Fast or fast enough
  • 12. 3 distinct needs regarding resource synchronization • Baseline matching: An approach to allow a Destination that wants to start synchronizing with a Source to perform an initial catch up – Dump. • Incremental resource synchronization: An approach to allow a Destination to remain up-to-date regarding changes at the Source. • Audit: An approach to allow checking whether a Destination is in sync with a Source – Inventory. => All 3 are considered in scope for the effort
  • 13. ResourceSync Working Group • Herbert Van de Sompel (Chair), Los Alamos National Laboratory • Stuart Lewis, Joint Information Systems Committee (JISC) • Todd Carpenter (Co-Chair), National Information Standards Organization (NISO) • Peter Murray, Lyrasis • Nettie Lagace, National Information Standards Organization (NISO) • Michael Nelson, Old Dominion University • David Rosenthal, Stanford University • Manuel Bernhardt, Delving B.V. • Christian Sadilek, Red Hat • Kevin Ford, Library of Congress • Shlomo Sanders, Ex Libris, Inc. • Bernhard Haslhofer, Cornell University • Robert Sanderson, Los Alamos National Laboratory • Richard Jones, Joint Information Systems Committee (JISC) • Sjoerd Siebinga, Delving B.V. • Martin Klein, Los Alamos National Laboratory • Ed Summers, Library of Congress • Graham Klyne, Joint Information Systems Committee (JISC) • Simeon Warner, Cornell University • Carl Lagoze, Cornell University • Jeff Young, OCLC Online Computer Library Center
  • 14. http://imgs.xkcd.com/comics/standards.png/ 8/23/11 Data Attribution and Citation Workshop
  • 15. Change Notification - Protocols Atom PubSubHubbub (PuSH) XMPP PubSub extension BoSH (XMPP over HTTP) Comet / HTTP Streaming Open an HTTP connection and keep reading from it Bayeux Protocol Long Polling Keep HTTP connection open until a message, then reopen BoSH, Bayeux option WebSockets NullMQ / ZeroMQ XMPP over WebSockets?
  • 16. Incremental Synchronization • Change Notification (CN): Alert that something happened (create, update, delete) • Content Transfer (CT): Transfer of just the change or the full resource
  • 17. Trivial versus Optimal Approaches • Trivial Approach - Retrieve & Compare • Optimal Approach - push only the change to only the destinations monitoring the resource
  • 18. More advanced options • Trivial Approach plus Conditional GET: – Retrieve every resource if it has changed – Essentially this is a Change Notification Pull – Not scalable, strain on Source Systems, no way to know of new resources
  • 19. More advanced options • Simplest Workable Model: Introduce a Feed of change notifications for all resources – Atom, RSS, OAI-PMH, SiteMaps, etc. => Still not efficient, no way to know when to pull
  • 20. More advanced options • Feed Extension Solution: Continue the Feed paradigm, but introduce aggregating service and ping notification to re-pull (simulated push) – Only advantageous if Source already supports a Feed
  • 21. The lifecycle of standards: You are here
  • 22. Ongoing Research • Change Notification - XMPP & XMPP PubSub & bleeps – LANL – Ongoing Experiment with Live DBPedia • Change Notification - Comet / HTTP Streaming & bleeps – ODU – Bayeux Protocol via Faye Implementation • Change Notification - Change Simulator – Cornell U – Generate configurable change notifications – Use as standardized input to different systems for testing • Baseline Matching & Audit – Cornell U
  • 23. Timeline • Project Launch = November 2011 • Approved work item = December 2011 • Working Group formed = February 2012 • Webinar on project = March 2012 • JCDL meeting, Washington DC = June 2012 • Alpha = ?? September 2012 • Beta/Draft for trial use = ?? December 2012 • Comment period = ?? December 2012 - March 2013 • Training = ?? May - July 2013 • Approval = ?? December 2013
  • 24. Thank you! Todd Carpenter, Executive Director tcarpenter@niso.org National Information Standards Organization (NISO) One North Charles Street, Suite 1905 Baltimore, MD 21201 USA +1 (301) 654-2512 NOTE => NISO IS MOVING IN JULY 2012 <= www.niso.org
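
The three needs on slide 12 (baseline matching, incremental synchronization, audit) and the change notification / content transfer split on slide 16 can be illustrated with a small in-memory sketch. This is not the ResourceSync specification, which was still in incubation at the time of the talk. The names `Change`, `baseline`, `apply_changes`, `inventory` and `audit` are hypothetical, resources are modelled as a plain dictionary from URI to bytes, and the use of SHA-256 digests for the inventory is an assumption made for illustration.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    """Hypothetical change notification: what happened, and to which URI."""
    kind: str            # "create", "update" or "delete"
    uri: str
    content: bytes = b""  # content transfer: here, always the full resource


def baseline(source: dict) -> dict:
    """Baseline matching: copy every resource once (the 'dump')."""
    return dict(source)


def apply_changes(destination: dict, changes: list) -> None:
    """Incremental synchronization: replay change notifications in order."""
    for change in changes:
        if change.kind == "delete":
            destination.pop(change.uri, None)
        elif change.kind in ("create", "update"):
            destination[change.uri] = change.content
        else:
            raise ValueError(f"unknown change kind: {change.kind}")


def inventory(store: dict) -> dict:
    """Inventory: map each URI to a digest of its content (SHA-256 assumed)."""
    return {uri: hashlib.sha256(body).hexdigest() for uri, body in store.items()}


def audit(source_inventory: dict, destination: dict) -> set:
    """Audit: return the URIs on which Source and Destination disagree."""
    dest_inventory = inventory(destination)
    all_uris = source_inventory.keys() | dest_inventory.keys()
    return {
        uri for uri in all_uris
        if source_inventory.get(uri) != dest_inventory.get(uri)
    }


if __name__ == "__main__":
    source = {"/a": b"one", "/b": b"two"}
    destination = baseline(source)

    # The Source changes: one update, one delete, one create.
    source["/a"] = b"uno"
    del source["/b"]
    source["/c"] = b"three"
    print("out of sync:", sorted(audit(inventory(source), destination)))

    apply_changes(destination, [
        Change("update", "/a", b"uno"),
        Change("delete", "/b"),
        Change("create", "/c", b"three"),
    ])
    print("out of sync after replay:", sorted(audit(inventory(source), destination)))
```

In this sketch the audit compares digests rather than full content, so a Destination only needs the Source's inventory, not a second dump, to find out what it is missing. Whether changes are pulled from a feed or pushed to the Destination is left open, as it is in the slides.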