Metsger RDAP11 Policy-based Data Management

  • 1. Data Conservancy: Data Conservancy embraces a shared vision: scientific data curation is a means to collect, organize, validate and preserve data so that scientists can find new ways to address the grand research challenges that face society. ASIS&T RDAP Summit, April 1, 2011. Elliot Metsger (emetsger@jhu.edu)
  • 2. Principles of Navigation: Flexibility; Modularity; Openness
  • 3. Architecture: Open Archival Information System Functional Entities; Data Conservancy Service Architecture Block Diagram
  • 4. Policy Framework: Policy management and enforcement must be properly modeled; understand the policy framework interactions with other components of the system; build proper abstractions; support inclusion of associated policies when transferring objects among archives; support services over data which apply policies
  • 5. (Some) Motivating Use Cases: Embargo; Logging; Authentication and Authorization; Privacy controls; Obfuscating certain data (geo-locations of endangered species, personally identifiable information). Issues: granularity of policy application; obfuscation without reducing data utility (“fuzzing” algorithms)
  • 6. Implementation: Design and implementation in Year 3 (August 2011 – July 2012), in collaboration with: other DataNets; DC partners (e.g. NSIDC); existing organizations (Federation of Earth Science Information Partners)
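
Slides 4 and 5 above name embargoes, obfuscation of geo-locations, and the inclusion of policies when objects move between archives. The sketch below illustrates how those three ideas could fit together. It is not the Data Conservancy policy framework, which the talk says had not yet been designed: every class, field, and function name here is hypothetical, and rounding coordinates is a simple stand-in for the “fuzzing” algorithms the slides mention.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Policy:
    """Hypothetical policy record attached to an archived object."""
    embargo_until: Optional[date] = None        # no public access before this date
    coordinate_decimals: Optional[int] = None   # coarsen lat/lon to this many decimals


@dataclass
class ArchivedObject:
    """Hypothetical archived observation with a location."""
    identifier: str
    latitude: float
    longitude: float
    policy: Policy = field(default_factory=Policy)


def is_accessible(obj: ArchivedObject, today: date) -> bool:
    """An object is accessible once its embargo date has been reached."""
    until = obj.policy.embargo_until
    return until is None or today >= until


def public_view(obj: ArchivedObject, today: date) -> dict:
    """Return the publicly visible fields, applying embargo and obfuscation."""
    if not is_accessible(obj, today):
        raise PermissionError(f"{obj.identifier} is under embargo")
    lat, lon = obj.latitude, obj.longitude
    decimals = obj.policy.coordinate_decimals
    if decimals is not None:
        # Assumption: rounding stands in for a real obfuscation algorithm.
        lat, lon = round(lat, decimals), round(lon, decimals)
    return {"identifier": obj.identifier, "latitude": lat, "longitude": lon}


def export_package(obj: ArchivedObject) -> dict:
    """Bundle an object with its policy so the policy travels with the data."""
    until = obj.policy.embargo_until
    return {
        "object": {
            "identifier": obj.identifier,
            "latitude": obj.latitude,
            "longitude": obj.longitude,
        },
        "policy": {
            "embargo_until": until.isoformat() if until else None,
            "coordinate_decimals": obj.policy.coordinate_decimals,
        },
    }
```

The granularity issue raised on slide 5 shows up even in a sketch this small: the policy here applies to a whole object, whereas a real framework might need to apply it per field or per record. Rounding also trades away precision uniformly, which is exactly the loss of utility the slide warns about.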

Editor's notes

  1. I don’t intend this to be a talk about the Data Conservancy (DC), but because it has been my life for the past 18 months, it sets the context of this talk. So I’ll provide some brief context about the Data Conservancy and its architecture and design, and then move on to the policy aspects. DC is funded by the NSF through the DataNet program out of OCI, and we are in our 19th month. The Data Conservancy is building infrastructure that will provide curation, preservation, and access to scientific data. The Data Conservancy Service, or DCS, is the technical manifestation of this infrastructure. We do not envision the DCS as a singular instance of a monolithic system, but as a blueprint for a system made up of modular services that can be followed by those who choose to do so.
  2. Because we are simultaneously developing a system, exploring research problems, and managing a user requirements process, the DCS infrastructure needs to be flexible enough to accommodate user needs and research outcomes. Modularity has been a focal point of DCS design. Each element of the DCS infrastructure must be abstracted at the proper level. This ensures the correctness and completeness of the system, and allows concrete implementations to be adapted to changing user needs and research outcomes. By providing public APIs and minimizing dependencies between system components, we provide an open technical environment where the DCS can adapt and evolve in desirable ways: interoperability with other infrastructure, technical sustainability (for example, a storage plugin leveraging more cost-effective storage), and evolution of DCS modules (for example, adding ingest pipeline components). Where possible we intend to “prove” abstractions by providing multiple concrete implementations. For example, we have two different implementations of our archival storage API: one file-system based, the other object-based using Fedora. We have also been careful to differentiate between archival storage and bit storage. In addition to the technical benefits, this design principle has facilitated collaboration with other DataNet awardees. Finally, the DCS infrastructure must be open. A closed system is a non-starter: it is at odds with providing long-term preservation and access to data. These principles are not only applied immediately; they are forward thinking and ensure the technical sustainability of the DCS, and of the data managed within it, for years to come.
  3. The DCS architecture has been influenced and guided by the OAIS reference model. As you can see on this figure, OAIS functional concepts are realized in various DCS modules. Not every DCS module directly maps to an OAIS functional concept.
  4. Adhering to our principles of navigation, policy management and enforcement must be properly modeled: we need to understand the interactions with other system components and build the proper abstractions. We believe it will be a requirement to transfer data between archival systems, including “policy-encumbered” data. We plan on supporting the inclusion of associated policies when transferring objects among archives, as well as when storing objects in our local archive. Audit will be one mechanism used to ensure that remote archives are able to enforce the policy. Of course, we also plan to support services over data which apply policies.
  5. Embargos. Logging (access, downloads). Authentication and authorization. Privacy controls (e.g. user must contact producer for a copy of the data). Deliberate obfuscation of certain data (geo-locations of endangered species, personally identifiable information). Issues: granularity of the policy application; obfuscating data without reducing the utility of the data (“fuzzing” algorithms).
  6. Policy framework implementation has not yet begun. It is scheduled for year 3, which starts in Aug. 2011. We plan to design and implement our policy framework in collaboration with: other DataNets (we feel the need for broad interoperability beyond just the DataNets, in both a disciplinary and an interdisciplinary sense); DC partners like NSIDC; and existing and evolving frameworks in the earth sciences.
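
Note 2 above describes “proving” an abstraction by providing more than one concrete implementation of the archival storage API. A minimal sketch of that idea follows. The interface and class names are hypothetical and are not the DCS API; the in-memory class only stands in for an object-based backend and makes no Fedora calls.

```python
from abc import ABC, abstractmethod
from pathlib import Path
import tempfile


class ArchivalStore(ABC):
    """Hypothetical archival storage interface (not the actual DCS API)."""

    @abstractmethod
    def put(self, identifier: str, content: bytes) -> None:
        """Store content under the given identifier."""

    @abstractmethod
    def get(self, identifier: str) -> bytes:
        """Return the content stored under the given identifier."""


class FileSystemStore(ArchivalStore):
    """File-system based implementation: one file per identifier."""

    def __init__(self, root: Path) -> None:
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)

    def put(self, identifier: str, content: bytes) -> None:
        # Assumes identifiers are simple names without path separators.
        (self.root / identifier).write_bytes(content)

    def get(self, identifier: str) -> bytes:
        return (self.root / identifier).read_bytes()


class InMemoryObjectStore(ArchivalStore):
    """Stand-in for an object-based backend; keeps objects in a dict."""

    def __init__(self) -> None:
        self._objects: dict[str, bytes] = {}

    def put(self, identifier: str, content: bytes) -> None:
        self._objects[identifier] = content

    def get(self, identifier: str) -> bytes:
        return self._objects[identifier]


def round_trip(store: ArchivalStore, identifier: str, content: bytes) -> bool:
    """Client code depends only on the interface, not on a backend."""
    store.put(identifier, content)
    return store.get(identifier) == content


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for store in (FileSystemStore(Path(tmp)), InMemoryObjectStore()):
            print(type(store).__name__, round_trip(store, "obj-1", b"data"))
```

Because `round_trip` accepts any `ArchivalStore`, swapping in a cheaper or different backend requires no change to calling code, which is the kind of technical sustainability the note attributes to the storage plugin approach.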