SlideShare ist ein Scribd-Unternehmen logo
1 von 58
Downloaden Sie, um offline zu lesen
Digital preservation
caring for our data to foster
 knowledge discovery and
       dissemination

     Claudia Bauzer Medeiros
      Institute of Computing
             UNICAMP
Pre-Saervare
 (Before) – (Save)
= save before disappears
Maintain
    Manu-tenere

= being able to get/find it
Dec 2008




Feb 2010
Data deluge
• At end of 2011 – info created and replicated > 1.8 zettabytes

• 90% data created in the last 2 years

• 5 hour flight – 240 Tbytes

• Facebook – 200 million users, >70 languages

• Each person in England is filmed 300 times/day

• Teenagers in the US send average 110 phone text messages a day

=> We need to build arks during the deluge - PRESERVATION
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
WHY PRESERVE
• Costly to produce

• Contribute to progress of science

• Intrinsic value
  culture/science/sustainability
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
The Domesday Project 1086-1986
• Digital decay
• Equipment obsolescence
• Software obsolescence
Domesday reloaded
Fonoteca
Neotropical
Jacques
Vieillard
Outline
• Why preserve?

• What to preserve?
• How to preserve?

And associated challenges
What to preserve?
• Data

• BUT what is “data”?



• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
  – How produced it – workflows, devices,
    methodologies, materials and methods,
    reasonings, logs --- provenance
What to preserve?
• Data
• Environment in which was produced

• Data needed to preserve occupies more space
  than the data itself
• Preservation means storing more than object
  itself
What about our research data?
               (slide adapted from Jim Gray)
Experiments
 Instruments

  Files                           Questions

  Papers                          Answers

   Simulations
          Models


             DATA



Data-driven science                    “Collaboratory”


                                                         23/10000
Data sources?
    Table of Product Characteristics
   id        Property name Value
 MilkProd     productsrep     MilkA
 MilkProd       quantity      10000
 MilkProd     validity date 10/06/2006
CheeseProd productsr          Minas
CheeseProd    epquantity      2000
CheeseProd validity date 12/02/2006
CheeseProd      shape        Circular




                                                         24/10000
eEnvironmental Science
• Direct and indirect observations




                                     25/10000
Data sources




               26/10000
27/10000
We are
 DATASCOPE
 engineers


Software is the
      device/tool
Outline
• Why preserve?
• What to preserve?

• How to preserve?

And associated challenges
How to preserve?

How to construct the ark during the
             deluge?

Presaervare, Manutenere and Share
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures, metadata,standards
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay
  – PEOPLE DECAY
• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
Sharing and open access

NSF Data Management Policy

 Paper and data publication
Sharing of Data Leads to Progress on Alzheimer’s
                                        By GINA KOLATA
                                   Published: August 12, 2010
                                      = NEW YORK TIMES

In 2003, a group of scientists and executives from the National Institutes of Health, the Food and
Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups
joined in a project that experts say had no precedent: a collaborative effort to find the biological
          markers that show the progression of Alzheimer’s disease in the human brain.



   share all the data, making every single
  finding public immediately, available to
 anyone with a computer anywhere in the
                    world
        => AVAILABILITY and REUSE
• Data must be properly curated throughout its
  life-cycle and released with the appropriate
  high-quality metadata.
• Medical Research Council UK




                                           40/10000
• Research data should be made available for
  use by other researchers. Researchers must
  retain research data, including electronic data,
  in a durable, indexed and retrievable form.
• Australian Govnmt National Health and
  Medical Research Council



                                              41/10000
Microsoft Academic Search
40M publications
19M authors
75 publishers (Wiley, Springer, ACM, IEEE …)




                                               42/10000
Google Scholar Citations




                      43/10000
• Citing data is as important as citing papers
• For researchers, publishers, data centers
• Over 1M DOI, several major national research
  libraries
  – Germany, France, Korea, Netherlands, Australia,
    USA...
• Present manager – German National Library of
  Science and Technology

                                                 44/10000
Publish on the Cloud
Add metadata
Pre-print sharing




                       45/10000
FNJV
       proj.lis.ic.unicamp.br/fnjv
• Sharing by publishing on the Web
• Retrievability by extending metadata




                                         46/10000
CURATION AND USE OF STANDARDS
Workflows and model preservation
Workflows and model preservation
         Comb-e-Chem
                   Video
                                                    Simulation

                                                                 Properties

                           Analysis
  Diffractometer




                                           Structures
                                           Database




X-Ray                                                                   Properties
e-Lab                                                                   e-Lab

                                      Grid Middleware

                                                                          52/10000
The cloud and CAP
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
PRE-SAVE and MANU-TENERE
Outline
• Why preserve?
  – Costly to produce (hardware, software, peopleware)
  – Contribute to progress of science
  – Value – culture, science, sustainability
• What to preserve?
  – Data [WHAT IS DATA?]
  – Context of production and use
• How to preserve?
  – Accessibility and sharing – standards, metadata,
    ontologies
  – Integrity and quality – context to use (hw, sw),
    standards
References
•




             56/10000
References
NSF – CISE Data management policy
The Domesday Project
http://www.atsf.co.uk/dottext/domesday.html
The CLARIN Project (languages)
Eigenfactor.org
Altmetrics movement
Thank you!!!!

Weitere ähnliche Inhalte

Was ist angesagt?

Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaHans Herrmann
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Chris Rusbridge
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudOpenStack
 

Was ist angesagt? (8)

Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in Canada
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research Cloud
 

Andere mochten auch

Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...ariadnenetwork
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesZapproved
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataLancaster University Library
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationPRELIDA Project
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationsmtcd
 
Digital preservation
Digital preservationDigital preservation
Digital preservationSarika Sawant
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource ManagementAshit Jain
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource managementTanuj Poddar
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt Babasab Patil
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Managementgumbhir singh
 

Andere mochten auch (13)

Data preservation
Data preservationData preservation
Data preservation
 
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best Practices
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research Data
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital Preservation
 
Data preservation 101
Data preservation 101Data preservation 101
Data preservation 101
 
Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Digital preservation
Digital preservationDigital preservation
Digital preservation
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource Management
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource management
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Management
 

Ähnlich wie Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemASIS&T
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...Ardan Patwardhan
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and SharingJisc
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3guru122
 

Ähnlich wie Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination (20)

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management Ecosystem
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Big Data
Big Data Big Data
Big Data
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 

Mehr von Beniamino Murgante

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesBeniamino Murgante
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesBeniamino Murgante
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesBeniamino Murgante
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Beniamino Murgante
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Beniamino Murgante
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityBeniamino Murgante
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Beniamino Murgante
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...Beniamino Murgante
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triesteBeniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Beniamino Murgante
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Beniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Beniamino Murgante
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challengeBeniamino Murgante
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...Beniamino Murgante
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Beniamino Murgante
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessBeniamino Murgante
 

Mehr von Beniamino Murgante (20)

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable cities
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the Cities
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processes
 
Smart City or Urban Science?
Smart City or Urban Science?Smart City or Urban Science?
Smart City or Urban Science?
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing Walkability
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of trieste
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challenge
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
 
Murgante smart energy
Murgante smart energyMurgante smart energy
Murgante smart energy
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, Smartness
 

Kürzlich hochgeladen

ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 

Kürzlich hochgeladen (20)

Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 

Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination

  • 1. Digital preservation caring for our data to foster knowledge discovery and dissemination Claudia Bauzer Medeiros Institute of Computing UNICAMP
  • 2. Pre-Saervare (Before) – (Save) = save before disappears
  • 3. Maintain Manu-tenere = being able to get/find it
  • 4.
  • 6. Data deluge • At end of 2011 – info created and replicated > 1.8 zettabytes • 90% data created in the last 2 years • 5 hour flight – 240 Tbytes • Facebook – 200 million users, >70 languages • Each person in England is filmed 300 times/day • Teenagers in the US send average 110 phone text messages a day => We need to build arks during the deluge - PRESERVATION
  • 7. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 8. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 9. WHY PRESERVE • Costly to produce • Contribute to progress of science • Intrinsic value culture/science/sustainability
  • 10. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 11. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 12. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 13. The Domesday Project 1086-1986 • Digital decay • Equipment obsolescence • Software obsolescence
  • 16.
  • 17.
  • 18. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 19. What to preserve? • Data • BUT what is “data”? • Only data?
  • 20. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data?
  • 21. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data? – How produced it – workflows, devices, methodologies, materials and methods, reasonings, logs --- provenance
  • 22. What to preserve? • Data • Environment in which was produced • Data needed to preserve occupies more space than the data itself • Preservation means storing more than object itself
  • 23. What about our research data? (slide adapted from Jim Gray) Experiments Instruments Files Questions Papers Answers Simulations Models DATA Data-driven science “Collaboratory” 23/10000
  • 24. Data sources? Table of Product Characteristics id Property name Value MilkProd productsrep MilkA MilkProd quantity 10000 MilkProd validity date 10/06/2006 CheeseProd productsr Minas CheeseProd epquantity 2000 CheeseProd validity date 12/02/2006 CheeseProd shape Circular 24/10000
  • 25. eEnvironmental Science • Direct and indirect observations 25/10000
  • 26. Data sources 26/10000
  • 28. We are DATASCOPE engineers Software is the device/tool
  • 29. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 30. How to preserve? How to construct the ark during the deluge? Presaervare, Manutenere and Share
  • 31. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 32. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 33. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 34. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures, metadata,standards • To afford maintenance costs – Cloud? CAP theorem?
  • 35. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 36. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay – PEOPLE DECAY • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 37. Sharing and open access NSF Data Management Policy Paper and data publication
  • 38.
  • 39. Sharing of Data Leads to Progress on Alzheimer’s By GINA KOLATA Published: August 12, 2010 = NEW YORK TIMES In 2003, a group of scientists and executives from the National Institutes of Health, the Food and Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups joined in a project that experts say had no precedent: a collaborative effort to find the biological markers that show the progression of Alzheimer’s disease in the human brain. share all the data, making every single finding public immediately, available to anyone with a computer anywhere in the world => AVAILABILITY and REUSE
  • 40. • Data must be properly curated throughout its life-cycle and released with the appropriate high-quality metadata. • Medical Research Council UK 40/10000
  • 41. • Research data should be made available for use by other researchers. Researchers must retain research data, including electronic data, in a durable, indexed and retrievable form. • Australian Govnmt National Health and Medical Research Council 41/10000
  • 42. Microsoft Academic Search 40M publications 19M authors 75 publishers (Wiley, Springer, ACM, IEEE …) 42/10000
  • 44. • Citing data is as important as citing papers • For researchers, publishers, data centers • Over 1M DOI, several major national research libraries – Germany, France, Korea, Netherlands, Australia, USA... • Present manager – German National Library of Science and Technology 44/10000
  • 45. Publish on the Cloud Add metadata Pre-print sharing 45/10000
  • 46. FNJV proj.lis.ic.unicamp.br/fnjv • Sharing by publishing on the Web • Retrievability by extending metadata 46/10000
  • 47.
  • 48.
  • 49. CURATION AND USE OF STANDARDS
  • 50. Workflows and model preservation
  • 51.
  • 52. Workflows and model preservation Comb-e-Chem Video Simulation Properties Analysis Diffractometer Structures Database X-Ray Properties e-Lab e-Lab Grid Middleware 52/10000
  • 54. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges PRE-SAVE and MANU-TENERE
  • 55. Outline • Why preserve? – Costly to produce (hardware, software, peopleware) – Contribute to progress of science – Value – culture, science, sustainability • What to preserve? – Data [WHAT IS DATA?] – Context of production and use • How to preserve? – Accessibility and sharing – standards, metadata, ontologies – Integrity and quality – context to use (hw, sw), standards
  • 56. References • 56/10000
  • 57. References NSF – CISE Data management policy The Domesday Project http://www.atsf.co.uk/dottext/domesday.html The CLARIN Project (languages) Eigenfactor.org Altmetrics movement