SlideShare ist ein Scribd-Unternehmen logo
1 von 28
REPOSITORIES FOR SCIENTIFIC DATA
An #animalgarden show
Peter Murray-Rust,
OKFN and University of Cambridge
Chuff OWL
Moomin
AMI
Gulliver
Sleepless
cleanTux
UncleSam
I’m AMI studying
biodiversity. I
compute
phylogenetic trees
Only 4% of computed trees are saved
I’m in a
pear tree.
Where can I put
my data?
Institutional repos
don’t work, we’ve tried
WE NEED DOMAIN
REPOSITORIES FOR
SCIENCE
So how do you
manage data?
We’re BIG DATA
at NASA
We hire data
experts
But I’m a LONG-TAIL scientist!
Australia have a
national data
service (ANDS)
We could use
their TARDIS*
Let’s ask the
crystallographers.
They save their
data
I want to
publish this
paper
You MUST send ALL
the data. The IUCr will
check if it’s correct
It takes
years to
create
vocabularies
Core dictionary (coreCIF) version 2.4.3
_diffrn_ambient_temperature
Definition: The mean temperature in kelvins at
which the intensities were measured.
Range: 0.0 -> infinity Type: numb
ID
For
humans
For machines:
Constraint + type
We need domain vocabularies through
inter/national efforts
PMRgroup also
built a crystal
structure repo
(Crystaleye)
It’s got
200,000
entries
But none from
Elsevier, Wiley,
Springer
And NONE of
the results are
archived
Computational Materials
scientists costs 1,000
Million USD / year
PMR wrote software to
turn FORTRAN into
XML
PMR and others have
started a global effort
to create
vocabularies
It’s hard and slow
work
PMR group built
compchem repository
Chempound XML RDF
NoSQL SPARQL
Is PMR making
progress?
Hoping to work with
Obama’s 500 M
USD “materials
genome”
WE NEED DOMAIN
REPOSITORIES FOR
BIODIVERSITY
We could use
Figshare
As long as it’s
Open
Or OKFN’s
CKAN
And we can
also do theses!
PMR and Ross
Mounce will index the
whole of published
bioscience!
5 years of JISC
projects helped
We’re going to index
SPECIES, PLACES,
DATES
I’m a baby Buddleja
Davidii
OKFN Chuff!
I’m an Okapi
balloonii
WE NEED DOMAIN
REPOSITORIES FOR SCIENCE
Wake up,
nearly finished
PechaKucha i
knackering
Chuff
REPOSITORIES FOR SCIENTIFIC DATA
An #animalgarden show
Peter Murray-Rust,
OKFN and University of Cambridge
WE NEED
DOMAIN
REPOSITORIES
FOR SCIENCE
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust

Weitere ähnliche Inhalte

Was ist angesagt?

Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationExtract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationLars Juhl Jensen
 
Why we should (urgently) sequence the genomes of our biodiversity ?
Why we should (urgently) sequence the genomes of our biodiversity ?Why we should (urgently) sequence the genomes of our biodiversity ?
Why we should (urgently) sequence the genomes of our biodiversity ?Alberto Dávila
 
Thermoregulation Mechanisms and Grazing Behaviors of Dairy Goats
Thermoregulation Mechanisms and Grazing Behaviors of Dairy GoatsThermoregulation Mechanisms and Grazing Behaviors of Dairy Goats
Thermoregulation Mechanisms and Grazing Behaviors of Dairy GoatsConferenceproceedings
 
Karl kjer : A collegiate career at rutgers university
Karl kjer : A collegiate career at rutgers universityKarl kjer : A collegiate career at rutgers university
Karl kjer : A collegiate career at rutgers universityKarl Kjer
 
CRISPR-Cas9: The new frontier of Genome Engineering
CRISPR-Cas9: The new frontier of Genome EngineeringCRISPR-Cas9: The new frontier of Genome Engineering
CRISPR-Cas9: The new frontier of Genome EngineeringSt Xaviers
 
Dna editing as easy as cut paste will transform life as we know it
Dna editing as easy as cut paste will transform life as we know itDna editing as easy as cut paste will transform life as we know it
Dna editing as easy as cut paste will transform life as we know itOther Mother
 

Was ist angesagt? (6)

Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationExtract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotation
 
Why we should (urgently) sequence the genomes of our biodiversity ?
Why we should (urgently) sequence the genomes of our biodiversity ?Why we should (urgently) sequence the genomes of our biodiversity ?
Why we should (urgently) sequence the genomes of our biodiversity ?
 
Thermoregulation Mechanisms and Grazing Behaviors of Dairy Goats
Thermoregulation Mechanisms and Grazing Behaviors of Dairy GoatsThermoregulation Mechanisms and Grazing Behaviors of Dairy Goats
Thermoregulation Mechanisms and Grazing Behaviors of Dairy Goats
 
Karl kjer : A collegiate career at rutgers university
Karl kjer : A collegiate career at rutgers universityKarl kjer : A collegiate career at rutgers university
Karl kjer : A collegiate career at rutgers university
 
CRISPR-Cas9: The new frontier of Genome Engineering
CRISPR-Cas9: The new frontier of Genome EngineeringCRISPR-Cas9: The new frontier of Genome Engineering
CRISPR-Cas9: The new frontier of Genome Engineering
 
Dna editing as easy as cut paste will transform life as we know it
Dna editing as easy as cut paste will transform life as we know itDna editing as easy as cut paste will transform life as we know it
Dna editing as easy as cut paste will transform life as we know it
 

Andere mochten auch

Enrica Menozzi | Chi ha detto che per creare App serve programmare?
Enrica Menozzi | Chi ha detto che per creare App serve programmare?Enrica Menozzi | Chi ha detto che per creare App serve programmare?
Enrica Menozzi | Chi ha detto che per creare App serve programmare?Donne Digitali
 
Maredata 20161005
Maredata 20161005Maredata 20161005
Maredata 20161005maredata
 
Il nostro expertise nel Fashion & Luxury
Il nostro expertise nel Fashion & LuxuryIl nostro expertise nel Fashion & Luxury
Il nostro expertise nel Fashion & LuxuryKelly Services Italia
 
Famous people archive
Famous people archiveFamous people archive
Famous people archiveCloptonChurch
 
OIDC16: Open Data in Belgium
OIDC16: Open Data in BelgiumOIDC16: Open Data in Belgium
OIDC16: Open Data in BelgiumBart Hanssens
 
kintone Cafe Tokyo vol.5/ultra fast recovery
kintone Cafe Tokyo vol.5/ultra fast recoverykintone Cafe Tokyo vol.5/ultra fast recovery
kintone Cafe Tokyo vol.5/ultra fast recoveryTakahiro Kubo
 
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...Donne Digitali
 

Andere mochten auch (20)

Enrica Menozzi | Chi ha detto che per creare App serve programmare?
Enrica Menozzi | Chi ha detto che per creare App serve programmare?Enrica Menozzi | Chi ha detto che per creare App serve programmare?
Enrica Menozzi | Chi ha detto che per creare App serve programmare?
 
Rome
RomeRome
Rome
 
Ashley's Veterans' Issues Presentation, 7th
Ashley's Veterans' Issues Presentation, 7thAshley's Veterans' Issues Presentation, 7th
Ashley's Veterans' Issues Presentation, 7th
 
Manors archive
Manors archiveManors archive
Manors archive
 
Loren's Veterans' Issues Presentation, 7th period
Loren's Veterans' Issues Presentation, 7th periodLoren's Veterans' Issues Presentation, 7th period
Loren's Veterans' Issues Presentation, 7th period
 
Jonathan's Veterans' Issues Presentation, 7th period
Jonathan's Veterans' Issues Presentation, 7th periodJonathan's Veterans' Issues Presentation, 7th period
Jonathan's Veterans' Issues Presentation, 7th period
 
#FIWAREPamplona Aporta IODC16 Open Data
#FIWAREPamplona Aporta IODC16 Open Data#FIWAREPamplona Aporta IODC16 Open Data
#FIWAREPamplona Aporta IODC16 Open Data
 
畢氏定理
畢氏定理畢氏定理
畢氏定理
 
Clasificación de la calidad de las revistas científicas de educación española...
Clasificación de la calidad de las revistas científicas de educación española...Clasificación de la calidad de las revistas científicas de educación española...
Clasificación de la calidad de las revistas científicas de educación española...
 
El proceso de creación de una nueva revista científica: incertidumbres, refle...
El proceso de creación de una nueva revista científica: incertidumbres, refle...El proceso de creación de una nueva revista científica: incertidumbres, refle...
El proceso de creación de una nueva revista científica: incertidumbres, refle...
 
PNA: una revista con bajo presupuesto que busca mejorar su calidad. María-C. ...
PNA: una revista con bajo presupuesto que busca mejorar su calidad. María-C. ...PNA: una revista con bajo presupuesto que busca mejorar su calidad. María-C. ...
PNA: una revista con bajo presupuesto que busca mejorar su calidad. María-C. ...
 
JConsole with OpenIDM
JConsole with OpenIDMJConsole with OpenIDM
JConsole with OpenIDM
 
Maredata 20161005
Maredata 20161005Maredata 20161005
Maredata 20161005
 
Houses archive
Houses archiveHouses archive
Houses archive
 
Local content policies in the mining sector: Approaches and lessons learnt
Local content policies in the mining sector:  Approaches and lessons learntLocal content policies in the mining sector:  Approaches and lessons learnt
Local content policies in the mining sector: Approaches and lessons learnt
 
Il nostro expertise nel Fashion & Luxury
Il nostro expertise nel Fashion & LuxuryIl nostro expertise nel Fashion & Luxury
Il nostro expertise nel Fashion & Luxury
 
Famous people archive
Famous people archiveFamous people archive
Famous people archive
 
OIDC16: Open Data in Belgium
OIDC16: Open Data in BelgiumOIDC16: Open Data in Belgium
OIDC16: Open Data in Belgium
 
kintone Cafe Tokyo vol.5/ultra fast recovery
kintone Cafe Tokyo vol.5/ultra fast recoverykintone Cafe Tokyo vol.5/ultra fast recovery
kintone Cafe Tokyo vol.5/ultra fast recovery
 
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...
Vanessa Bertoni | Come la rete può diventare uno strumento di crescita person...
 

Ähnlich wie Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust

Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Neuroscience Information Framework
 
ORiGAMI: Oak Ridge Graph Analytics for Medical Innovation
ORiGAMI: Oak Ridge Graph Analytics for Medical InnovationORiGAMI: Oak Ridge Graph Analytics for Medical Innovation
ORiGAMI: Oak Ridge Graph Analytics for Medical Innovationinside-BigData.com
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK Cyndy Parr
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework Neuroscience Information Framework
 
General ausplots school
General ausplots schoolGeneral ausplots school
General ausplots schoolbensparrowau
 
Nanotechnology
NanotechnologyNanotechnology
NanotechnologyRudy Garns
 
Animal telemetry, Ross Dwyer ACEAS Grand 2014
Animal telemetry, Ross Dwyer ACEAS Grand 2014Animal telemetry, Ross Dwyer ACEAS Grand 2014
Animal telemetry, Ross Dwyer ACEAS Grand 2014aceas13tern
 
1 05 questionsandconclusions. (1)
1 05 questionsandconclusions. (1)1 05 questionsandconclusions. (1)
1 05 questionsandconclusions. (1)Dr. John
 
CRISPR PROJECT.pptx
CRISPR PROJECT.pptxCRISPR PROJECT.pptx
CRISPR PROJECT.pptxAcSni
 
Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing bensparrowau
 
Brain bits® Overview
Brain bits® OverviewBrain bits® Overview
Brain bits® OverviewPeter Roberts
 
Miguel Foronda T3chfest
Miguel Foronda T3chfestMiguel Foronda T3chfest
Miguel Foronda T3chfestMiguel Foronda
 
Spatial Learning by mice on a 3D task 3
Spatial Learning by mice on a 3D task 3Spatial Learning by mice on a 3D task 3
Spatial Learning by mice on a 3D task 3Benjamin James
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animalsMorganScience
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animalsMorganScience
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animalsMorganScience
 
Group 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtGroup 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtJessica Kabigting
 

Ähnlich wie Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust (20)

Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
 
ORiGAMI: Oak Ridge Graph Analytics for Medical Innovation
ORiGAMI: Oak Ridge Graph Analytics for Medical InnovationORiGAMI: Oak Ridge Graph Analytics for Medical Innovation
ORiGAMI: Oak Ridge Graph Analytics for Medical Innovation
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
Sweden_eemis_big_data
Sweden_eemis_big_dataSweden_eemis_big_data
Sweden_eemis_big_data
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework
 
General ausplots school
General ausplots schoolGeneral ausplots school
General ausplots school
 
Nanotechnology
NanotechnologyNanotechnology
Nanotechnology
 
Animal telemetry, Ross Dwyer ACEAS Grand 2014
Animal telemetry, Ross Dwyer ACEAS Grand 2014Animal telemetry, Ross Dwyer ACEAS Grand 2014
Animal telemetry, Ross Dwyer ACEAS Grand 2014
 
1 05 questionsandconclusions. (1)
1 05 questionsandconclusions. (1)1 05 questionsandconclusions. (1)
1 05 questionsandconclusions. (1)
 
CRISPR PROJECT.pptx
CRISPR PROJECT.pptxCRISPR PROJECT.pptx
CRISPR PROJECT.pptx
 
Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing
 
Brain bits® Overview
Brain bits® OverviewBrain bits® Overview
Brain bits® Overview
 
Miguel Foronda T3chfest
Miguel Foronda T3chfestMiguel Foronda T3chfest
Miguel Foronda T3chfest
 
Spatial Learning by mice on a 3D task 3
Spatial Learning by mice on a 3D task 3Spatial Learning by mice on a 3D task 3
Spatial Learning by mice on a 3D task 3
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animals
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animals
 
Why we should clone extinct animals
Why we should clone extinct animalsWhy we should clone extinct animals
Why we should clone extinct animals
 
Group 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtGroup 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & Envt
 

Mehr von Repository Fringe

Unlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonUnlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonRepository Fringe
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Repository Fringe
 
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonOpen Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonRepository Fringe
 
Repositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepository Fringe
 
RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015Repository Fringe
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Repository Fringe
 
Jisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaJisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaRepository Fringe
 
IRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockIRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockRepository Fringe
 
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieImpact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieRepository Fringe
 
Open Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineOpen Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineRepository Fringe
 
SHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardSHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardRepository Fringe
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doingRepository Fringe
 
Linking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreLinking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreRepository Fringe
 
Linking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiLinking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiRepository Fringe
 
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...Repository Fringe
 
Latest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullLatest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullRepository Fringe
 
ArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghRepository Fringe
 

Mehr von Repository Fringe (20)

Unlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonUnlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East London
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...
 
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonOpen Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
 
Repositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNicholl
 
RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
 
Jisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaJisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela Duca
 
IRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockIRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo Alcock
 
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieImpact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
 
Open Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineOpen Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, Contentmine
 
SHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardSHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill Hubbard
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doing
 
RCUK - what Jisc is doing
RCUK - what Jisc is doingRCUK - what Jisc is doing
RCUK - what Jisc is doing
 
Linking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreLinking Software: citations, roles, references and more
Linking Software: citations, roles, references and more
 
Jisc Publications Router
Jisc Publications RouterJisc Publications Router
Jisc Publications Router
 
Linking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiLinking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel Kotarski
 
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
 
Latest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullLatest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of Hull
 
ArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of Edinburgh
 

Kürzlich hochgeladen

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 

Kürzlich hochgeladen (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 

Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust

  • 1. REPOSITORIES FOR SCIENTIFIC DATA An #animalgarden show Peter Murray-Rust, OKFN and University of Cambridge Chuff OWL Moomin AMI Gulliver Sleepless cleanTux UncleSam
  • 2. I’m AMI studying biodiversity. I compute phylogenetic trees Only 4% of computed trees are saved I’m in a pear tree.
  • 3. Where can I put my data? Institutional repos don’t work, we’ve tried WE NEED DOMAIN REPOSITORIES FOR SCIENCE
  • 4. So how do you manage data? We’re BIG DATA at NASA We hire data experts
  • 5. But I’m a LONG-TAIL scientist!
  • 6. Australia have a national data service (ANDS) We could use their TARDIS* Let’s ask the crystallographers. They save their data
  • 7. I want to publish this paper You MUST send ALL the data. The IUCr will check if it’s correct
  • 8. It takes years to create vocabularies Core dictionary (coreCIF) version 2.4.3 _diffrn_ambient_temperature Definition: The mean temperature in kelvins at which the intensities were measured. Range: 0.0 -> infinity Type: numb ID For humans For machines: Constraint + type We need domain vocabularies through inter/national efforts
  • 9. PMRgroup also built a crystal structure repo (Crystaleye) It’s got 200,000 entries But none from Elsevier, Wiley, Springer
  • 10. And NONE of the results are archived Computational Materials scientists costs 1,000 Million USD / year PMR wrote software to turn FORTRAN into XML
  • 11. PMR and others have started a global effort to create vocabularies It’s hard and slow work PMR group built compchem repository Chempound XML RDF NoSQL SPARQL
  • 12. Is PMR making progress? Hoping to work with Obama’s 500 M USD “materials genome”
  • 13. WE NEED DOMAIN REPOSITORIES FOR BIODIVERSITY
  • 14. We could use Figshare As long as it’s Open Or OKFN’s CKAN
  • 15. And we can also do theses! PMR and Ross Mounce will index the whole of published bioscience! 5 years of JISC projects helped
  • 16. We’re going to index SPECIES, PLACES, DATES I’m a baby Buddleja Davidii
  • 17. OKFN Chuff! I’m an Okapi balloonii
  • 18. WE NEED DOMAIN REPOSITORIES FOR SCIENCE Wake up, nearly finished PechaKucha i knackering
  • 19. Chuff REPOSITORIES FOR SCIENTIFIC DATA An #animalgarden show Peter Murray-Rust, OKFN and University of Cambridge WE NEED DOMAIN REPOSITORIES FOR SCIENCE