FISH.Link
JISC MRD Workshop, March 2011
Questions
1. What’s the additional value of publishing linked data (as
   opposed to triplifying and/or providing open data)?
2. How do we support the production/annotation of datasets
   from small providers?
3. How can we support non-SPARQL experts in writing
   SPARQL queries/getting information out of triple stores?



                                                    http://www.flickr.com/photos/-bast-/349497988/
FISH.Link
•    Investigating Linked Data approaches to support data sharing
     in freshwater biology community
    Will high altitude water bodies provide a refuge for species under
    pressure from invasive species taking advantage of changes in
    climate?

•    Integration with the FISH.Net repository
    •   FISH.Net “Traffic Light” System
        characterising metadata states
    •   FISH.Link at Green+: readiness
        for LD publication



                                                     http://www.flickr.com/photos/amulligan/103460333
FISH.Link
•   This is not primarily about scale
•   Data sets aren’t huge.
•   Small, hand-crafted idiosyncratic data sets.
•   FISHNet carrots/sticks
    •   Motivations: why publish LD (as opposed to consuming it)?




                                             http://www.flickr.com/photos/brapps/3368998420
Initial Experiments
•   Paper: Aquatic Plant Diversity
    Jones, Li, Maberly
•   Extracted initial set of queries supporting
    research question
•   Hand mapping existing datasets to support
    initial queries
•   D2R, Jena, some scripting
•   SPARQL queries generating “wide tables” of data for analysis
•   Science/analysis still left in the hands of the experts




                                                              http://www.flickr.com/photos/fabelfroh/
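The "wide tables" mentioned above can be illustrated with a minimal sketch. It assumes query results have already been flattened into (water body, variable, value) rows. The row values and the names `rows` and `to_wide_table` are hypothetical and do not come from the FISH.Link code.

```python
from collections import defaultdict

# Hypothetical long-format rows, shaped like flattened SPARQL SELECT bindings:
# (water body, variable, value). All names and values are illustrative only.
rows = [
    ("tarn_a", "altitude_m", 450.0),
    ("tarn_a", "area_ha", 1.2),
    ("tarn_b", "altitude_m", 610.0),
    ("tarn_b", "alkalinity", 85.0),
]


def to_wide_table(rows):
    """Pivot (subject, variable, value) rows into one row per subject."""
    # One column per distinct variable, in a stable (sorted) order.
    columns = sorted({variable for _, variable, _ in rows})
    by_subject = defaultdict(dict)
    for subject, variable, value in rows:
        by_subject[subject][variable] = value
    header = ["water_body"] + columns
    # Missing measurements become None rather than being silently dropped.
    table = [
        [subject] + [values.get(column) for column in columns]
        for subject, values in sorted(by_subject.items())
    ]
    return header, table


header, table = to_wide_table(rows)
```

Gaps are kept as `None` so that the expert doing the analysis can decide how to treat missing measurements.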
Initial Experiments
•   Queries extracted from the paper Aquatic Plant Diversity (Jones, Li, Maberly):
1. Range of altitude
2. Area of water body
3. Range of alkalinity
4. Covariation of ion concentration with altitude
5. Correlation of ion strength with alkalinity
6. Correlation of area and altitude
7. Correlation of depth and area
8. Correlation of area with presence of inflows
9. Correlation of area with number of inflows
10. Correlation of area with presence of out-flows
11. Correlation of area with concentration of nitrate
12. Correlation of altitude with presence of inflows
13. Correlation of altitude with number of inflows
14. Correlation of altitude with presence of out-flows
15. Correlation of altitude with distance to nearest body of standing water
16. Any measured variable as predictor of species richness
17. Altitude at which species occur, its range correlated with altitude
18. Frequency of occurrence of species and altitudinal range size
19. Mid-point of altitudinal range size
20. Number of observations
21. Attribute group per water body correlated with area
22. Attribute group correlated with altitude
23. Mean number of species per attribute group correlated with area and then altitude
24. Correlation of each variable with number of attribute groups per water body
25. Correlation of each variable with number of species per water body

                                                              http://www.flickr.com/photos/fabelfroh/
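As a rough illustration of how one of the listed analyses (correlation of area and altitude) might be run once a wide table is available, here is a minimal sketch. The slides leave the analysis to the domain experts, so this is not their method. The column names `area_ha` and `altitude_m`, the helper names, and the sample values are all hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / math.sqrt(var_x * var_y)


def correlate_columns(table, x_name, y_name):
    """Correlate two columns of a wide table, skipping rows with gaps."""
    pairs = [
        (row[x_name], row[y_name])
        for row in table
        if row.get(x_name) is not None and row.get(y_name) is not None
    ]
    return pearson([x for x, _ in pairs], [y for _, y in pairs])


# Hypothetical wide-table rows; values are made up for illustration.
wide_table = [
    {"water_body": "tarn_a", "area_ha": 1.0, "altitude_m": 100.0},
    {"water_body": "tarn_b", "area_ha": 2.0, "altitude_m": 200.0},
    {"water_body": "tarn_c", "area_ha": 3.0, "altitude_m": 300.0},
    {"water_body": "tarn_d", "area_ha": None, "altitude_m": 400.0},
]

r = correlate_columns(wide_table, "area_ha", "altitude_m")
```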
Initial Data Sets
•   Major ions, Lakes & Tarns, 1982    
•   Major ions, Duddon & Windermere, 1982    
•   Cumbria Tarns/Stokoe Tarns    
•   Ferry House/Windermere Level &
    Temp Data
•   Willby Species Groupings    
•   Meteorological Data
•   RIVPACS    
•   BIOSYS
Difficult Aspects
•   Column Names
•   Multiple names for the same things
    •   Alkalinity/ALK_1/Average_Alk
•   Similar names for different things
    •   ALK_1/ALK_2
•   Multiple values/dirty data
    •   Altitude
    •   Area
•   Miscellaneous
    •   “Nearest Water”
    •   Tarns vs Tarn Groups

Tools to assist transition from FISHNet Orange to Green

                                                    http://www.flickr.com/photos/guitarfish/
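The column-name problem above could be handled with an explicit, hand-maintained mapping, in line with the hand mapping described earlier. The sketch below is one possible shape, not the project's tooling. The mapping contents and the names `CANONICAL_NAMES` and `normalise_columns` are hypothetical. Columns such as ALK_2, whose meaning differs despite the similar name, are deliberately left unmapped and reported rather than guessed.

```python
# Hypothetical mapping from source column names (lower-cased) to one
# canonical name. Deciding which columns really mean the same thing is a
# job for the domain experts; this table only records their decision.
CANONICAL_NAMES = {
    "alkalinity": "alkalinity",
    "alk_1": "alkalinity",
    "average_alk": "alkalinity",
}


def normalise_columns(columns, mapping=CANONICAL_NAMES):
    """Return (renamed, unmapped) for a list of source column names.

    renamed maps each recognised source name to its canonical name.
    unmapped lists names that need a human decision.
    """
    renamed = {}
    unmapped = []
    for name in columns:
        key = name.strip().lower()
        if key in mapping:
            renamed[name] = mapping[key]
        else:
            unmapped.append(name)
    return renamed, unmapped


renamed, unmapped = normalise_columns(["ALK_1", "ALK_2", "Average_Alk"])
```

Reporting unmapped columns instead of matching on similarity avoids merging ALK_1 and ALK_2 by accident.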
Observations
•   Datasets are largely about observations
    •   Provenance: who, what, where, when, how
    •   Associated Actors/Organisations
    •   Associated Locations
    •   Methodologies/Protocols
•   Derived observations
    •   5 measurements, average recorded
•   Models for investigations
    •   ISA: Investigation/Study/Assay for
        managing experimental metadata
    •   Top-down vs bottom-up
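One way to picture an observation with its provenance, and a derived observation that records what it was computed from, is the sketch below. It is not the ISA model and not the FISH.Link schema. The class, the field names, and the sample values are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Observation:
    """A single observation with who/what/where/when/how provenance."""
    what: str            # measured variable, e.g. "alkalinity"
    value: float
    who: str             # actor or organisation
    where: str           # location
    when: str            # date, kept as text in this sketch
    how: str             # methodology or protocol
    derived_from: tuple = ()  # source observations, empty if measured directly


def average_observation(measurements):
    """Derive one observation as the mean of several direct measurements."""
    if not measurements:
        raise ValueError("at least one measurement is required")
    first = measurements[0]
    if any(m.what != first.what for m in measurements):
        raise ValueError("all measurements must be of the same variable")
    return Observation(
        what=first.what,
        value=mean(m.value for m in measurements),
        who=first.who,
        where=first.where,
        when=first.when,
        how=f"mean of {len(measurements)} measurements",
        derived_from=tuple(measurements),
    )


# Five hypothetical measurements, with the average recorded as a derived value.
readings = [
    Observation("alkalinity", v, "surveyor_1", "tarn_a", "1982-06-01", "titration")
    for v in (1.0, 2.0, 3.0, 4.0, 5.0)
]
averaged = average_observation(readings)
```

Keeping `derived_from` means the recorded average can still be traced back to the individual measurements.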
Where are we?
•   Datasets in triple store
•   SPARQL queries pulling out data tables allowing recreation
    of analysis
    •   Plus analysis using different source data (e.g. 2010 rather
        than 2003)
    •   How to produce/write these queries?


•   Benefits of integration via RDF
•   Benefits of data publication
•   But... Linked Data?
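On the question of how such queries might be produced, one possible approach is to generate them from a list of variable names so that non-experts never write SPARQL by hand. The sketch below only builds the query text. The namespace `http://example.org/fishlink#`, the class `ex:WaterBody`, and the property names are hypothetical placeholders, not the project's vocabulary.

```python
def build_wide_query(variables, namespace="http://example.org/fishlink#"):
    """Build a SPARQL SELECT returning one column per requested variable.

    Each variable is wrapped in OPTIONAL so that water bodies with missing
    measurements still appear in the result.
    """
    for name in variables:
        # Only allow plain identifiers, so user input cannot alter the query.
        if not name.isidentifier():
            raise ValueError(f"invalid variable name: {name!r}")
    lines = [
        f"PREFIX ex: <{namespace}>",
        "SELECT ?waterBody " + " ".join(f"?{name}" for name in variables),
        "WHERE {",
        "  ?waterBody a ex:WaterBody .",
    ]
    for name in variables:
        lines.append(f"  OPTIONAL {{ ?waterBody ex:{name} ?{name} . }}")
    lines.append("}")
    return "\n".join(lines)


query = build_wide_query(["altitude", "area"])
```

The resulting string could then be sent to a SPARQL endpoint with whichever client the triple store supports.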
Questions
1. What’s the additional value of publishing linked data (as
   opposed to triplifying and/or providing open data)?
2. How do we support the production/annotation of datasets
   from small providers?
3. How can we support non-SPARQL experts in writing
   SPARQL queries/getting information out of triple stores?



                                                    http://www.flickr.com/photos/-bast-/349497988/
Commercial Break...
