SlideShare a Scribd company logo
1 of 17
PROV and Real Things
LAND AND WATER
Simon J D Cox & Nicholas J Car
4th December 2015
The problem
preparation workflow on a
specimen’s journey from
collection to analysis is complex,
and variable
its description must be available
in order to evaluate or
reproduce observations
this workflow may link in to
other business workflows like
data publication
PROV and real things | Cox & Car2 |
Real workflows
PROV and real things | Cox & Car3 |
Proposal:
1. Describe specimen preparation using a generic process model
(i.e. input-process-output)
2. Use W3C PROV as the generic process model
PROV and real things | Cox & Car4 |
ISO 19156 Specimen model
Specimen is a kind of
Sampling Feature
• [0..*] Preparation Steps
Issues:
• Can’t tie the
predecessor/successor to a
preparation step
• UML/XML only
SF_Specimen
+ currentLocation: Location [0..1]
+ materialClass: GenericName
+ samplingLocation: GM_Object [0..1]
+ samplingMethod: SF_Process [0..1]
+ samplingTime: TM_Object
+ size: Measure [0..1]
+ specimenType: GenericName [0..1]
SF_SamplingFeature
+ lineage: LI_Lineage [0..1]
+ parameter: NamedValue [0..*]
Location
+ geometryLocation: GM_Object
+ nameLocation: EX_GeographicDescription
GFI_Feature
SamplingFeatureComplex
+ role: GenericName
PreparationStep
+ processOperator: CI_ResponsibleParty [0..1]
+ time: TM_Object
SF_Process
+processingDetails
0..*
Intention
+sampledFeature 1..*
0..*
+relatedSamplingFeature
0..*
PROV and real things | Cox & Car5 |
ISO 19156:2011 Geographic Information – Observations and measurements (S J D Cox, Ed.)
Specimen model in sam-lite
PROV and real things | Cox & Car6 |
S J D Cox, Ontology for observations and sampling features, with alignments to existing models, Sem. Web (in press)
What is PROV?
Core classes:
- Entity
- thing of interest
(‘endurant’)
- Activity
- transformation event
(‘occurrent’)
- Agent
- responsible party or
process
PROV and real things | Cox & Car7 |
T. Lebo, S. Sahoo, D.L. McGuinness, PROV-O: The PROV Ontology, (2013). http://www.w3.org/TR/prov-o/
PROV applications
The entities of interest are usually
• Datasets
• Publications, papers, reports, products
i.e. information objects
How about the ‘internet of things’?
PROV and real things | Cox & Car8 |
Specimen  PROV mapping
Specimen sub-class-of prov:Entity .
Process sub-class-of prov:Agent .
Preparation-step sub-class-of prov:Activity .
Specimen is a real thing!
PROV and real things | Cox & Car9 |
Example:
carbonate
analysis
PROV and real things | Cox & Car10 |
Entities:
specimens,
data, reports
Agents:
people,
machines
Activities:
preparation-steps,
observations
Ontology Design Pattern
PROV and real things | Cox & Car11 |
input data
config
plan
output
data
activity
machine
agent
human
agent
This work with real things fits my generic PROV usage design pattern
Example: insect
taxonomy
PROV and real things | Cox & Car12 |
Entities:
specimens,
data, reports
Agents:
people,
machines
Activities:
preparation-steps,
observations
URIs for agents  some ‘vocabularies’ required
• People, including functional positions
(‘the lab technician at the time’)
• Machines, other pieces of kit
• URIs for specimens – see IGSN
PROV and real things | Cox & Car13 |
Elaborations
PROV and real things | Cox & Car14 |
These examples use only ‘core’ PROV:
qualifiedInfluence properties
enable recording of more detail
Provenance vs provenance
Provenance in GLAM* world
= chain-of-custody of non-reproducible things
To verify identity, but also important for assay data, drug-testing, forensics
Provenance in data world
= transformations of reproducible things
Provenance in the world of specimens
= transformations of non-reproducible things ….?
PROV and real things | Cox & Car15 |
* Galleries, Libraries, Archives & Museums
Summary
• Original application of PROV to real things
• Core PROV model capable of capturing specimen prep & analysis
• Specialization might make it stronger, but little needed
• Bringing ‘data’ provenance home to the GLAM application …
PROV and real things | Cox & Car16 |
CSIRO Land and Water
Simon Cox
Research Scientist
t +61 3 9545 2365
e simon.cox@csiro.au
Geoscience Australia
Nicholas Car
Data Architect
t +61 2 6249 9093
e nicholas.car@ga.gov.au
LAND AND WATER
Thank you

More Related Content

Similar to PROV and Real Things

Pitfalls in alignment of observation models resolved using PROV as an upper o...
Pitfalls in alignment of observation models resolved using PROV as an upper o...Pitfalls in alignment of observation models resolved using PROV as an upper o...
Pitfalls in alignment of observation models resolved using PROV as an upper o...
Simon Cox
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflows
dgarijo
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time Applications
Johann Schleier-Smith
 

Similar to PROV and Real Things (20)

Prov and real things
Prov and real thingsProv and real things
Prov and real things
 
2014 Asdenca - Capability-driven development of a soa platform, a case study
2014 Asdenca - Capability-driven development of a soa platform, a case study2014 Asdenca - Capability-driven development of a soa platform, a case study
2014 Asdenca - Capability-driven development of a soa platform, a case study
 
Pitfalls in alignment of observation models resolved using PROV as an upper o...
Pitfalls in alignment of observation models resolved using PROV as an upper o...Pitfalls in alignment of observation models resolved using PROV as an upper o...
Pitfalls in alignment of observation models resolved using PROV as an upper o...
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
Dolap13 v9 7.docx
Dolap13 v9 7.docxDolap13 v9 7.docx
Dolap13 v9 7.docx
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflows
 
Kelis king - requirements analysis and the unified process
Kelis king - requirements analysis and the unified processKelis king - requirements analysis and the unified process
Kelis king - requirements analysis and the unified process
 
12proto
12proto12proto
12proto
 
Project Management 1. Chi-Fa-Cosa
Project Management 1. Chi-Fa-Cosa Project Management 1. Chi-Fa-Cosa
Project Management 1. Chi-Fa-Cosa
 
Performance testing reference model
Performance testing reference modelPerformance testing reference model
Performance testing reference model
 
Data Science in the Elastic Stack
Data Science in the Elastic StackData Science in the Elastic Stack
Data Science in the Elastic Stack
 
QA&test 2016 (Bilbao) Pros and Cons of Doing Performance Testing Along with D...
QA&test 2016 (Bilbao) Pros and Cons of Doing Performance Testing Along with D...QA&test 2016 (Bilbao) Pros and Cons of Doing Performance Testing Along with D...
QA&test 2016 (Bilbao) Pros and Cons of Doing Performance Testing Along with D...
 
Testware Hierarchy for Test Automation
Testware Hierarchy for Test AutomationTestware Hierarchy for Test Automation
Testware Hierarchy for Test Automation
 
Conway Case Study - Optimizing Application Integration SDLC
Conway Case Study -  Optimizing Application Integration SDLCConway Case Study -  Optimizing Application Integration SDLC
Conway Case Study - Optimizing Application Integration SDLC
 
Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019
Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019
Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019
 
EUGM 2014 - Serge P. Parel (Exquiron): Farewell, PipelinePilot : Migrating th...
EUGM 2014 - Serge P. Parel (Exquiron): Farewell, PipelinePilot : Migrating th...EUGM 2014 - Serge P. Parel (Exquiron): Farewell, PipelinePilot : Migrating th...
EUGM 2014 - Serge P. Parel (Exquiron): Farewell, PipelinePilot : Migrating th...
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time Applications
 
Architecture of automated test cases for legacy applications
Architecture of automated test cases for legacy applicationsArchitecture of automated test cases for legacy applications
Architecture of automated test cases for legacy applications
 
Who cares about Software Process Modelling? A First Investigation about the P...
Who cares about Software Process Modelling? A First Investigation about the P...Who cares about Software Process Modelling? A First Investigation about the P...
Who cares about Software Process Modelling? A First Investigation about the P...
 

Recently uploaded

The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 

Recently uploaded (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 

PROV and Real Things

  • 1. PROV and Real Things LAND AND WATER Simon J D Cox & Nicholas J Car 4th December 2015
  • 2. The problem preparation workflow on a specimen’s journey from collection to analysis is complex, and variable its description must be available in order to evaluate or reproduce observations this workflow may link in to other business workflows like data publication PROV and real things | Cox & Car2 |
  • 3. Real workflows PROV and real things | Cox & Car3 |
  • 4. Proposal: 1. Describe specimen preparation using a generic process model (i.e. input-process-output) 2. Use W3C PROV as the generic process model PROV and real things | Cox & Car4 |
  • 5. ISO 19156 Specimen model Specimen is a kind of Sampling Feature • [0..*] Preparation Steps Issues: • Can’t tie the predecessor/successor to a preparation step • UML/XML only SF_Specimen + currentLocation: Location [0..1] + materialClass: GenericName + samplingLocation: GM_Object [0..1] + samplingMethod: SF_Process [0..1] + samplingTime: TM_Object + size: Measure [0..1] + specimenType: GenericName [0..1] SF_SamplingFeature + lineage: LI_Lineage [0..1] + parameter: NamedValue [0..*] Location + geometryLocation: GM_Object + nameLocation: EX_GeographicDescription GFI_Feature SamplingFeatureComplex + role: GenericName PreparationStep + processOperator: CI_ResponsibleParty [0..1] + time: TM_Object SF_Process +processingDetails 0..* Intention +sampledFeature 1..* 0..* +relatedSamplingFeature 0..* PROV and real things | Cox & Car5 | ISO 19156:2011 Geographic Information – Observations and measurements (S J D Cox, Ed.)
  • 6. Specimen model in sam-lite PROV and real things | Cox & Car6 | S J D Cox, Ontology for observations and sampling features, with alignments to existing models, Sem. Web (in press)
  • 7. What is PROV? Core classes: - Entity - thing of interest (‘endurant’) - Activity - transformation event (‘occurrent’) - Agent - responsible party or process PROV and real things | Cox & Car7 | T. Lebo, S. Sahoo, D.L. McGuinness, PROV-O: The PROV Ontology, (2013). http://www.w3.org/TR/prov-o/
  • 8. PROV applications The entities of interest are usually • Datasets • Publications, papers, reports, products i.e. information objects How about the ‘internet of things’? PROV and real things | Cox & Car8 |
  • 9. Specimen  PROV mapping Specimen sub-class-of prov:Entity . Process sub-class-of prov:Agent . Preparation-step sub-class-of prov:Activity . Specimen is a real thing! PROV and real things | Cox & Car9 |
  • 10. Example: carbonate analysis PROV and real things | Cox & Car10 | Entities: specimens, data, reports Agents: people, machines Activities: preparation-steps, observations
  • 11. Ontology Design Pattern PROV and real things | Cox & Car11 | input data config plan output data activity machine agent human agent This work with real things fits my generic PROV usage design pattern
  • 12. Example: insect taxonomy PROV and real things | Cox & Car12 | Entities: specimens, data, reports Agents: people, machines Activities: preparation-steps, observations
  • 13. URIs for agents  some ‘vocabularies’ required • People, including functional positions (‘the lab technician at the time’) • Machines, other pieces of kit • URIs for specimens – see IGSN PROV and real things | Cox & Car13 |
  • 14. Elaborations PROV and real things | Cox & Car14 | These examples use only ‘core’ PROV: qualifiedInfluence properties enable recording of more detail
  • 15. Provenance vs provenance Provenance in GLAM* world = chain-of-custody of non-reproducible things To verify identity, but also important for assay data, drug-testing, forensics Provenance in data world = transformations of reproducible things Provenance in the world of specimens = transformations of non-reproducible things ….? PROV and real things | Cox & Car15 | * Galleries, Libraries, Archives & Museums
  • 16. Summary • Original application of PROV to real things • Core PROV model capable of capturing specimen prep & analysis • Specialization might make it stronger, but little needed • Bringing ‘data’ provenance home to the GLAM application … PROV and real things | Cox & Car16 |
  • 17. CSIRO Land and Water Simon Cox Research Scientist t +61 3 9545 2365 e simon.cox@csiro.au Geoscience Australia Nicholas Car Data Architect t +61 2 6249 9093 e nicholas.car@ga.gov.au LAND AND WATER Thank you