SlideShare ist ein Scribd-Unternehmen logo
1 von 24
A Sightseeing Tour of
PROV and Some of its
Extensions
Khalid Belhajjame
LAMSADE, Université Paris-Dauphine
16/03/16 MADICS: ReProVirtuFlow 1
Why do we care about provenance
…
 Help explain results and outliers
 Assess trust and quality
 Promote systems transparency: users are able to
determine whether a particular use of information is
appropriate under a set of rules.
 Assist in debugging
 Promote reuse and reproducibility
16/03/16 MADICS: ReProVirtuFlow 2
A bit of History
Provenance is not a new topic. There has been a lot of
provenance work in:
 Databases, Workflows, Information retrieval, ….
 By 2009, there have been a number of models/vocabularies
for expressing provenance information
 Open Provenance Model (OPM),
 Proof Markup Language (PML),
 Provenance Vocabulary,
 PREservation Metadata : Implementation Strategies
(PREMIS),
 Semantic Web Applications in Neuromedicine (SWAN)
Ontology,
 Dublin Core, ….
16/03/16 MADICS: ReProVirtuFlow 3
A bit of History
 2009-2010: W3C Provenance Provenance Incubator Group
 Objective: provides a state of the art and possible
recommendations for standardization efforts
 2011: W3C Provenance Working Group
 Objective: To define a standard vocabulary primarily for
the semantic Web
 2013: The W3C Provenance Working Group published a
number of PROV recommendations and notes:
 PROV-DM, PROV-O, …
 Since then a number of models and vocabularies have
extended and/or defined mapping rules to PROV
16/03/16 MADICS: ReProVirtuFlow 4
Family of PROV
documents
16/03/16 MADICS: ReProVirtuFlow 5
Family of PROV
documents
16/03/16 MADICS: ReProVirtuFlow 6
Provenance
The W3C Provenance Working Group defined provenance
as:
Provenance is defined as a record that
describes the people, institutions, entities,
and activities involved in producing,
influencing, or delivering a piece of data or
a thing.
16/03/16 MADICS: ReProVirtuFlow 7
PROV…
is not a recommendation for representing and
collecting provenance information that should be
adopted internally by all systems.
 That is not realistic, and won’t happen any time soon
Instead, the aim to facilitate and promote
interoperability between domains and applications that
adopt their specific representations of provenance.
 More pragmatic, and thus likely to happen.
16/03/16 MADICS: ReProVirtuFlow 8
Example
16/03/16 MADICS: ReProVirtuFlow 9
PROV Core Structures
16/03/16 MADICS: ReProVirtuFlow 10
Entity
 An entity is a physical, digital, conceptual, or other
kind of thing with some fixed aspects; entities may
be real or imaginary.
 Example: An entity may be the document at IRI
http://www.bbc.co.uk/news/science-environment-
17526723, a file in a file system, a car, or an idea.
16/03/16 MADICS: ReProVirtuFlow 11
Activity
 An activity is something that occurs over a period of
time and acts upon or with entities; it may include
consuming, processing, transforming, modifying,
relocating, using, or generating entities.
 Example: An activity may be the publishing of a
document on the Web, sending a twitter message,
extracting metadata embedded in a file, driving a
car from Paris to Lyon, etc.
16/03/16 MADICS: ReProVirtuFlow 12
Agent
 An agent is something that bears some form of
responsibility for an activity taking place, for the
existence of an entity, or for another agent's activity.
 Example: A site selling books on the Web and the
companies hosting them can be seen as agents.
16/03/16 MADICS: ReProVirtuFlow 13
Usage and Generation
 Usage is the beginning of utilizing an entity by an
activity. Before usage, the activity had not begun to
utilize this entity and could not have been affected
by the entity.
 Example: A program beginning to read an input file
 Generation is the completion of production of a new
entity by an activity. This entity did not exist before
generation and becomes available for usage after
this generation.
 Example: the completed creation of a file by a
program
16/03/16 MADICS: ReProVirtuFlow 14
Derivation
 Derivation is a transformation of an entity into
another, an update of an entity resulting in a new
one, or the construction of a new entity based on a
pre-existing entity.
 Example: The transformation of a relational table
into a linked data set
16/03/16 MADICS: ReProVirtuFlow 15
Association and Attribution
 An activity association is an assignment of
responsibility to an agent for an activity, indicating
that the agent had a role in the activity
 Example: the workflow system is responsible for the
enactment of a workflow execution
 Attribution is the ascribing of an entity to an agent.
 Example: A blog post can be attributed to an author,
a mobile phone to its manufacturer.
16/03/16 MADICS: ReProVirtuFlow 16
PROV Core Structures
16/03/16 MADICS: ReProVirtuFlow 17
W3C PROV Implementations:
Preliminary Analysis
16/03/16 MADICS: ReProVirtuFlow 18
Source: https://khalidbelhajjame.wordpress.com/2013/04/04/w3c-prov-implementations/
PROV Compliant
Vocabularies
This is by no mean complete ….
PRO
V
ProvONE
wfprov
wfdescc
DC
PAV
extends
extends
c
extends
mapsTo
mapsTo
16/03/16 MADICS: ReProVirtuFlow 19
Prospective provenance
Retrospective
provenance
ProvONE: A PROV Extension Data
Model for Scientific Workflow Provenance
16/03/16 MADICS: ReProVirtuFlow 20
PAV ontology:
provenance, authoring and versioning
16/03/16 MADICS: ReProVirtuFlow 21
PAV ontology:
provenance, authoring and versioning
16/03/16 MADICS: ReProVirtuFlow 22
Acknowledgements
 W3C Provenance Working Group
 DataONE Workflow and Provenance Interest Group
 PAV’s friends: Paolo Ciccarese, Stian Soiland-
Reyes, Alasdair JG Gray, Carole Goble and Tim
Clark
16/03/16 MADICS: ReProVirtuFlow 23
A Sightseeing Tour of
PROV and Some of its
Extensions
Khalid Belhajjame
LAMSADE, Université Paris-Dauphine
16/03/16 MADICS: ReProVirtuFlow 24

Weitere ähnliche Inhalte

Was ist angesagt?

Improving the chemistry content of Wikipedia using workflow tools
Improving the chemistry content of Wikipedia using workflow toolsImproving the chemistry content of Wikipedia using workflow tools
Improving the chemistry content of Wikipedia using workflow toolsMitch Miller
 
Role of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly worksRole of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly worksOpenAIRE
 
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...OpenAIRE
 
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...dgarijo
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community UpdateCarole Goble
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesResearch Data Alliance
 
SOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationdgarijo
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Spark-MPI: Approaching the Fifth Paradigm with Nikolay Malitsky
Spark-MPI: Approaching the Fifth Paradigm with Nikolay MalitskySpark-MPI: Approaching the Fifth Paradigm with Nikolay Malitsky
Spark-MPI: Approaching the Fifth Paradigm with Nikolay MalitskyDatabricks
 
A Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasetsdgarijo
 
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...IJwest
 
2017 06-01-eswc2017-ug
2017 06-01-eswc2017-ug2017 06-01-eswc2017-ug
2017 06-01-eswc2017-ugMonika Solanki
 
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...dgarijo
 

Was ist angesagt? (14)

Improving the chemistry content of Wikipedia using workflow tools
Improving the chemistry content of Wikipedia using workflow toolsImproving the chemistry content of Wikipedia using workflow tools
Improving the chemistry content of Wikipedia using workflow tools
 
Role of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly worksRole of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly works
 
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
 
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
 
Coming to terms to FAIR semantics
Coming to terms to FAIR semanticsComing to terms to FAIR semantics
Coming to terms to FAIR semantics
 
SOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentation
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Spark-MPI: Approaching the Fifth Paradigm with Nikolay Malitsky
Spark-MPI: Approaching the Fifth Paradigm with Nikolay MalitskySpark-MPI: Approaching the Fifth Paradigm with Nikolay Malitsky
Spark-MPI: Approaching the Fifth Paradigm with Nikolay Malitsky
 
A Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasets
 
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
 
2017 06-01-eswc2017-ug
2017 06-01-eswc2017-ug2017 06-01-eswc2017-ug
2017 06-01-eswc2017-ug
 
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
 

Ähnlich wie Tour of PROV Provenance Model and Extensions

TRACK OER - Project proposal
TRACK OER - Project proposalTRACK OER - Project proposal
TRACK OER - Project proposalPatrick McAndrew
 
principle of oop’s in cpp
principle of oop’s in cppprinciple of oop’s in cpp
principle of oop’s in cppgourav kottawar
 
Onboarding in Open Source Software Projects: A Preliminary Analysis
Onboarding in Open Source Software Projects: A Preliminary AnalysisOnboarding in Open Source Software Projects: A Preliminary Analysis
Onboarding in Open Source Software Projects: A Preliminary AnalysisJürgen Münch
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutionsOpen Data Support
 
A LASSO for Linked Data
A LASSO for Linked DataA LASSO for Linked Data
A LASSO for Linked Datathosch
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database dannyijwest
 
PATHS state of the art monitoring report
PATHS state of the art monitoring reportPATHS state of the art monitoring report
PATHS state of the art monitoring reportpathsproject
 
Information sharing pipeline
Information sharing pipelineInformation sharing pipeline
Information sharing pipelineVioleta Ilik
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesOpen Data Support
 
Connecting Publications & Data: Raising visibility of local data collections...
Connecting Publications & Data:  Raising visibility of local data collections...Connecting Publications & Data:  Raising visibility of local data collections...
Connecting Publications & Data: Raising visibility of local data collections...Michael Habib
 
Arguing By Analogy An Essay On Regulating User Generated Content On Peer-2-P...
Arguing By Analogy  An Essay On Regulating User Generated Content On Peer-2-P...Arguing By Analogy  An Essay On Regulating User Generated Content On Peer-2-P...
Arguing By Analogy An Essay On Regulating User Generated Content On Peer-2-P...Tracy Drey
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchangelagoze
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
ICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionMarco Brambilla
 
Getting started with oracle eprcs
Getting started with oracle eprcsGetting started with oracle eprcs
Getting started with oracle eprcsRati Sharma
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability InstituteNeil Chue Hong
 

Ähnlich wie Tour of PROV Provenance Model and Extensions (20)

TRACK OER - Project proposal
TRACK OER - Project proposalTRACK OER - Project proposal
TRACK OER - Project proposal
 
principle of oop’s in cpp
principle of oop’s in cppprinciple of oop’s in cpp
principle of oop’s in cpp
 
Onboarding in Open Source Software Projects: A Preliminary Analysis
Onboarding in Open Source Software Projects: A Preliminary AnalysisOnboarding in Open Source Software Projects: A Preliminary Analysis
Onboarding in Open Source Software Projects: A Preliminary Analysis
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
 
A LASSO for Linked Data
A LASSO for Linked DataA LASSO for Linked Data
A LASSO for Linked Data
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database
 
PATHS state of the art monitoring report
PATHS state of the art monitoring reportPATHS state of the art monitoring report
PATHS state of the art monitoring report
 
2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki
 
Information sharing pipeline
Information sharing pipelineInformation sharing pipeline
Information sharing pipeline
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and Examples
 
Connecting Publications & Data: Raising visibility of local data collections...
Connecting Publications & Data:  Raising visibility of local data collections...Connecting Publications & Data:  Raising visibility of local data collections...
Connecting Publications & Data: Raising visibility of local data collections...
 
Arguing By Analogy An Essay On Regulating User Generated Content On Peer-2-P...
Arguing By Analogy  An Essay On Regulating User Generated Content On Peer-2-P...Arguing By Analogy  An Essay On Regulating User Generated Content On Peer-2-P...
Arguing By Analogy An Essay On Regulating User Generated Content On Peer-2-P...
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
ICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch sessionICWE 2010 Demonstration and Poster elevator pitch session
ICWE 2010 Demonstration and Poster elevator pitch session
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
Getting started with oracle eprcs
Getting started with oracle eprcsGetting started with oracle eprcs
Getting started with oracle eprcs
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability Institute
 

Mehr von Khalid Belhajjame

Lineage-Preserving Anonymization of the Provenance of Collection-Based Workflows
Lineage-Preserving Anonymization of the Provenance of Collection-Based WorkflowsLineage-Preserving Anonymization of the Provenance of Collection-Based Workflows
Lineage-Preserving Anonymization of the Provenance of Collection-Based WorkflowsKhalid Belhajjame
 
Privacy-Preserving Data Analysis Workflows for eScience
Privacy-Preserving Data Analysis Workflows for eSciencePrivacy-Preserving Data Analysis Workflows for eScience
Privacy-Preserving Data Analysis Workflows for eScienceKhalid Belhajjame
 
Linking the prospective and retrospective provenance of scripts
Linking the prospective and retrospective provenance of scriptsLinking the prospective and retrospective provenance of scripts
Linking the prospective and retrospective provenance of scriptsKhalid Belhajjame
 
Introduction to ProvBench @ Provenance Week 2014
Introduction to ProvBench @ Provenance Week 2014Introduction to ProvBench @ Provenance Week 2014
Introduction to ProvBench @ Provenance Week 2014Khalid Belhajjame
 
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...
Small Is Beautiful:  Summarizing Scientific Workflows  Using Semantic Annotat...Small Is Beautiful:  Summarizing Scientific Workflows  Using Semantic Annotat...
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...Khalid Belhajjame
 
Detecting Duplicate Records in Scientific Workflow Results
Detecting Duplicate Records in Scientific Workflow ResultsDetecting Duplicate Records in Scientific Workflow Results
Detecting Duplicate Records in Scientific Workflow ResultsKhalid Belhajjame
 
Research Object Model in Sepublica
Research Object Model in SepublicaResearch Object Model in Sepublica
Research Object Model in SepublicaKhalid Belhajjame
 
Case studyworkshoponprovenance
Case studyworkshoponprovenanceCase studyworkshoponprovenance
Case studyworkshoponprovenanceKhalid Belhajjame
 
Intégration incrémentale de données (Valenciennes juin 2010)
Intégration incrémentale de données (Valenciennes juin 2010)Intégration incrémentale de données (Valenciennes juin 2010)
Intégration incrémentale de données (Valenciennes juin 2010)Khalid Belhajjame
 

Mehr von Khalid Belhajjame (20)

Provenance witha purpose
Provenance witha purposeProvenance witha purpose
Provenance witha purpose
 
Lineage-Preserving Anonymization of the Provenance of Collection-Based Workflows
Lineage-Preserving Anonymization of the Provenance of Collection-Based WorkflowsLineage-Preserving Anonymization of the Provenance of Collection-Based Workflows
Lineage-Preserving Anonymization of the Provenance of Collection-Based Workflows
 
Privacy-Preserving Data Analysis Workflows for eScience
Privacy-Preserving Data Analysis Workflows for eSciencePrivacy-Preserving Data Analysis Workflows for eScience
Privacy-Preserving Data Analysis Workflows for eScience
 
Irpb workshop
Irpb workshopIrpb workshop
Irpb workshop
 
Aussois bda-mdd-2018
Aussois bda-mdd-2018Aussois bda-mdd-2018
Aussois bda-mdd-2018
 
Anr cair meeting feb 2016
Anr cair meeting feb 2016Anr cair meeting feb 2016
Anr cair meeting feb 2016
 
Ikc 2015
Ikc 2015Ikc 2015
Ikc 2015
 
Linking the prospective and retrospective provenance of scripts
Linking the prospective and retrospective provenance of scriptsLinking the prospective and retrospective provenance of scripts
Linking the prospective and retrospective provenance of scripts
 
Reproducibility 1
Reproducibility 1Reproducibility 1
Reproducibility 1
 
Introduction to ProvBench @ Provenance Week 2014
Introduction to ProvBench @ Provenance Week 2014Introduction to ProvBench @ Provenance Week 2014
Introduction to ProvBench @ Provenance Week 2014
 
Tapp 2014 (belhajjame)
Tapp 2014 (belhajjame)Tapp 2014 (belhajjame)
Tapp 2014 (belhajjame)
 
Edbt2014 talk
Edbt2014 talkEdbt2014 talk
Edbt2014 talk
 
Credible workshop
Credible workshopCredible workshop
Credible workshop
 
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...
Small Is Beautiful:  Summarizing Scientific Workflows  Using Semantic Annotat...Small Is Beautiful:  Summarizing Scientific Workflows  Using Semantic Annotat...
Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotat...
 
Why Workflows Break
Why Workflows BreakWhy Workflows Break
Why Workflows Break
 
D-prov use-case
D-prov use-caseD-prov use-case
D-prov use-case
 
Detecting Duplicate Records in Scientific Workflow Results
Detecting Duplicate Records in Scientific Workflow ResultsDetecting Duplicate Records in Scientific Workflow Results
Detecting Duplicate Records in Scientific Workflow Results
 
Research Object Model in Sepublica
Research Object Model in SepublicaResearch Object Model in Sepublica
Research Object Model in Sepublica
 
Case studyworkshoponprovenance
Case studyworkshoponprovenanceCase studyworkshoponprovenance
Case studyworkshoponprovenance
 
Intégration incrémentale de données (Valenciennes juin 2010)
Intégration incrémentale de données (Valenciennes juin 2010)Intégration incrémentale de données (Valenciennes juin 2010)
Intégration incrémentale de données (Valenciennes juin 2010)
 

Kürzlich hochgeladen

INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEaurabinda banchhor
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxElton John Embodo
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 

Kürzlich hochgeladen (20)

INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSE
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docx
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 

Tour of PROV Provenance Model and Extensions

  • 1. A Sightseeing Tour of PROV and Some of its Extensions Khalid Belhajjame LAMSADE, Université Paris-Dauphine 16/03/16 MADICS: ReProVirtuFlow 1
  • 2. Why do we care about provenance …  Help explain results and outliers  Assess trust and quality  Promote systems transparency: users are able to determine whether a particular use of information is appropriate under a set of rules.  Assist in debugging  Promote reuse and reproducibility 16/03/16 MADICS: ReProVirtuFlow 2
  • 3. A bit of History Provenance is not a new topic. There has been a lot of provenance work in:  Databases, Workflows, Information retrieval, ….  By 2009, there have been a number of models/vocabularies for expressing provenance information  Open Provenance Model (OPM),  Proof Markup Language (PML),  Provenance Vocabulary,  PREservation Metadata : Implementation Strategies (PREMIS),  Semantic Web Applications in Neuromedicine (SWAN) Ontology,  Dublin Core, …. 16/03/16 MADICS: ReProVirtuFlow 3
  • 4. A bit of History  2009-2010: W3C Provenance Provenance Incubator Group  Objective: provides a state of the art and possible recommendations for standardization efforts  2011: W3C Provenance Working Group  Objective: To define a standard vocabulary primarily for the semantic Web  2013: The W3C Provenance Working Group published a number of PROV recommendations and notes:  PROV-DM, PROV-O, …  Since then a number of models and vocabularies have extended and/or defined mapping rules to PROV 16/03/16 MADICS: ReProVirtuFlow 4
  • 5. Family of PROV documents 16/03/16 MADICS: ReProVirtuFlow 5
  • 6. Family of PROV documents 16/03/16 MADICS: ReProVirtuFlow 6
  • 7. Provenance The W3C Provenance Working Group defined provenance as: Provenance is defined as a record that describes the people, institutions, entities, and activities involved in producing, influencing, or delivering a piece of data or a thing. 16/03/16 MADICS: ReProVirtuFlow 7
  • 8. PROV… is not a recommendation for representing and collecting provenance information that should be adopted internally by all systems.  That is not realistic, and won’t happen any time soon Instead, the aim to facilitate and promote interoperability between domains and applications that adopt their specific representations of provenance.  More pragmatic, and thus likely to happen. 16/03/16 MADICS: ReProVirtuFlow 8
  • 10. PROV Core Structures 16/03/16 MADICS: ReProVirtuFlow 10
  • 11. Entity  An entity is a physical, digital, conceptual, or other kind of thing with some fixed aspects; entities may be real or imaginary.  Example: An entity may be the document at IRI http://www.bbc.co.uk/news/science-environment- 17526723, a file in a file system, a car, or an idea. 16/03/16 MADICS: ReProVirtuFlow 11
  • 12. Activity  An activity is something that occurs over a period of time and acts upon or with entities; it may include consuming, processing, transforming, modifying, relocating, using, or generating entities.  Example: An activity may be the publishing of a document on the Web, sending a twitter message, extracting metadata embedded in a file, driving a car from Paris to Lyon, etc. 16/03/16 MADICS: ReProVirtuFlow 12
  • 13. Agent  An agent is something that bears some form of responsibility for an activity taking place, for the existence of an entity, or for another agent's activity.  Example: A site selling books on the Web and the companies hosting them can be seen as agents. 16/03/16 MADICS: ReProVirtuFlow 13
  • 14. Usage and Generation  Usage is the beginning of utilizing an entity by an activity. Before usage, the activity had not begun to utilize this entity and could not have been affected by the entity.  Example: A program beginning to read an input file  Generation is the completion of production of a new entity by an activity. This entity did not exist before generation and becomes available for usage after this generation.  Example: the completed creation of a file by a program 16/03/16 MADICS: ReProVirtuFlow 14
  • 15. Derivation  Derivation is a transformation of an entity into another, an update of an entity resulting in a new one, or the construction of a new entity based on a pre-existing entity.  Example: The transformation of a relational table into a linked data set 16/03/16 MADICS: ReProVirtuFlow 15
  • 16. Association and Attribution  An activity association is an assignment of responsibility to an agent for an activity, indicating that the agent had a role in the activity  Example: the workflow system is responsible for the enactment of a workflow execution  Attribution is the ascribing of an entity to an agent.  Example: A blog post can be attributed to an author, a mobile phone to its manufacturer. 16/03/16 MADICS: ReProVirtuFlow 16
  • 17. PROV Core Structures 16/03/16 MADICS: ReProVirtuFlow 17
  • 18. W3C PROV Implementations: Preliminary Analysis 16/03/16 MADICS: ReProVirtuFlow 18 Source: https://khalidbelhajjame.wordpress.com/2013/04/04/w3c-prov-implementations/
  • 19. PROV Compliant Vocabularies This is by no mean complete …. PRO V ProvONE wfprov wfdescc DC PAV extends extends c extends mapsTo mapsTo 16/03/16 MADICS: ReProVirtuFlow 19
  • 20. Prospective provenance Retrospective provenance ProvONE: A PROV Extension Data Model for Scientific Workflow Provenance 16/03/16 MADICS: ReProVirtuFlow 20
  • 21. PAV ontology: provenance, authoring and versioning 16/03/16 MADICS: ReProVirtuFlow 21
  • 22. PAV ontology: provenance, authoring and versioning 16/03/16 MADICS: ReProVirtuFlow 22
  • 23. Acknowledgements  W3C Provenance Working Group  DataONE Workflow and Provenance Interest Group  PAV’s friends: Paolo Ciccarese, Stian Soiland- Reyes, Alasdair JG Gray, Carole Goble and Tim Clark 16/03/16 MADICS: ReProVirtuFlow 23
  • 24. A Sightseeing Tour of PROV and Some of its Extensions Khalid Belhajjame LAMSADE, Université Paris-Dauphine 16/03/16 MADICS: ReProVirtuFlow 24

Hinweis der Redaktion

  1. W3C Incubator Activity with a charter to provide a state-of-the art understanding and develop a roadmap in the area of provenance and possible recommendations for standardization efforts.
  2. W3C Incubator Activity with a charter to provide a state-of-the art understanding and develop a roadmap in the area of provenance and possible recommendations for standardization efforts.
  3. IRI: Internationalized Resource Identifier
  4. The core concepts: Entity, Activity, Agent, Usage and Generation, are supported by almost all implementations. On the other hand, we observe that the core concepts of Attribution, Communication and Delegation are supported by less than half of the implementations.