SlideShare ist ein Scribd-Unternehmen logo
1 von 16
On correctness in RDF stream processor
benchmarking
Daniele Dell’Aglio, Jean-Paul Calbimonte, Marco Balduini,
Oscar Corcho and Emanuele Della Valle
The correctness problem (1)
:alice :isIn :hall
:alice :isIn :kitchen
:bob :isIn :hall
:bob :isIn :kitchen

S

S1

S2

S3

S4

1

3

6

9

t

Executio 1° answer 2° answer
Where are Alice and
n
Bob, when they are
1
:hall [6]
:kitchen [11]
together?
2
:hall [5]
:kitchen [10]
 Let’s consider a
3
:hall [6]
:kitchen [11]
tumbling window
4
- [7]
- [12]
slide
Which is the correct answer?
W(ω=β=5)
All of them!
 Let’s execute the
width
experiment 4 times processor benchmarking
2/16
On correctness in RDF stream
ISWC, 24 October 2013


The correctness problem (2)
:alice :isIn :hall
:alice :isIn :kitchen
:bob :isIn :hall
:bob :isIn :kitchen

S

S1

S2

S3

S4

1

3

6

9

t

System 1

System 2

Executio
n

1° answer

2° answer

Executio
n

1° answer

2° answer

1

:hall [6]

:kitchen [11]

1

:hall [3]

:kitchen [9]

2

:hall [5]

:kitchen [10]

2

3

:hall [6]

:kitchen [11]

3

4

- [7]

- [12]

3/16

No answers
:hall [3]

:kitchen [9]

4
No answers
Which system behaves in the
Both!
correct way?

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Different results for the same query? Yes!
Given the same data and
the same query, we notice
that a system can provide
multiple correct
answers







The operational semantics
of the systems often do not
explain this behaviour



Two systems can provide
different correct
answers


Even if the system
implements similar
operational semantics, they
behave in different ways
(that the model does not
explain)

Why is it important to understand those
behaviours?



4/16

To assess the correct implementation of the systems
To improve the comprehension of the benchmarking
On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Our contribution



A common model for the RDF stream processor
operational semantics
We propose CSR-bench, an extension of the
SRbench benchmark that focuses on correctness





An oracle (an automatic correctness validator)
A test suite

We considered the window-based RDF stream
processors




5/16

CQELS
C-SPARQL
SPARQLstream

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
The CSR Model (by CQL)
Input stream


S2R
operator





R2R
operator





R2S
operator


Converts the infinite stream of RDF elements in a
finite set of mappings
The window operators: time-based, tuple-based,
…
Transform a set of mappings in another set of
mappings
SPARQL 1.0/1.1 queries

Each set of mapping produced by the R2R
operator is transformed and appended to the
output stream
Operators: RStream, DStream, IStream

Output stream
6/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
The window operator (through SECRET)
REPORT: When is the window content
made available to the R2R operator?
Non-empty content, Content-change,
Window-close, Periodic

t0: When does the
window start?
(internal window
param)

R2R operator

ω

β

TICK: When are
data stream
elements added to
the window?
Triple-based vs
graph-based

S

W(ω,β)

S1

S3

S2

S4

S6
S5

S7

S9

S11

S8
S10

S12
t

7/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Classification of existing systems




The report and the tick policies are related to the
RSPs implementation and on how they implement
the window operators
Analysing the RSPs and their related documentation
(papers, technical reports, etc.), we classified the
systems:
CQELS
Report Content-change

Tick Tuple-driven
Empty relation No
notification

8/16

C-SPARQL

SPARQLstream

Window-close
Non-empty
content

Window-close
Non-empty
content

Tuple-driven

Tuple-driven

Yes

No

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
RSP output correctness
:alice :isIn :hall
:alice :isIn :kitchen
:bob :isIn :hall
:bob :isIn :kitchen

S

S1

S2

S3

S4

1

3

6

9

t

t0=0
t0=1
t0=2

Execution

1° answer

2° answer

1

:hall [6]

:kitchen [11]

2

:hall [5]

:kitchen
[10]

3

:hall [6]

:kitchen [11]

4

- [7]

- [12]

9/16

Window

1° answer

2° answer

t0=0
t0=1
t0=2

:hall [5]

:kitchen [10]

:hall [6]

:kitchen [11]

- [7]

- [12]

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
RSP output correctness
:alice :isIn :hall
:alice :isIn :kitchen
:bob :isIn :hall
:bob :isIn :kitchen

S

S1

S2

S3

S4

1

3

6

9

t

System 1

System 2

Executio
n

1° answer

2° answer

Executio
n

1° answer

2° answer

1

:hall [6]

:kitchen [11]

1

:hall [3]

:kitchen [9]

2

:hall [5]

:kitchen [10]

2

3

:hall [6]

:kitchen [11]

3

4

- [7]

- [12]

4

Window-close vs
content-change
report policy
10/16

No answers
:hall [3]

:kitchen [9]
No answers

Empty relation
notification
(yes|no)

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
The oracle
Online
RSP

q

S
Stream
importer

Query
transformer

Query
executor

Result
matcher

M

Correctness
assessment

Offline

Available at: https://github.com/dellaglio/csrbench-ora
(Apache 2.0 licence)
11/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Design of the tests


As data set, we consider the LinkedSensorData data
set




We designed the query set taking into account







Data stream describing blizzards and hurricanes in the
US
Window size and slide parameters
Presence of aggregation operators
Joins of timestamped triples

We collected a set of seven parametrized queries
The list of the queries and their explanation is
available at: http://www.w3.org/wiki/CSRBench
12/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Results








Initialization
Slide parameter
Window contents

timestamps of the triples


13/16

Internal timestamp management

On correctness in RDF stream processor benchmarking

SPARQLstrea

the window operator

C-SPARQL



CQELS



All the three systems that we
considered in our experiments
showed wrong behaviours
The defects we identified are related
to:

Q1
Q2
Q3
Q4
Q5
Q6

Q7

ISWC, 24 October 2013
(Removable) constraints
Reasonin
g

From single to
multi stream

S2R
S2R

R2R

R2S

S2R

From single
to multi
window
q2

14/16

Multiple
queries

On correctness in RDF stream processor benchmarking

Static
knowledge

ISWC, 24 October 2013
Conclusions


A model that describes in a more accurate way the
RSPs’ operational semantics helps their
improvement:





A common and shared test environment helps
developers of both existing and upcoming RSPs




Better design of the system
Prediction of the expected behaviours

It becomes easier to set up experiments to detect defects
and to correct them

Possible improvements of the existing
benchmarks (e.g., SRbench and LSBench)




Design of new tests
Better interpretation of the experiment results

15/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013
Thank you! Questions?

On correctness in RDF stream processor
benchmarking
Daniele Dell’Aglio (DEIB, Politecnico di Milano)
Jean-Paul Calbimonte (OEG, Universidad Politécnica de Madrid)
Marco Balduini (DEIB, Politecnico di Milano)
Oscar Corcho (OEG, Universidad Politécnica de Madrid)
Emanuele Della Valle (DEIB, Politecnico di Milano)

wiki: http://www.w3.org/wiki/CSRBench
software: https://github.com/dellaglio/csrbench-oracle

16/16

On correctness in RDF stream processor benchmarking

ISWC, 24 October 2013

Weitere ähnliche Inhalte

Was ist angesagt?

From Apache Flink® 1.3 to 1.4
From Apache Flink® 1.3 to 1.4From Apache Flink® 1.3 to 1.4
From Apache Flink® 1.3 to 1.4Till Rohrmann
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...Jonas Traub
 
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...Flink Forward
 
On Unified Stream Reasoning - The RDF Stream Processing realm
On Unified Stream Reasoning - The RDF Stream Processing realmOn Unified Stream Reasoning - The RDF Stream Processing realm
On Unified Stream Reasoning - The RDF Stream Processing realmDaniele Dell'Aglio
 
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 22018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2Ververica
 
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP Ververica
 
Kostas Kloudas - Extending Flink's Streaming APIs
Kostas Kloudas - Extending Flink's Streaming APIsKostas Kloudas - Extending Flink's Streaming APIs
Kostas Kloudas - Extending Flink's Streaming APIsVerverica
 
Going Reactive with Spring 5
Going Reactive with Spring 5Going Reactive with Spring 5
Going Reactive with Spring 5Drazen Nikolic
 
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017Till Rohrmann
 
Flexible Memory Allocation in Kinetic Monte Carlo Simulations
Flexible Memory Allocation in Kinetic Monte Carlo SimulationsFlexible Memory Allocation in Kinetic Monte Carlo Simulations
Flexible Memory Allocation in Kinetic Monte Carlo SimulationsAaron Craig
 
Unify Enterprise Data Processing System Platform Level Integration of Flink a...
Unify Enterprise Data Processing System Platform Level Integration of Flink a...Unify Enterprise Data Processing System Platform Level Integration of Flink a...
Unify Enterprise Data Processing System Platform Level Integration of Flink a...Flink Forward
 
Mikio Braun – Data flow vs. procedural programming
Mikio Braun – Data flow vs. procedural programming Mikio Braun – Data flow vs. procedural programming
Mikio Braun – Data flow vs. procedural programming Flink Forward
 
Introduction to R2DBC
Introduction to R2DBCIntroduction to R2DBC
Introduction to R2DBCRob Hedgpeth
 
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4Flink Forward
 
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...Flink Forward
 
Circonus: Design failures - A Case Study
Circonus: Design failures - A Case StudyCirconus: Design failures - A Case Study
Circonus: Design failures - A Case StudyHeinrich Hartmann
 
Seminar @ Tohoku U. 2012.05.31
Seminar @ Tohoku U. 2012.05.31Seminar @ Tohoku U. 2012.05.31
Seminar @ Tohoku U. 2012.05.31Yoshitaro Takaesu
 

Was ist angesagt? (19)

From Apache Flink® 1.3 to 1.4
From Apache Flink® 1.3 to 1.4From Apache Flink® 1.3 to 1.4
From Apache Flink® 1.3 to 1.4
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
 
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...
Flink Forward Berlin 2017: Dongwon Kim - Predictive Maintenance with Apache F...
 
On Unified Stream Reasoning - The RDF Stream Processing realm
On Unified Stream Reasoning - The RDF Stream Processing realmOn Unified Stream Reasoning - The RDF Stream Processing realm
On Unified Stream Reasoning - The RDF Stream Processing realm
 
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 22018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2
2018-01 Seattle Apache Flink Meetup at OfferUp, Opening Remarks and Talk 2
 
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP
Kostas Kloudas - Complex Event Processing with Flink: the state of FlinkCEP
 
Kostas Kloudas - Extending Flink's Streaming APIs
Kostas Kloudas - Extending Flink's Streaming APIsKostas Kloudas - Extending Flink's Streaming APIs
Kostas Kloudas - Extending Flink's Streaming APIs
 
Going Reactive with Spring 5
Going Reactive with Spring 5Going Reactive with Spring 5
Going Reactive with Spring 5
 
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
 
Apache flink
Apache flinkApache flink
Apache flink
 
Flexible Memory Allocation in Kinetic Monte Carlo Simulations
Flexible Memory Allocation in Kinetic Monte Carlo SimulationsFlexible Memory Allocation in Kinetic Monte Carlo Simulations
Flexible Memory Allocation in Kinetic Monte Carlo Simulations
 
Zurich Flink Meetup
Zurich Flink MeetupZurich Flink Meetup
Zurich Flink Meetup
 
Unify Enterprise Data Processing System Platform Level Integration of Flink a...
Unify Enterprise Data Processing System Platform Level Integration of Flink a...Unify Enterprise Data Processing System Platform Level Integration of Flink a...
Unify Enterprise Data Processing System Platform Level Integration of Flink a...
 
Mikio Braun – Data flow vs. procedural programming
Mikio Braun – Data flow vs. procedural programming Mikio Braun – Data flow vs. procedural programming
Mikio Braun – Data flow vs. procedural programming
 
Introduction to R2DBC
Introduction to R2DBCIntroduction to R2DBC
Introduction to R2DBC
 
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
 
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...
Flink Forward Berlin 2018: Shriya Arora - "Taming large-state to join dataset...
 
Circonus: Design failures - A Case Study
Circonus: Design failures - A Case StudyCirconus: Design failures - A Case Study
Circonus: Design failures - A Case Study
 
Seminar @ Tohoku U. 2012.05.31
Seminar @ Tohoku U. 2012.05.31Seminar @ Tohoku U. 2012.05.31
Seminar @ Tohoku U. 2012.05.31
 

Ähnlich wie On correctness in RDF stream processor benchmarking

An experience on empirical research about rdf stream
An experience on empirical research about rdf streamAn experience on empirical research about rdf stream
An experience on empirical research about rdf streamDaniele Dell'Aglio
 
Connecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebConnecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebJean-Paul Calbimonte
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactJean-Paul Calbimonte
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks Emanuele Della Valle
 
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...Maxim Kolchin
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
RSP4J: An API for RDF Stream Processing
RSP4J: An API for RDF Stream ProcessingRSP4J: An API for RDF Stream Processing
RSP4J: An API for RDF Stream ProcessingRiccardo Tommasini
 
RDF Stream Processing and the role of Semantics
RDF Stream Processing and the role of SemanticsRDF Stream Processing and the role of Semantics
RDF Stream Processing and the role of SemanticsJean-Paul Calbimonte
 
RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)Daniele Dell'Aglio
 
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Evangelos Kalampokis
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingPlanetData Network of Excellence
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
Apache Lens at Hadoop meetup
Apache Lens at Hadoop meetupApache Lens at Hadoop meetup
Apache Lens at Hadoop meetupamarsri
 
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit Store
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit StoreDistributed Collaboration on RDF Datasets Using Git: Towards the Quit Store
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit StoreLinked Enterprise Date Services
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Nikolaos Konstantinou
 
polystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfpolystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfRim Moussa
 
Triplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebTriplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebDaniele Dell'Aglio
 

Ähnlich wie On correctness in RDF stream processor benchmarking (20)

An experience on empirical research about rdf stream
An experience on empirical research about rdf streamAn experience on empirical research about rdf stream
An experience on empirical research about rdf stream
 
Connecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebConnecting Stream Reasoners on the Web
Connecting Stream Reasoners on the Web
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks
 
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...
YABench: A Comprehensive Framework for RDF Stream Processor Correctness and P...
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
RSP4J: An API for RDF Stream Processing
RSP4J: An API for RDF Stream ProcessingRSP4J: An API for RDF Stream Processing
RSP4J: An API for RDF Stream Processing
 
RDF Stream Processing and the role of Semantics
RDF Stream Processing and the role of SemanticsRDF Stream Processing and the role of Semantics
RDF Stream Processing and the role of Semantics
 
RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)
 
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream Processing
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Apache Lens at Hadoop meetup
Apache Lens at Hadoop meetupApache Lens at Hadoop meetup
Apache Lens at Hadoop meetup
 
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit Store
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit StoreDistributed Collaboration on RDF Datasets Using Git: Towards the Quit Store
Distributed Collaboration on RDF Datasets Using Git: Towards the Quit Store
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
ECSA 2013 (Cuesta)
ECSA 2013 (Cuesta)ECSA 2013 (Cuesta)
ECSA 2013 (Cuesta)
 
polystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfpolystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdf
 
Triplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebTriplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the Web
 

Mehr von Daniele Dell'Aglio

Distributed stream consistency checking
Distributed stream consistency checkingDistributed stream consistency checking
Distributed stream consistency checkingDaniele Dell'Aglio
 
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...Daniele Dell'Aglio
 
Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Daniele Dell'Aglio
 
Querying the Web of Data with XSPARQL 1.1
Querying the Web of Data with XSPARQL 1.1Querying the Web of Data with XSPARQL 1.1
Querying the Web of Data with XSPARQL 1.1Daniele Dell'Aglio
 
Augmented Participation to Live Events through Social Network Content Enrichm...
Augmented Participation to Live Events through Social Network Content Enrichm...Augmented Participation to Live Events through Social Network Content Enrichm...
Augmented Participation to Live Events through Social Network Content Enrichm...Daniele Dell'Aglio
 
RDF Stream Processing Models (RSP2014)
RDF Stream Processing Models (RSP2014)RDF Stream Processing Models (RSP2014)
RDF Stream Processing Models (RSP2014)Daniele Dell'Aglio
 
A Survey of Temporal Extensions of Description Logics
A Survey of Temporal Extensions of Description LogicsA Survey of Temporal Extensions of Description Logics
A Survey of Temporal Extensions of Description LogicsDaniele Dell'Aglio
 
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)Daniele Dell'Aglio
 
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Daniele Dell'Aglio
 
An Ontological Formulation and an OPM profile for Causality in Planning Appli...
An Ontological Formulation and an OPM profile for Causality in Planning Appli...An Ontological Formulation and an OPM profile for Causality in Planning Appli...
An Ontological Formulation and an OPM profile for Causality in Planning Appli...Daniele Dell'Aglio
 
P&MSP2012 - Version Control Systems
P&MSP2012 - Version Control SystemsP&MSP2012 - Version Control Systems
P&MSP2012 - Version Control SystemsDaniele Dell'Aglio
 
P&MSP2012 - Logging Frameworks
P&MSP2012 - Logging FrameworksP&MSP2012 - Logging Frameworks
P&MSP2012 - Logging FrameworksDaniele Dell'Aglio
 

Mehr von Daniele Dell'Aglio (17)

Distributed stream consistency checking
Distributed stream consistency checkingDistributed stream consistency checking
Distributed stream consistency checking
 
On web stream processing
On web stream processingOn web stream processing
On web stream processing
 
On a web of data streams
On a web of data streamsOn a web of data streams
On a web of data streams
 
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
 
Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016
 
On Unified Stream Reasoning
On Unified Stream ReasoningOn Unified Stream Reasoning
On Unified Stream Reasoning
 
Querying the Web of Data with XSPARQL 1.1
Querying the Web of Data with XSPARQL 1.1Querying the Web of Data with XSPARQL 1.1
Querying the Web of Data with XSPARQL 1.1
 
Augmented Participation to Live Events through Social Network Content Enrichm...
Augmented Participation to Live Events through Social Network Content Enrichm...Augmented Participation to Live Events through Social Network Content Enrichm...
Augmented Participation to Live Events through Social Network Content Enrichm...
 
RDF Stream Processing Models (RSP2014)
RDF Stream Processing Models (RSP2014)RDF Stream Processing Models (RSP2014)
RDF Stream Processing Models (RSP2014)
 
A Survey of Temporal Extensions of Description Logics
A Survey of Temporal Extensions of Description LogicsA Survey of Temporal Extensions of Description Logics
A Survey of Temporal Extensions of Description Logics
 
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)
IMaRS - Incremental Materialization for RDF Streams (SR4LD2013)
 
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
 
An Ontological Formulation and an OPM profile for Causality in Planning Appli...
An Ontological Formulation and an OPM profile for Causality in Planning Appli...An Ontological Formulation and an OPM profile for Causality in Planning Appli...
An Ontological Formulation and an OPM profile for Causality in Planning Appli...
 
P&MSP2012 - Maven
P&MSP2012 - MavenP&MSP2012 - Maven
P&MSP2012 - Maven
 
P&MSP2012 - Version Control Systems
P&MSP2012 - Version Control SystemsP&MSP2012 - Version Control Systems
P&MSP2012 - Version Control Systems
 
P&MSP2012 - Unit Testing
P&MSP2012 - Unit TestingP&MSP2012 - Unit Testing
P&MSP2012 - Unit Testing
 
P&MSP2012 - Logging Frameworks
P&MSP2012 - Logging FrameworksP&MSP2012 - Logging Frameworks
P&MSP2012 - Logging Frameworks
 

Kürzlich hochgeladen

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 

Kürzlich hochgeladen (20)

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 

On correctness in RDF stream processor benchmarking

  • 1. On correctness in RDF stream processor benchmarking Daniele Dell’Aglio, Jean-Paul Calbimonte, Marco Balduini, Oscar Corcho and Emanuele Della Valle
  • 2. The correctness problem (1) :alice :isIn :hall :alice :isIn :kitchen :bob :isIn :hall :bob :isIn :kitchen S S1 S2 S3 S4 1 3 6 9 t Executio 1° answer 2° answer Where are Alice and n Bob, when they are 1 :hall [6] :kitchen [11] together? 2 :hall [5] :kitchen [10]  Let’s consider a 3 :hall [6] :kitchen [11] tumbling window 4 - [7] - [12] slide Which is the correct answer? W(ω=β=5) All of them!  Let’s execute the width experiment 4 times processor benchmarking 2/16 On correctness in RDF stream ISWC, 24 October 2013 
  • 3. The correctness problem (2) :alice :isIn :hall :alice :isIn :kitchen :bob :isIn :hall :bob :isIn :kitchen S S1 S2 S3 S4 1 3 6 9 t System 1 System 2 Executio n 1° answer 2° answer Executio n 1° answer 2° answer 1 :hall [6] :kitchen [11] 1 :hall [3] :kitchen [9] 2 :hall [5] :kitchen [10] 2 3 :hall [6] :kitchen [11] 3 4 - [7] - [12] 3/16 No answers :hall [3] :kitchen [9] 4 No answers Which system behaves in the Both! correct way? On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 4. Different results for the same query? Yes! Given the same data and the same query, we notice that a system can provide multiple correct answers    The operational semantics of the systems often do not explain this behaviour  Two systems can provide different correct answers  Even if the system implements similar operational semantics, they behave in different ways (that the model does not explain) Why is it important to understand those behaviours?   4/16 To assess the correct implementation of the systems To improve the comprehension of the benchmarking On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 5. Our contribution   A common model for the RDF stream processor operational semantics We propose CSR-bench, an extension of the SRbench benchmark that focuses on correctness    An oracle (an automatic correctness validator) A test suite We considered the window-based RDF stream processors    5/16 CQELS C-SPARQL SPARQLstream On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 6. The CSR Model (by CQL) Input stream  S2R operator   R2R operator   R2S operator  Converts the infinite stream of RDF elements in a finite set of mappings The window operators: time-based, tuple-based, … Transform a set of mappings in another set of mappings SPARQL 1.0/1.1 queries Each set of mapping produced by the R2R operator is transformed and appended to the output stream Operators: RStream, DStream, IStream Output stream 6/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 7. The window operator (through SECRET) REPORT: When is the window content made available to the R2R operator? Non-empty content, Content-change, Window-close, Periodic t0: When does the window start? (internal window param) R2R operator ω β TICK: When are data stream elements added to the window? Triple-based vs graph-based S W(ω,β) S1 S3 S2 S4 S6 S5 S7 S9 S11 S8 S10 S12 t 7/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 8. Classification of existing systems   The report and the tick policies are related to the RSPs implementation and on how they implement the window operators Analysing the RSPs and their related documentation (papers, technical reports, etc.), we classified the systems: CQELS Report Content-change Tick Tuple-driven Empty relation No notification 8/16 C-SPARQL SPARQLstream Window-close Non-empty content Window-close Non-empty content Tuple-driven Tuple-driven Yes No On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 9. RSP output correctness :alice :isIn :hall :alice :isIn :kitchen :bob :isIn :hall :bob :isIn :kitchen S S1 S2 S3 S4 1 3 6 9 t t0=0 t0=1 t0=2 Execution 1° answer 2° answer 1 :hall [6] :kitchen [11] 2 :hall [5] :kitchen [10] 3 :hall [6] :kitchen [11] 4 - [7] - [12] 9/16 Window 1° answer 2° answer t0=0 t0=1 t0=2 :hall [5] :kitchen [10] :hall [6] :kitchen [11] - [7] - [12] On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 10. RSP output correctness :alice :isIn :hall :alice :isIn :kitchen :bob :isIn :hall :bob :isIn :kitchen S S1 S2 S3 S4 1 3 6 9 t System 1 System 2 Executio n 1° answer 2° answer Executio n 1° answer 2° answer 1 :hall [6] :kitchen [11] 1 :hall [3] :kitchen [9] 2 :hall [5] :kitchen [10] 2 3 :hall [6] :kitchen [11] 3 4 - [7] - [12] 4 Window-close vs content-change report policy 10/16 No answers :hall [3] :kitchen [9] No answers Empty relation notification (yes|no) On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 11. The oracle Online RSP q S Stream importer Query transformer Query executor Result matcher M Correctness assessment Offline Available at: https://github.com/dellaglio/csrbench-ora (Apache 2.0 licence) 11/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 12. Design of the tests  As data set, we consider the LinkedSensorData data set   We designed the query set taking into account      Data stream describing blizzards and hurricanes in the US Window size and slide parameters Presence of aggregation operators Joins of timestamped triples We collected a set of seven parametrized queries The list of the queries and their explanation is available at: http://www.w3.org/wiki/CSRBench 12/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 13. Results      Initialization Slide parameter Window contents timestamps of the triples  13/16 Internal timestamp management On correctness in RDF stream processor benchmarking SPARQLstrea the window operator C-SPARQL  CQELS  All the three systems that we considered in our experiments showed wrong behaviours The defects we identified are related to: Q1 Q2 Q3 Q4 Q5 Q6 Q7 ISWC, 24 October 2013
  • 14. (Removable) constraints Reasonin g From single to multi stream S2R S2R R2R R2S S2R From single to multi window q2 14/16 Multiple queries On correctness in RDF stream processor benchmarking Static knowledge ISWC, 24 October 2013
  • 15. Conclusions  A model that describes in a more accurate way the RSPs’ operational semantics helps their improvement:    A common and shared test environment helps developers of both existing and upcoming RSPs   Better design of the system Prediction of the expected behaviours It becomes easier to set up experiments to detect defects and to correct them Possible improvements of the existing benchmarks (e.g., SRbench and LSBench)   Design of new tests Better interpretation of the experiment results 15/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013
  • 16. Thank you! Questions? On correctness in RDF stream processor benchmarking Daniele Dell’Aglio (DEIB, Politecnico di Milano) Jean-Paul Calbimonte (OEG, Universidad Politécnica de Madrid) Marco Balduini (DEIB, Politecnico di Milano) Oscar Corcho (OEG, Universidad Politécnica de Madrid) Emanuele Della Valle (DEIB, Politecnico di Milano) wiki: http://www.w3.org/wiki/CSRBench software: https://github.com/dellaglio/csrbench-oracle 16/16 On correctness in RDF stream processor benchmarking ISWC, 24 October 2013