SlideShare a Scribd company logo
1 of 51
Download to read offline
Dynamic Planning for Link Discovery
Kleanthi Georgala and Daniel Obraczka and Axel-Cyrille Ngonga Ngomo
AKSW Research Group, University of Leipzig, Germany
Data Science Group (DICE), Paderborn University, Germany
June 2nd, 2018
Heraklion, Crete, Greece
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 1 / 29
Overview
1 Motivation
2 Approach
3 Evaluation
4 Conclusions and Future Work
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 2 / 29
Motivation
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 3 / 29
Why Link Discovery?
:E1 rdfs:label "Engine failure"@en
:E1 rdf:type :Error
:E1 :beginDate :"2015-04-22T11:39:35"
:E1 :endDate :"2015-04-22T11:39:37"
:E2 rdfs:label "Car accident"@en
:E2 rdf:type :Accident
:E2 :beginDate :"2015-06-28T11:45:22"
:E2 :endDate :"2015-06-28T11:45:24"
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 4 / 29
What is Link Discovery
4th Linked Data principle: Include links to other
URIs so that they can discover more things.
Definition (Link Discovery)
Given sets S and T of resources and relation R
Find M = {(s, t) ∈ S × T : R(s, t)}
Example: R = :failureType
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 5 / 29
Declarative Link Discovery
M is difficult to compute directly
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
Declarative Link Discovery
M is difficult to compute directly
compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ}
use Link Specification (LS)
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
Declarative Link Discovery
M is difficult to compute directly
compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ}
use Link Specification (LS)
describe conditions for which R(s, t) holds
Similarity measure m : S × T → [0, 1]
Specification operators op: , , 
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
Declarative Link Discovery
M is difficult to compute directly
compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ}
use Link Specification (LS)
describe conditions for which R(s, t) holds
Similarity measure m : S × T → [0, 1]
Specification operators op: , , 
(θ, 0.73) cosine(:label, :label), 0.46
Right Child
Similarity measure:cosine(:label, :label)Threshold:0.46
Atomic LS
trigrams(:type, :type), 0.87
Left Child
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
Why is it difficult?
Accuracy: correct links
Genetic programming
Refinement operators
. . .
Time efficiency: fast and scalable linking
Runtime reduction of the atomic similarity measures
Planning algorithms (e.g. HELIOS [1])
Use of cost functions to approximate runtime of LS
No exploitation of global knowledge about the LS
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 7 / 29
Our Contributions
Intuition
The execution engine knows more about runtimes than the planner once it has
executed a portion of the specification.
First dynamic planner for LD (Condor)
Mutable plans by re-shaping
Feedback loop between the planner and
the engine
Duplicated steps are executed once
Dependencies between steps of the plan
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 8 / 29
Approach
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 9 / 29
Dynamic execution of LS
LS Rewriter Planner Execution Engine Links
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 10 / 29
Planning for LD - Example LS
(θ, 0.5) 
(θ, 0.6)
trigrams(name, name), 0.8
cosine(label, label), 0.4
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 11 / 29
Condor
Condor
Left Child LS
cosine(label, label), 0.4
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
Condor
Condor
Left Child LS
cosine(label, label), 0.4
trigrams(name, name), 0.8
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
Condor
Condor
Left Child LS
cosine(label, label), 0.4
trigrams(name, name), 0.8
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
Condor
Condor
Left Child LS
cosine(label, label), 0.4
trigrams(name, name), 0.8
(θ, 0.7)
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
Condor
operator
RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
RT( ) = 1s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 1.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
Condor
operator
RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
RT( ) = 1s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 1.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
RT(Plan2) = 1.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(cosine(label, label), 0.4)
RT(ϕ(cosine(label, label), 0.4)) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
Condor
operator
RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
RT( ) = 1s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 1.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
RT(Plan2) = 1.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(cosine(label, label), 0.4)
RT(ϕ(cosine(label, label), 0.4)) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
RT(Plan3) = 2.6s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(trigrams(name, name), 0.8)
RT(ϕ(trigrams(name, name), 0.8)) = 0.6s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 1.8s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
Condor
(θ, 0.5) 
(θ, 0.6) ϕ(cosine(label, label), 0.4)
trigrams(name, name), 0.8
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 14 / 29
Condor
Condor:  operator
(θ, 0.5)
RT(θ, 0.5) = 0.2s

RT() = 1s
(θ, 0.6) ϕ(cosine(label, label), 0.4)
RT(cosine(label, label), 0.4) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
(θ, 0.7)
RT(θ, 0.6) = 0.2s
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC1) = 2.7s + RT(LeftChild)
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 15 / 29
Condor
Condor:  operator
(θ, 0.5)
RT(θ, 0.5) = 0.2s

RT() = 1s
(θ, 0.6) ϕ(cosine(label, label), 0.4)
RT(cosine(label, label), 0.4) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0.5s
(θ, 0.7)
RT(θ, 0.6) = 0.2s
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC1) = 2.7s + RT(LeftChild)
(θ, 0.5)
RT(θ, 0.5) = 0.2s
(θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8))
RT(( (cosine(label, label), 0.4), (trigrams(name, name), 0.8)) = 1.3s
(θ, 0.7)
RT(θ, 0.6) = 0.2s
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC2) = 1.7s + RT(LeftChild)
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 15 / 29
Condor
Condor: initial Plan
(θ, 0.5) (θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8))
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(P(Condor)) = 5.2s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 16 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Set Left Child and its sub-LSs as executed
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Set Left Child and its sub-LSs as executed
Condor:
Receive feedback from Execution Engine
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Set Left Child and its sub-LSs as executed
Condor:
Receive feedback from Execution Engine
Re-evaluate plan
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Set Left Child and its sub-LSs as executed
Condor:
Receive feedback from Execution Engine
Re-evaluate plan
Executed plans are more important than runtime estimations
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Dynamic execution of LS
Execution Engine:
Execute Left Child
Cache intermediate results
Replace the estimated costs with its real costs
Set Left Child and its sub-LSs as executed
Condor:
Receive feedback from Execution Engine
Re-evaluate plan
Executed plans are more important than runtime estimations
Re-plan remaining steps
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
Condor
Re-planning Right Child LS
RT(Plan1) = 0s, RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0s
RT( ) = 0s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
Condor
Re-planning Right Child LS
RT(Plan1) = 0s, RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0s
RT( ) = 0s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
RT(Plan2) = 1s, RT(Plan2) = 1.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(cosine(label, label), 0.4)
RT(ϕ(cosine(label, label), 0.4)) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
Condor
Re-planning Right Child LS
RT(Plan1) = 0s, RT(Plan1) = 3.5s
(θ, 0.6)
RT(θ, 0.6) = 0s
RT( ) = 0s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
RT(Plan2) = 1s, RT(Plan2) = 1.5s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(cosine(label, label), 0.4)
RT(ϕ(cosine(label, label), 0.4)) = 0.8s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
RT(Plan3) = 0.8s, RT(Plan3) = 2.6s
(θ, 0.6)
RT(θ, 0.6) = 0.2s
ϕ(trigrams(name, name), 0.8)
RT(ϕ(trigrams(name, name), 0.8)) = 0.6s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
Condor
Condor:  operator
(θ, 0.5)
RT(θ, 0.5) = 0.2s

RT() = 1s
(θ, 0.6)
RT(θ, 0.6) = 0s
RT( ) = 0s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC1 ) = 1.2s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 19 / 29
Condor
Condor:  operator
(θ, 0.5)
RT(θ, 0.5) = 0.2s

RT() = 1s
(θ, 0.6)
RT(θ, 0.6) = 0s
RT( ) = 0s
cosine(label, label), 0.4
RT(cosine(label, label), 0.4) = 0s
trigrams(name, name), 0.8
RT(trigrams(name, name), 0.8) = 0s
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC1 ) = 1.2s
(θ, 0.5)
RT(θ, 0.5) = 0.2s
(θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8))
RT(( (cosine(label, label), 0.4), (trigrams(name, name), 0.8)) = 1.3s
(θ, 0.7)
RT(θ, 0.6) = 0.2s
trigrams(name, name), 0.8
cosine(label, label), 0.4
RT(PC2 ) = 1.7s
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 19 / 29
Condor
(θ, 0.5) 
(θ, 0.6)
(θ, 0.7)
trigrams(name, name), 0.8
cosine(label, label), 0.4
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 20 / 29
Evaluation
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 21 / 29
Experiment set-up
Datasets:
4 benchmark datasets: Abt-Buy Amazon-GP, DBLP-ACM and DBLP-Scholar
Scalability: MOVIES, TOWNS and VILLAGES
Input LS:
100 LSs for each dataset by Eagle
Unsupervised version
High accuracy in LSs
Comparison with Canonical and Helios
All planners achieved 100% F-measure
Evaluation metric: Runtime
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 22 / 29
Experiment 1
Q1 : Does Condor achieve better runtimes for LSs?
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 23 / 29
Experiment 1
Q1 : Does Condor achieve better runtimes for LSs?
Condor outperforms both static planners in all datasets
Wilcoxon signed-rank test on cumulative runtimes: statistically significant
differences
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 23 / 29
Experiment 2
Q2 : How much time does Condor spend planning?
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 24 / 29
Experiment 2
Q2 : How much time does Condor spend planning?
Condor needs less than 10ms for planning
Best average performance in Amazon-GP
4.6 times faster than Canonical
8 times faster than Helios
0.1% of overall runtime used in planning
Highest absolute difference in DBLP-Scholar
600s less runtime than Canonical
110s less runtime than Helios
0.0005% of overall runtime used in planning
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 24 / 29
Experiment 3
Q3 : How do the different sizes of LSs affect Condor ’s runtime?
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 25 / 29
Experiment 3
Q3 : How do the different sizes of LSs affect Condor ’s runtime?
LSs of size 1: same results for all planners
LSs of size 3: 7.5% faster than static planners
LSs of size 5++: 30.5% resp. 55.7% less time compared to Canonical
resp. Helios
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 25 / 29
Conclusions and Future Work
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 26 / 29
Conclusions and Future Work
Condor, a dynamic planner for link discovery:
Combination of dynamic planning with subsumption and result caching
Comparison with state-of-the-art: Canonical and Helios
Evaluation:
Experiments on 7 datasets: variety in size and classes
Significantly better runtimes than existing planning solutions
Up to 2 orders of magnitude faster
Requires less than 0.1% of the total runtime of a given LS for plan generation
Future Work:
Improvement of the cost function
Parallel execution of plans
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 27 / 29
Thank you!
Visit http://aksw.org/Projects/LIMES.html
https://twitter.com/DiceResearch
Questions?
Kleanthi Georgala
georgala@informatik.uni-leipzig.de
AKSW Research Group at Leipzig University
DICE Group at Paderborn University
http://aksw.org/KleanthiGeorgala.html
This work has been supported by H2020 projects SLIPO (GA no. 731581) and HOBBIT (GA no. 688227) as well as the DFG project LinkingLOD (project
no. NG 105/3-2) and the BMWI Project GEISER (project no. 01MD16014)
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 28 / 29
References
A.-C. Ngonga Ngomo.
HELIOS - Execution Optimization for Link Discovery.
In The Semantic Web - ISWC 2014 - 13th International Semantic Web Conference,
Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I, pages 17–32.
Springer, 2014.
Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 29 / 29

More Related Content

Similar to Dynamic planning for link discovery - ESWC 2018

CS 542 -- Query Optimization
CS 542 -- Query OptimizationCS 542 -- Query Optimization
CS 542 -- Query Optimization
J Singh
 
Company_X_Data_Analyst_Challenge
Company_X_Data_Analyst_ChallengeCompany_X_Data_Analyst_Challenge
Company_X_Data_Analyst_Challenge
Mark Yashar
 

Similar to Dynamic planning for link discovery - ESWC 2018 (20)

Graph Analytics with ArangoDB
Graph Analytics with ArangoDBGraph Analytics with ArangoDB
Graph Analytics with ArangoDB
 
CS 542 -- Query Optimization
CS 542 -- Query OptimizationCS 542 -- Query Optimization
CS 542 -- Query Optimization
 
Exploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraphExploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraph
 
Tech talk ggplot2
Tech talk   ggplot2Tech talk   ggplot2
Tech talk ggplot2
 
Company_X_Data_Analyst_Challenge
Company_X_Data_Analyst_ChallengeCompany_X_Data_Analyst_Challenge
Company_X_Data_Analyst_Challenge
 
Cdis hammar stadler_15_oct_2020
Cdis hammar stadler_15_oct_2020Cdis hammar stadler_15_oct_2020
Cdis hammar stadler_15_oct_2020
 
2018 Fighting Game AI Competition
2018 Fighting Game AI Competition 2018 Fighting Game AI Competition
2018 Fighting Game AI Competition
 
SECRYPT 2018 Presentation: 15th International Conference on Security and Cry...
SECRYPT 2018  Presentation: 15th International Conference on Security and Cry...SECRYPT 2018  Presentation: 15th International Conference on Security and Cry...
SECRYPT 2018 Presentation: 15th International Conference on Security and Cry...
 
Data Modeling, Normalization and Denormalization | Nordic PGDay 2018 | Dimitr...
Data Modeling, Normalization and Denormalization | Nordic PGDay 2018 | Dimitr...Data Modeling, Normalization and Denormalization | Nordic PGDay 2018 | Dimitr...
Data Modeling, Normalization and Denormalization | Nordic PGDay 2018 | Dimitr...
 
Link Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: EfficiencyLink Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: Efficiency
 
Model-Based Optimization / INFORMS International
Model-Based Optimization / INFORMS InternationalModel-Based Optimization / INFORMS International
Model-Based Optimization / INFORMS International
 
On the Effect of Geometries Simplification on Geo-spatial Link Discovery
On the Effect of Geometries Simplification on Geo-spatial Link DiscoveryOn the Effect of Geometries Simplification on Geo-spatial Link Discovery
On the Effect of Geometries Simplification on Geo-spatial Link Discovery
 
The Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link DiscoveryThe Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
 
presentation_n.pdf
presentation_n.pdfpresentation_n.pdf
presentation_n.pdf
 
Tensor Explained
Tensor ExplainedTensor Explained
Tensor Explained
 
R getting spatial
R getting spatialR getting spatial
R getting spatial
 
DL-Foil:Class Expression Learning Revisited
DL-Foil:Class Expression Learning RevisitedDL-Foil:Class Expression Learning Revisited
DL-Foil:Class Expression Learning Revisited
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
 
L'ingénierie dans les nuages
L'ingénierie dans les nuagesL'ingénierie dans les nuages
L'ingénierie dans les nuages
 
Outrageous Ideas for Graph Databases
Outrageous Ideas for Graph DatabasesOutrageous Ideas for Graph Databases
Outrageous Ideas for Graph Databases
 

More from Holistic Benchmarking of Big Linked Data

More from Holistic Benchmarking of Big Linked Data (20)

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
 
Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
 
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
 

Recently uploaded

Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 

Recently uploaded (20)

Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 

Dynamic planning for link discovery - ESWC 2018

  • 1. Dynamic Planning for Link Discovery Kleanthi Georgala and Daniel Obraczka and Axel-Cyrille Ngonga Ngomo AKSW Research Group, University of Leipzig, Germany Data Science Group (DICE), Paderborn University, Germany June 2nd, 2018 Heraklion, Crete, Greece Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 1 / 29
  • 2. Overview 1 Motivation 2 Approach 3 Evaluation 4 Conclusions and Future Work Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 2 / 29
  • 3. Motivation Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 3 / 29
  • 4. Why Link Discovery? :E1 rdfs:label "Engine failure"@en :E1 rdf:type :Error :E1 :beginDate :"2015-04-22T11:39:35" :E1 :endDate :"2015-04-22T11:39:37" :E2 rdfs:label "Car accident"@en :E2 rdf:type :Accident :E2 :beginDate :"2015-06-28T11:45:22" :E2 :endDate :"2015-06-28T11:45:24" Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 4 / 29
  • 5. What is Link Discovery 4th Linked Data principle: Include links to other URIs so that they can discover more things. Definition (Link Discovery) Given sets S and T of resources and relation R Find M = {(s, t) ∈ S × T : R(s, t)} Example: R = :failureType Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 5 / 29
  • 6. Declarative Link Discovery M is difficult to compute directly Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
  • 7. Declarative Link Discovery M is difficult to compute directly compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ} use Link Specification (LS) Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
  • 8. Declarative Link Discovery M is difficult to compute directly compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ} use Link Specification (LS) describe conditions for which R(s, t) holds Similarity measure m : S × T → [0, 1] Specification operators op: , , Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
  • 9. Declarative Link Discovery M is difficult to compute directly compute M = {(s, t) ∈ S × T : σ(s, t) ≥ θ} use Link Specification (LS) describe conditions for which R(s, t) holds Similarity measure m : S × T → [0, 1] Specification operators op: , , (θ, 0.73) cosine(:label, :label), 0.46 Right Child Similarity measure:cosine(:label, :label)Threshold:0.46 Atomic LS trigrams(:type, :type), 0.87 Left Child Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 6 / 29
  • 10. Why is it difficult? Accuracy: correct links Genetic programming Refinement operators . . . Time efficiency: fast and scalable linking Runtime reduction of the atomic similarity measures Planning algorithms (e.g. HELIOS [1]) Use of cost functions to approximate runtime of LS No exploitation of global knowledge about the LS Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 7 / 29
  • 11. Our Contributions Intuition The execution engine knows more about runtimes than the planner once it has executed a portion of the specification. First dynamic planner for LD (Condor) Mutable plans by re-shaping Feedback loop between the planner and the engine Duplicated steps are executed once Dependencies between steps of the plan Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 8 / 29
  • 12. Approach Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 9 / 29
  • 13. Dynamic execution of LS LS Rewriter Planner Execution Engine Links Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 10 / 29
  • 14. Planning for LD - Example LS (θ, 0.5) (θ, 0.6) trigrams(name, name), 0.8 cosine(label, label), 0.4 (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 11 / 29
  • 15. Condor Condor Left Child LS cosine(label, label), 0.4 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
  • 16. Condor Condor Left Child LS cosine(label, label), 0.4 trigrams(name, name), 0.8 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
  • 17. Condor Condor Left Child LS cosine(label, label), 0.4 trigrams(name, name), 0.8 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
  • 18. Condor Condor Left Child LS cosine(label, label), 0.4 trigrams(name, name), 0.8 (θ, 0.7) Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 12 / 29
  • 19. Condor operator RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0.2s RT( ) = 1s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 1.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
  • 20. Condor operator RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0.2s RT( ) = 1s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 1.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s RT(Plan2) = 1.5s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(cosine(label, label), 0.4) RT(ϕ(cosine(label, label), 0.4)) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
  • 21. Condor operator RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0.2s RT( ) = 1s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 1.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s RT(Plan2) = 1.5s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(cosine(label, label), 0.4) RT(ϕ(cosine(label, label), 0.4)) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s RT(Plan3) = 2.6s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(trigrams(name, name), 0.8) RT(ϕ(trigrams(name, name), 0.8)) = 0.6s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 1.8s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 13 / 29
  • 22. Condor (θ, 0.5) (θ, 0.6) ϕ(cosine(label, label), 0.4) trigrams(name, name), 0.8 (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 14 / 29
  • 23. Condor Condor: operator (θ, 0.5) RT(θ, 0.5) = 0.2s RT() = 1s (θ, 0.6) ϕ(cosine(label, label), 0.4) RT(cosine(label, label), 0.4) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s (θ, 0.7) RT(θ, 0.6) = 0.2s trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC1) = 2.7s + RT(LeftChild) Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 15 / 29
  • 24. Condor Condor: operator (θ, 0.5) RT(θ, 0.5) = 0.2s RT() = 1s (θ, 0.6) ϕ(cosine(label, label), 0.4) RT(cosine(label, label), 0.4) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0.5s (θ, 0.7) RT(θ, 0.6) = 0.2s trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC1) = 2.7s + RT(LeftChild) (θ, 0.5) RT(θ, 0.5) = 0.2s (θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8)) RT(( (cosine(label, label), 0.4), (trigrams(name, name), 0.8)) = 1.3s (θ, 0.7) RT(θ, 0.6) = 0.2s trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC2) = 1.7s + RT(LeftChild) Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 15 / 29
  • 25. Condor Condor: initial Plan (θ, 0.5) (θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8)) (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(P(Condor)) = 5.2s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 16 / 29
  • 26. Dynamic execution of LS Execution Engine: Execute Left Child Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 27. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 28. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 29. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Set Left Child and its sub-LSs as executed Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 30. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Set Left Child and its sub-LSs as executed Condor: Receive feedback from Execution Engine Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 31. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Set Left Child and its sub-LSs as executed Condor: Receive feedback from Execution Engine Re-evaluate plan Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 32. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Set Left Child and its sub-LSs as executed Condor: Receive feedback from Execution Engine Re-evaluate plan Executed plans are more important than runtime estimations Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 33. Dynamic execution of LS Execution Engine: Execute Left Child Cache intermediate results Replace the estimated costs with its real costs Set Left Child and its sub-LSs as executed Condor: Receive feedback from Execution Engine Re-evaluate plan Executed plans are more important than runtime estimations Re-plan remaining steps Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 17 / 29
  • 34. Condor Re-planning Right Child LS RT(Plan1) = 0s, RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0s RT( ) = 0s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
  • 35. Condor Re-planning Right Child LS RT(Plan1) = 0s, RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0s RT( ) = 0s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s RT(Plan2) = 1s, RT(Plan2) = 1.5s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(cosine(label, label), 0.4) RT(ϕ(cosine(label, label), 0.4)) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
  • 36. Condor Re-planning Right Child LS RT(Plan1) = 0s, RT(Plan1) = 3.5s (θ, 0.6) RT(θ, 0.6) = 0s RT( ) = 0s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s RT(Plan2) = 1s, RT(Plan2) = 1.5s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(cosine(label, label), 0.4) RT(ϕ(cosine(label, label), 0.4)) = 0.8s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s RT(Plan3) = 0.8s, RT(Plan3) = 2.6s (θ, 0.6) RT(θ, 0.6) = 0.2s ϕ(trigrams(name, name), 0.8) RT(ϕ(trigrams(name, name), 0.8)) = 0.6s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 18 / 29
  • 37. Condor Condor: operator (θ, 0.5) RT(θ, 0.5) = 0.2s RT() = 1s (θ, 0.6) RT(θ, 0.6) = 0s RT( ) = 0s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC1 ) = 1.2s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 19 / 29
  • 38. Condor Condor: operator (θ, 0.5) RT(θ, 0.5) = 0.2s RT() = 1s (θ, 0.6) RT(θ, 0.6) = 0s RT( ) = 0s cosine(label, label), 0.4 RT(cosine(label, label), 0.4) = 0s trigrams(name, name), 0.8 RT(trigrams(name, name), 0.8) = 0s (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC1 ) = 1.2s (θ, 0.5) RT(θ, 0.5) = 0.2s (θ, 0.6) ((cosine(label, label), 0.4), (trigrams(name, name), 0.8)) RT(( (cosine(label, label), 0.4), (trigrams(name, name), 0.8)) = 1.3s (θ, 0.7) RT(θ, 0.6) = 0.2s trigrams(name, name), 0.8 cosine(label, label), 0.4 RT(PC2 ) = 1.7s Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 19 / 29
  • 39. Condor (θ, 0.5) (θ, 0.6) (θ, 0.7) trigrams(name, name), 0.8 cosine(label, label), 0.4 Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 20 / 29
  • 40. Evaluation Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 21 / 29
  • 41. Experiment set-up Datasets: 4 benchmark datasets: Abt-Buy Amazon-GP, DBLP-ACM and DBLP-Scholar Scalability: MOVIES, TOWNS and VILLAGES Input LS: 100 LSs for each dataset by Eagle Unsupervised version High accuracy in LSs Comparison with Canonical and Helios All planners achieved 100% F-measure Evaluation metric: Runtime Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 22 / 29
  • 42. Experiment 1 Q1 : Does Condor achieve better runtimes for LSs? Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 23 / 29
  • 43. Experiment 1 Q1 : Does Condor achieve better runtimes for LSs? Condor outperforms both static planners in all datasets Wilcoxon signed-rank test on cumulative runtimes: statistically significant differences Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 23 / 29
  • 44. Experiment 2 Q2 : How much time does Condor spend planning? Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 24 / 29
  • 45. Experiment 2 Q2 : How much time does Condor spend planning? Condor needs less than 10ms for planning Best average performance in Amazon-GP 4.6 times faster than Canonical 8 times faster than Helios 0.1% of overall runtime used in planning Highest absolute difference in DBLP-Scholar 600s less runtime than Canonical 110s less runtime than Helios 0.0005% of overall runtime used in planning Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 24 / 29
  • 46. Experiment 3 Q3 : How do the different sizes of LSs affect Condor ’s runtime? Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 25 / 29
  • 47. Experiment 3 Q3 : How do the different sizes of LSs affect Condor ’s runtime? LSs of size 1: same results for all planners LSs of size 3: 7.5% faster than static planners LSs of size 5++: 30.5% resp. 55.7% less time compared to Canonical resp. Helios Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 25 / 29
  • 48. Conclusions and Future Work Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 26 / 29
  • 49. Conclusions and Future Work Condor, a dynamic planner for link discovery: Combination of dynamic planning with subsumption and result caching Comparison with state-of-the-art: Canonical and Helios Evaluation: Experiments on 7 datasets: variety in size and classes Significantly better runtimes than existing planning solutions Up to 2 orders of magnitude faster Requires less than 0.1% of the total runtime of a given LS for plan generation Future Work: Improvement of the cost function Parallel execution of plans Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 27 / 29
  • 50. Thank you! Visit http://aksw.org/Projects/LIMES.html https://twitter.com/DiceResearch Questions? Kleanthi Georgala georgala@informatik.uni-leipzig.de AKSW Research Group at Leipzig University DICE Group at Paderborn University http://aksw.org/KleanthiGeorgala.html This work has been supported by H2020 projects SLIPO (GA no. 731581) and HOBBIT (GA no. 688227) as well as the DFG project LinkingLOD (project no. NG 105/3-2) and the BMWI Project GEISER (project no. 01MD16014) Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 28 / 29
  • 51. References A.-C. Ngonga Ngomo. HELIOS - Execution Optimization for Link Discovery. In The Semantic Web - ISWC 2014 - 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I, pages 17–32. Springer, 2014. Georgala, Obraczka & Ngonga Ngomo (DICE) CONDOR June 5, 2018 29 / 29