SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
inteSearch: An Intelligent Linked Data Information Access 
Framework 
Md-Mizanur Rahoman, Ryutaro Ichise 
November 11, 2014
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Outline 
Introduction 
Background of Linked Data Information Access 
Problem and Probable Solution 
Proposed Retrieval Framework: inteSearch 
Pre-processing of Linked Data 
Framework Details 
Experiment 
Conclusion 
Md-Mizanur Rahoman, Ryutaro Ichise j 2
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Linked Data (LD) 
are structured data 
represent knowledge with tuples like 
<< Subject, Predicate, Object >> 
which called as RDF triples 
can be represented by graph 
can use SQL-like expressive query 
store, as openly available, 
2122 datasets, 61 billion 
RDF triples (as of Apr. 2014) 
label 
type 
Property 
type type 
:birthPlace :supervisor :spouse 
Birth Place 
Supervisor Spouse 
label label 
range domain 
domain range 
domainrange 
:Country :Person 
Country 
Person 
label label 
type 
Class 
type 
Schema/Ontology 
:amnd :barl :clra :dnld 
label label 
Amanda 
type 
:grmn :uk :grce 
Germany United 
Kingdom 
Greece 
Donald 
:spouse :supervisor :spouse 
:birthPlace :birthPlace :birthPlace 
:birthPlace 
label label label 
type 
Berlusconi Cleyra 
label label 
Instances 
Md-Mizanur Rahoman, Ryutaro Ichise j 3
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Information Access over LD 
It require 
sub-graph
nding over LD graph 
impose sub-stantial execution cost, 
if graph size get bigger 
know-how of (dataset speci
c) 
vocabulary, schema, LD query 
(i.e., linked data semantics) 
demand domain-level expertise 
expect automated tool to 
understand linked data semantics 
label 
type 
Property 
type type 
:birthPlace :supervisor :spouse 
Birth Place 
Supervisor Spouse 
label label 
range domain 
domain range 
domainrange 
:Country :Person 
Country 
Person 
label label 
type 
Class 
type 
Schema/Ontology 
:amnd :barl :clra :dnld 
label label 
Amanda 
type 
:grmn :uk :grce 
Germany United 
Kingdom 
Greece 
Donald 
:spouse :supervisor :spouse 
:birthPlace :birthPlace :birthPlace 
:birthPlace 
label label label 
type 
Berlusconi Cleyra 
label label 
Instances 
:spouse 
:dnld 
:birthPlacelabel 
:grce 
Donald 
label 
Greece 
Md-Mizanur Rahoman, Ryutaro Ichise j 4
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Contemporary LD Information Access Systems 
Language-Tool-Based-Systems (PowerAqua'06, TBSL'12, 
FREyA'11, SemSek'12, CASIA'13 etc.) 
use language tools (e.g., parser, POS tagger etc.) to predict possible 
sub-graphs (over LD graph) 
convert sub-graphs to
nd SPARQL query 
Pivot-Point-Based-Systems (Treo'11, NLP-Reduce'07 etc.) 
pick a query word (i.e., pivot point), then try to pick other query word 
w.r.t. the pivot point and predict a possible sub-graph (over LD graph) 
convert sub-graph to
nd SPARQL query 
Md-Mizanur Rahoman, Ryutaro Ichise j 5
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Language-Tool-Based-Systems 
Problem 
generate many improper parsed trees - dierent parser gives dierent 
parsed trees, with dierent parsing tags. 
tag for improper semantics (e.g., miss tagging of query words, such as 
whether query word spouse should be tagged for Object or 
Predicate) 
generate empty result or improper result - choosing incorrect sub-graph 
Md-Mizanur Rahoman, Ryutaro Ichise j 6
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Pivot-Point-Based-Systems 
Problem 
depend heavily upon picking correct pivot point - most of the cases, 
systems pick NE (named entities) related pivot points
rst, then other 
pivot points 
impose huge cost, if pivot point need to change - one pivot point can 
have multiple LD resources 
miss contextual information attachment e.g., random choosing of pivot 
points could generate very dierent result 
Md-Mizanur Rahoman, Ryutaro Ichise j 7
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Problem Statement  Probable Solution 
Problem Statement 
To LD information access, how can we
nd the required sub-graph 
(over LD graph) within minimum execution cost that 
will not generate empty result 
will not miss contextual information of query 
Solution 
To
nd correct sub-graph - check maximum possible sub-graph 
generation possibility 
To achieve minimum execute cost - prepare pre-processed LD statistics 
which insight sub-graph generation possibility 
To not lose contextual information of query - adapt a sub-graph 
joining technique called Progressive Joining Approach (Rahoman  
Ichise'14) 
Md-Mizanur Rahoman, Ryutaro Ichise j 8
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
inteSearch - Overview 
Pre-processed data statistics 
store LD resources in a way so that they can be picked easily 
store pattern of LD resources so that they can give insight about 
possible sub-graph 
Development of framework 
generate single query word based graph (called as, Basic Graph) 
merge all Basic Graphs to predict all possible sub-graphs (i.e., called as 
Keyword Graphs) 
rank all possible Keyword Graphs using pre-processed data statistics 
generate SPARQL query for the best ranked Keyword Graphs 
Md-Mizanur Rahoman, Ryutaro Ichise j 9
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Pre-processed data statistics 
Label Extractor - extract and store label of LD resource 
lv (r ) = fo j 9  r ; p; o 2 RDF triples of dataset ^ p 2 rrp 
rrp is resource representing Predicates e.g., label, title etc.g 
Pattern-wise Resource Frequency Generator - compute and store 
LD resource pattern frequency 
sf (r ) = j f r ; p; o j 9  r ; p; o 2 RDF triples of datasetg j 
pf (r ) = j f s; r ; o j 9  s; r ; o 2 RDF triples of datasetg j 
of (r ) = j f s; p; r j 9  s; p; r 2 RDF triples of datasetg j 
Md-Mizanur Rahoman, Ryutaro Ichise j 10
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Example of Pre-processed Data Statistics 
Exemplary LD graph 
Supervisor Spouse 
label 
type 
Property 
type type 
:birthPlace :supervisor :spouse 
Birth Place 
label label 
range domain 
domain range 
domainrange 
:Country :Person 
Country 
Person 
label label 
type 
Class 
type 
Schema/Ontology 
:amnd :barl :clra :dnld 
label label 
Amanda 
type 
:grmn :uk :grce 
Germany United 
Kingdom 
Greece 
Donald 
:spouse :supervisor :spouse 
:birthPlace :birthPlace :birthPlace 
:birthPlace 
label label label 
type 
Berlusconi Cleyra 
label label 
Instances 
Country 
label 
:Country 
type 
Class 
Pre-processed data statistics 
r lv (r ) sf(r) pf (r ) of (r ) 
:Country Country 2 ... ... 
:... ... ... ... ... 
Md-Mizanur Rahoman, Ryutaro Ichise j 11
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Development of Framework 
Basic Graph Generator - generate the Basic Graphs 
Keyword Graph Generator - merge all Basic Graphs to predict the 
Keyword Graphs 
Ranker - rank all possible Keyword Graphs using pre-processed data 
statistics 
SPARQL Query Generator - generate SPARQL query for the best 
ranked Keyword Graphs 
Md-Mizanur Rahoman, Ryutaro Ichise j 12
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Development of Framework 
Md-Mizanur Rahoman, Ryutaro Ichise j 13
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Basic Graph Generator 
Choose one of the three Basic Graphs for each query word 
?o 
?p 
k 
?s , or k 
k , or ?o 
?p 
?s 
decided by (particular) similar LD resources (toward the query word) 
and their pattern frequencies 
e.g., 
if (particular) similar LD resources fR 
g and 
Predicate Pattern-wise Resource Frequency of a LD resource (e.g., 
pf (ri )) is bigger than all Subject and Object Pattern-wise Resource 
Frequencies, then we select Basic Graph 
?o 
k 
?s 
weight computed by highest pattern frequencies of LD resources fR 
g 
Md-Mizanur Rahoman, Ryutaro Ichise j 14
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Development of Framework 
Md-Mizanur Rahoman, Ryutaro Ichise j 15
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Keyword Graph Generator 
Merge all Basic Graphs in their all possible merging options by 
following Progressive Joining Approach 
e.g., merging 1st and 2nd Basic Graphs at all possible options 
k1 
?s ?o 
k 
?p 
?s 2 
1st Basic Graph 
k 
1 
2nd Basic Graph k 
?s1 2 
, and ?s 
k 
?o 
1 
1 
k 
2 
?p 
2 
1 
?o 
k 
?s 
1 
1 k 
2 
?p 
2 
1 
Progressive Joining Approach - if query words with order 
fk1; k2; k3; :::; kmg, then 
join Basic Graph of k1 and Basic Graph of k2 and
nd a 
Intermediate-version Keyword Graph, then 
progressively join next Basic Graph for remaining query words and 
update Intermediate-version Keyword Graph, until there is query word 
Progressive Joining Approach maintain contextual information 
attachment 
Md-Mizanur Rahoman, Ryutaro Ichise j 16
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Progressive Joining Approach - an Example 
Intermediate-version Keyword Graph k 
?p 
?s 
1 
1 ?o 
2 
k2 
1 
?p 
and Next query word corresponding Basic Graph k 
?s 3 
all possible contextualy-feasible Keyword Graph 
Intermediate Next BG Joining between Increase of KG 
Version KG last joined BG 
and next BG 
k 
?p 
?s 
1 
1 ?o 
2 
k2 
?p 
1 k 
?s 3 
k 
k 
2 
?s 3 
1 
?s 
k 
?o 
2 
1 
k 
3 
?p 
3 
2 
?o 
k 
?s 
2 
2 
k 
3 
?p 
3 
1 
k 
k 
2 
?s 3 
1 
?s 
k 
?o 
2 
1 
k 
3 
?p 
3 
2 
?o 
k 
?s 
2 
2 
k 
3 
?p 
3 
1 
k1 
k1 
k1 
Md-Mizanur Rahoman, Ryutaro Ichise j 17
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Development of Framework 
Md-Mizanur Rahoman, Ryutaro Ichise j 18
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Ranker 
Rank Keyword Graphs for 
Weight - minimum weight of constituent Basic Graphs 
Depth level - how many edges a Keyword Graph holds 
Consider lower depth level Keyword Graphs with higher ranked than 
higher depth level Keyword Graphs 
Md-Mizanur Rahoman, Ryutaro Ichise j 19

Weitere ähnliche Inhalte

Was ist angesagt?

Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesIJwest
 
ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.Tatiana Tarasova
 
Classification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF MetricClassification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF MetricMarie Vans
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search enginesBoris Galitsky
 
Python and R for quantitative finance
Python and R for quantitative financePython and R for quantitative finance
Python and R for quantitative financeLuca Sbardella
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - IJaganadh Gopinadhan
 
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevDatabricks
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataYanchang Zhao
 
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018Ciro Sorrentino
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlBen Healey
 
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structurecseij
 

Was ist angesagt? (20)

Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
 
master_thesis_greciano_v2
master_thesis_greciano_v2master_thesis_greciano_v2
master_thesis_greciano_v2
 
Working with text data
Working with text dataWorking with text data
Working with text data
 
ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.
 
Classification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF MetricClassification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF Metric
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
 
SAC 2019 ester giallonardo
SAC 2019 ester giallonardoSAC 2019 ester giallonardo
SAC 2019 ester giallonardo
 
Python and R for quantitative finance
Python and R for quantitative financePython and R for quantitative finance
Python and R for quantitative finance
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
 
Ir 03
Ir   03Ir   03
Ir 03
 
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
 
semanticweb
semanticwebsemanticweb
semanticweb
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
07 04-06
07 04-0607 04-06
07 04-06
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
 
Text Mining with R
Text Mining with RText Mining with R
Text Mining with R
 
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
 
Ir 02
Ir   02Ir   02
Ir 02
 
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structure
 

Andere mochten auch

From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...Tim Wade
 
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw..."Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...Tim Wade
 
Prueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICPrueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICGentedigital
 
세오갤러리
세오갤러리세오갤러리
세오갤러리redcoky
 
Client Case Studies
Client Case StudiesClient Case Studies
Client Case Studiessjpitts00
 
The formula for effective facebook ads
The formula for effective facebook adsThe formula for effective facebook ads
The formula for effective facebook adsguest6aff87b
 
2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!Tim Wade
 
How to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeHow to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeTim Wade
 

Andere mochten auch (18)

TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval FrameworkTLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
 
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data RetrievalInclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
 
BoTLRet: A Template-based Linked Data Information Retrieval
 BoTLRet: A Template-based Linked Data Information Retrieval BoTLRet: A Template-based Linked Data Information Retrieval
BoTLRet: A Template-based Linked Data Information Retrieval
 
TESTING PT 1
TESTING PT 1TESTING PT 1
TESTING PT 1
 
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
 
Savannah, ga 05
Savannah, ga 05Savannah, ga 05
Savannah, ga 05
 
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw..."Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
 
Prueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICPrueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRIC
 
세오갤러리
세오갤러리세오갤러리
세오갤러리
 
An automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked dataAn automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked data
 
Client Case Studies
Client Case StudiesClient Case Studies
Client Case Studies
 
The formula for effective facebook ads
The formula for effective facebook adsThe formula for effective facebook ads
The formula for effective facebook ads
 
Tribus urbanas
Tribus urbanasTribus urbanas
Tribus urbanas
 
A Sacrifice
A SacrificeA Sacrifice
A Sacrifice
 
LiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word FinderLiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word Finder
 
2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!
 
Initiation
InitiationInitiation
Initiation
 
How to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeHow to increase Productivity - Tim Wade
How to increase Productivity - Tim Wade
 

Ähnlich wie inteSearch: An Intelligent Linked Data Information Access Framework

Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Nikolaos Konstantinou
 
Recommender Systems in the Linked Data era
Recommender Systems in the Linked Data eraRecommender Systems in the Linked Data era
Recommender Systems in the Linked Data eraRoku
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialMuhammad Saleem
 
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...Databricks
 
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...Spark Summit
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationMuhammad Saleem
 
Big data analysis using spark r published
Big data analysis using spark r publishedBig data analysis using spark r published
Big data analysis using spark r publishedDipendra Kusi
 
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and RDatabricks
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics EnvironmentIan Foster
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using RVictoria López
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebOana Tifrea-Marciuska
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkGezim Sejdiu
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streamskeski
 
SparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at ScaleSparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at Scalejeykottalam
 
Machine Learning with SparkR
Machine Learning with SparkRMachine Learning with SparkR
Machine Learning with SparkROlgun Aydın
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabImpetus Technologies
 
Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Chris Fregly
 

Ähnlich wie inteSearch: An Intelligent Linked Data Information Access Framework (20)

Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Recommender Systems in the Linked Data era
Recommender Systems in the Linked Data eraRecommender Systems in the Linked Data era
Recommender Systems in the Linked Data era
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
 
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
 
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federation
 
Big data analysis using spark r published
Big data analysis using spark r publishedBig data analysis using spark r published
Big data analysis using spark r published
 
morph-LDP Demo
morph-LDP Demomorph-LDP Demo
morph-LDP Demo
 
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and R
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using R
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic Web
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streams
 
SparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at ScaleSparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at Scale
 
Machine Learning with SparkR
Machine Learning with SparkRMachine Learning with SparkR
Machine Learning with SparkR
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016
 

Kürzlich hochgeladen

welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxVelmuruganTECE
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Crystal Structure analysis and detailed information pptx
Crystal Structure analysis and detailed information pptxCrystal Structure analysis and detailed information pptx
Crystal Structure analysis and detailed information pptxachiever3003
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating SystemRashmi Bhat
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxRomil Mishra
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfRajuKanojiya4
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 
Risk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectRisk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectErbil Polytechnic University
 
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectDM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectssuserb6619e
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Erbil Polytechnic University
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdfHafizMudaserAhmad
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 

Kürzlich hochgeladen (20)

welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptx
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Crystal Structure analysis and detailed information pptx
Crystal Structure analysis and detailed information pptxCrystal Structure analysis and detailed information pptx
Crystal Structure analysis and detailed information pptx
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating System
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptx
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdf
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 
Risk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectRisk Management in Engineering Construction Project
Risk Management in Engineering Construction Project
 
Designing pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptxDesigning pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptx
 
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectDM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 

inteSearch: An Intelligent Linked Data Information Access Framework

  • 1. inteSearch: An Intelligent Linked Data Information Access Framework Md-Mizanur Rahoman, Ryutaro Ichise November 11, 2014
  • 2. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Outline Introduction Background of Linked Data Information Access Problem and Probable Solution Proposed Retrieval Framework: inteSearch Pre-processing of Linked Data Framework Details Experiment Conclusion Md-Mizanur Rahoman, Ryutaro Ichise j 2
  • 3. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Linked Data (LD) are structured data represent knowledge with tuples like << Subject, Predicate, Object >> which called as RDF triples can be represented by graph can use SQL-like expressive query store, as openly available, 2122 datasets, 61 billion RDF triples (as of Apr. 2014) label type Property type type :birthPlace :supervisor :spouse Birth Place Supervisor Spouse label label range domain domain range domainrange :Country :Person Country Person label label type Class type Schema/Ontology :amnd :barl :clra :dnld label label Amanda type :grmn :uk :grce Germany United Kingdom Greece Donald :spouse :supervisor :spouse :birthPlace :birthPlace :birthPlace :birthPlace label label label type Berlusconi Cleyra label label Instances Md-Mizanur Rahoman, Ryutaro Ichise j 3
  • 4. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Information Access over LD It require sub-graph
  • 5. nding over LD graph impose sub-stantial execution cost, if graph size get bigger know-how of (dataset speci
  • 6. c) vocabulary, schema, LD query (i.e., linked data semantics) demand domain-level expertise expect automated tool to understand linked data semantics label type Property type type :birthPlace :supervisor :spouse Birth Place Supervisor Spouse label label range domain domain range domainrange :Country :Person Country Person label label type Class type Schema/Ontology :amnd :barl :clra :dnld label label Amanda type :grmn :uk :grce Germany United Kingdom Greece Donald :spouse :supervisor :spouse :birthPlace :birthPlace :birthPlace :birthPlace label label label type Berlusconi Cleyra label label Instances :spouse :dnld :birthPlacelabel :grce Donald label Greece Md-Mizanur Rahoman, Ryutaro Ichise j 4
  • 7. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Contemporary LD Information Access Systems Language-Tool-Based-Systems (PowerAqua'06, TBSL'12, FREyA'11, SemSek'12, CASIA'13 etc.) use language tools (e.g., parser, POS tagger etc.) to predict possible sub-graphs (over LD graph) convert sub-graphs to
  • 8. nd SPARQL query Pivot-Point-Based-Systems (Treo'11, NLP-Reduce'07 etc.) pick a query word (i.e., pivot point), then try to pick other query word w.r.t. the pivot point and predict a possible sub-graph (over LD graph) convert sub-graph to
  • 9. nd SPARQL query Md-Mizanur Rahoman, Ryutaro Ichise j 5
  • 10. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Language-Tool-Based-Systems Problem generate many improper parsed trees - dierent parser gives dierent parsed trees, with dierent parsing tags. tag for improper semantics (e.g., miss tagging of query words, such as whether query word spouse should be tagged for Object or Predicate) generate empty result or improper result - choosing incorrect sub-graph Md-Mizanur Rahoman, Ryutaro Ichise j 6
  • 11. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Pivot-Point-Based-Systems Problem depend heavily upon picking correct pivot point - most of the cases, systems pick NE (named entities) related pivot points
  • 12. rst, then other pivot points impose huge cost, if pivot point need to change - one pivot point can have multiple LD resources miss contextual information attachment e.g., random choosing of pivot points could generate very dierent result Md-Mizanur Rahoman, Ryutaro Ichise j 7
  • 13. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Problem Statement Probable Solution Problem Statement To LD information access, how can we
  • 14. nd the required sub-graph (over LD graph) within minimum execution cost that will not generate empty result will not miss contextual information of query Solution To
  • 15. nd correct sub-graph - check maximum possible sub-graph generation possibility To achieve minimum execute cost - prepare pre-processed LD statistics which insight sub-graph generation possibility To not lose contextual information of query - adapt a sub-graph joining technique called Progressive Joining Approach (Rahoman Ichise'14) Md-Mizanur Rahoman, Ryutaro Ichise j 8
  • 16. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion inteSearch - Overview Pre-processed data statistics store LD resources in a way so that they can be picked easily store pattern of LD resources so that they can give insight about possible sub-graph Development of framework generate single query word based graph (called as, Basic Graph) merge all Basic Graphs to predict all possible sub-graphs (i.e., called as Keyword Graphs) rank all possible Keyword Graphs using pre-processed data statistics generate SPARQL query for the best ranked Keyword Graphs Md-Mizanur Rahoman, Ryutaro Ichise j 9
  • 17. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Pre-processed data statistics Label Extractor - extract and store label of LD resource lv (r ) = fo j 9 r ; p; o 2 RDF triples of dataset ^ p 2 rrp rrp is resource representing Predicates e.g., label, title etc.g Pattern-wise Resource Frequency Generator - compute and store LD resource pattern frequency sf (r ) = j f r ; p; o j 9 r ; p; o 2 RDF triples of datasetg j pf (r ) = j f s; r ; o j 9 s; r ; o 2 RDF triples of datasetg j of (r ) = j f s; p; r j 9 s; p; r 2 RDF triples of datasetg j Md-Mizanur Rahoman, Ryutaro Ichise j 10
  • 18. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Example of Pre-processed Data Statistics Exemplary LD graph Supervisor Spouse label type Property type type :birthPlace :supervisor :spouse Birth Place label label range domain domain range domainrange :Country :Person Country Person label label type Class type Schema/Ontology :amnd :barl :clra :dnld label label Amanda type :grmn :uk :grce Germany United Kingdom Greece Donald :spouse :supervisor :spouse :birthPlace :birthPlace :birthPlace :birthPlace label label label type Berlusconi Cleyra label label Instances Country label :Country type Class Pre-processed data statistics r lv (r ) sf(r) pf (r ) of (r ) :Country Country 2 ... ... :... ... ... ... ... Md-Mizanur Rahoman, Ryutaro Ichise j 11
  • 19. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Development of Framework Basic Graph Generator - generate the Basic Graphs Keyword Graph Generator - merge all Basic Graphs to predict the Keyword Graphs Ranker - rank all possible Keyword Graphs using pre-processed data statistics SPARQL Query Generator - generate SPARQL query for the best ranked Keyword Graphs Md-Mizanur Rahoman, Ryutaro Ichise j 12
  • 20. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Development of Framework Md-Mizanur Rahoman, Ryutaro Ichise j 13
  • 21. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Basic Graph Generator Choose one of the three Basic Graphs for each query word ?o ?p k ?s , or k k , or ?o ?p ?s decided by (particular) similar LD resources (toward the query word) and their pattern frequencies e.g., if (particular) similar LD resources fR g and Predicate Pattern-wise Resource Frequency of a LD resource (e.g., pf (ri )) is bigger than all Subject and Object Pattern-wise Resource Frequencies, then we select Basic Graph ?o k ?s weight computed by highest pattern frequencies of LD resources fR g Md-Mizanur Rahoman, Ryutaro Ichise j 14
  • 22. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Development of Framework Md-Mizanur Rahoman, Ryutaro Ichise j 15
  • 23. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Keyword Graph Generator Merge all Basic Graphs in their all possible merging options by following Progressive Joining Approach e.g., merging 1st and 2nd Basic Graphs at all possible options k1 ?s ?o k ?p ?s 2 1st Basic Graph k 1 2nd Basic Graph k ?s1 2 , and ?s k ?o 1 1 k 2 ?p 2 1 ?o k ?s 1 1 k 2 ?p 2 1 Progressive Joining Approach - if query words with order fk1; k2; k3; :::; kmg, then join Basic Graph of k1 and Basic Graph of k2 and
  • 24. nd a Intermediate-version Keyword Graph, then progressively join next Basic Graph for remaining query words and update Intermediate-version Keyword Graph, until there is query word Progressive Joining Approach maintain contextual information attachment Md-Mizanur Rahoman, Ryutaro Ichise j 16
  • 25. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Progressive Joining Approach - an Example Intermediate-version Keyword Graph k ?p ?s 1 1 ?o 2 k2 1 ?p and Next query word corresponding Basic Graph k ?s 3 all possible contextualy-feasible Keyword Graph Intermediate Next BG Joining between Increase of KG Version KG last joined BG and next BG k ?p ?s 1 1 ?o 2 k2 ?p 1 k ?s 3 k k 2 ?s 3 1 ?s k ?o 2 1 k 3 ?p 3 2 ?o k ?s 2 2 k 3 ?p 3 1 k k 2 ?s 3 1 ?s k ?o 2 1 k 3 ?p 3 2 ?o k ?s 2 2 k 3 ?p 3 1 k1 k1 k1 Md-Mizanur Rahoman, Ryutaro Ichise j 17
  • 26. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Development of Framework Md-Mizanur Rahoman, Ryutaro Ichise j 18
  • 27. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Ranker Rank Keyword Graphs for Weight - minimum weight of constituent Basic Graphs Depth level - how many edges a Keyword Graph holds Consider lower depth level Keyword Graphs with higher ranked than higher depth level Keyword Graphs Md-Mizanur Rahoman, Ryutaro Ichise j 19
  • 28. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Development of Framework Md-Mizanur Rahoman, Ryutaro Ichise j 20
  • 29. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion SPARQL Query Generator Construct SPARQL query for higher ranked Keyword Graphs, until get the
  • 30. rst non-empty result directly converted by putting Variables in SELECT clause merging keyword corresponding resources in UNION clause Md-Mizanur Rahoman, Ryutaro Ichise j 21
  • 31. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Experimental Setup Question setup Questions: Question Answering over Linked Data test question set 3(QALD-3) consist natural language questions Dataset Total Qs QALD-3 DBpedia 99 99 Keywords: constructed manually w.r.t. word order of question words Evaluation metrics Recall, Precision F1-Measure Evaluated for detail performance analysis, execution complexity measure, comparison with other systems Md-Mizanur Rahoman, Ryutaro Ichise j 22
  • 32. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Detail performance analysis Analyzed for number of keywords each question hold No of Qs Recall (Avg) Precision (Avg) F1 Measure (Avg) One Keyword Group 1 1.00 1.00 1.00 Two Keyword Group 45 0.90 0.96 0.92 Three Keyword Group 13 0.77 0.77 0.77 Four Keyword Group 8 0.75 0.75 0.75 Five Keyword Group 3 1.000 1.000 1.000 0.87 0.90 0.88 Observation according to One/Two/Three Keyword Group questions, selection of Basic Graph works well according to more-than-one Keyword Group questions, merging-based Keyword Graph construction and ranking works well pre-processed data statistics helps in ecient sub-graph
  • 33. nding over linked data graph Md-Mizanur Rahoman, Ryutaro Ichise j 23
  • 34. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Execution time wise performance analysis Environment Machine: Intel R CoreTMi7-4770K central processing unit (CPU) 3.50 GHz based system with 16 GB memory. Triple Store: Network-connected Virtuoso (version 06.01.3127) One Two Three Four Five Keyword Keyword Keyword Keyword Keyword Group Group Group Group Group 710 (ms) 2441 (ms) 2774 (ms) 3585 (ms) 3720 (ms) Observation execution cost linearly increase over number of keywords pre-processed data statistics supports in faster execution Md-Mizanur Rahoman, Ryutaro Ichise j 24
  • 35. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Performance Comparison Compared for QALD-3 challenge participant systems # of Questions Processed Right Partially Recall Precision F1-Measure squall2sparql 99 99 80 13 0.88 0.93 0.90 CASIA 99 52 29 8 0.36 0.35 0.36 Scalewelis 99 70 32 1 0.33 0.33 0.33 inteSearch 99 70 60 1 0.87 0.90 0.88 Observation: pre-processed data statistics helps in ecient sub-graph
  • 36. nding over linked data graph Md-Mizanur Rahoman, Ryutaro Ichise j 25
  • 37. Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion Conclusion IA over LD require
  • 38. nding proper sub-graph over LD graph We contributed devising LD IA framework that does not generate empty result maintain contextual information attachment retrieve rich information with low execution cost Single query word based Basic Graph can be extended for multiple query words, that can increase further eciency Md-Mizanur Rahoman, Ryutaro Ichise j 26
  • 39. Questions? Md-Mizanur Rahoman, mizan@nii.ac.jp Ryutaro Ichise, ichise@nii.ac.jp Md-Mizanur Rahoman, Ryutaro Ichise j 27