SlideShare ist ein Scribd-Unternehmen logo
1 von 18
SAFE
Policy Aware SPARQL Query Federation
Over RDF Data Cubes
Muhammad Saleem
AKSW, University of Leipzig, Germany
09/02/2015
Paper Presented in SWAT4LS-2014, Berlin, Germany
Enabling networked knowledge
2
Linked2Safety
1. Secured, semantically-interconnecting electronic health
records and clinical trials system
2. Advancing clinical research using Semantic Web
technologies
3. Inter-connect clinical data from CHUV (Switzerland),
ZEINCRO (Greece), CING (Cyprus)
Enabling networked knowledge
3
Patients’ anonymity Data Ownership & Privacy
 Anonymised Clinical Data cubes
 Insensitive clinical parameters without
personal information
 Access-Control Based Query Federation
Safety First – Linked2Safety Ethical & Legal Aspects
Enabling networked knowledge
4
The Problem
return number of patients that have been administered the drug Insulin and exhibit
BMI > 25 and Hypertension and Diabetes as adverse events
Switzerland Cyprus Greece
Enabling networked knowledge
5
The Problem
Access Policy Framework
Oya
Clinical Researcher
Expertise – Diabetes
Requested Data
Input Input
Denies AccessGrants Access
Query execution plan is affected
Enabling networked knowledge
6
SAFE - Secure SPARQL Query Federation
CHUV
(Data Cubes)
ZEINCRO
(Data Cubes)
CING
(Data Cubes)
RDF Data Cubes
Index
RDF Data Cubes
RDF Data Cubes
Access
Policy Model
Source
Selection
Access
Policy Filter
Query re-
Writer
SPARQL Query
Results
Enabling networked knowledge
7
SPARQL
Query + User
Info
SAFE
Source
Selection
Access Policy
Filtering
Query Re-
writing
SELECT ?diabetes ?bmi ?hypertension ?cases
WHERE {
?observation a qb:Observation .
?observation l2s-dim:Diabetes ?diabetes.
?observation l2s-dim:BMI ?bmi.
?observation l2s-dim:Hypertension ?hypertension.
?observation sdmx-measure:Cases ?cases.
}
Oya
Clinical Researcher
Expertise – Diabetes
Enabling networked knowledge
9
SPARQL
Query + User
Info
SAFE – Source Selection
Source
Selection
Access Policy
Filtering
Query Re-
writing
Triples Patterns
?observation l2s-dim:Diabetes ?diabetes.
?observation l2s-dim:BMI ?bmi.
?observation l2s-dim:Hypertension ?hypertension.
?observation sdmx-measure:Cases ?cases.
Capable Sources
{S1, S2, S3, }
{S1, S2, S3}
{S1, S2, S3}
{S1, S2, S3, S4}
S1={ }
S4={ Smoking, Gender, Cases }
INDEX
Diabetes, BMI, Hypertension, Cases
S2={ }Diabetes, BMI, Hypertension, Cases
Diabetes,
S3={ }Diabetes, BMI, Hypertension, HIV, Cases
S4
Join Awareness
Enabling networked knowledge
10
Access Policy Framework
SPARQL
Query + User
Info
SAFE – Access Policy
Triple Pattern-
based Source
Selection
Access Policy
Filtering
Query Re-
writing
Oya
Clinical Researcher
Expertise – Diabetes
Requested Data
S1 S2 S3
Input Input
Denies AccessGrants Access
S1
S2
S3
Enabling networked knowledge
11
AP1 type Access_Policy
AP1 applies_to {S1, S2}
AP1 grants_access Read
AP1 assigned_to Oya
SPARQL
Query + User
Info
SAFE – Access Policy
Source
Selection
Access Policy
Filtering
Query Re-
writing
Oya type User
Oya haslocation Galway
Oya hasPurpose Perform p-value analysis
Oya hasRole Clinical Researcher
Oya hasDomain Diabetes
ASK WHERE {
?accessPolicy a AccessPolicy.
?accessPolicy appliesToNamedGraph S1.
?accessPolicy :rantsAccess acl:Read_l2s,
?accessPolicy hasUser Oya.
}
• SPARQL Query
• Example Access Policy
grantsAccess
Enabling networked knowledge
12
SPARQL
Query + User
Info
 Graph Information will be added to the
query triples
 SELECT …. WHERE { GRAPH <S1> { …. } }
 SELECT …. WHERE { GRAPH <S2> { …. } }
 Sub queries sent to relevant sources
 S1
 S2
 Integration of results obtained from each
sources
SAFE – Query Rewriting
Source
Selection
Access Policy
Filtering
Query Re-
writing
Diabetes BMI Hypertension Cases
0 0 0 40
1 0 1 50
0 1 1 120
1 1 1 90
S1
S2
Enabling networked knowledge
13
Evaluation - DataSets
Dataset # triples # obs # sub # pred # obj # size # index
size
# index generation
time
Internal Dataset
CHUV 0.8 M 96 K 96 K 36 88 31 MB - -
CING 0.1 M 17 K 17 K 21 51 5 MB - -
ZEINCRO 0.4 M 49 K 49 K 24 59 15 MB - -
Total 1.3 M 162 K 162 K 81 198 51 MB 8 KB 10 sec
External Dataset
World Bank 77 M 10 M 10 M 58 40 K 19 GB - -
IMF 18 M 1.8 M 1.8 M 30 3151 3.51 GB - -
Eurostat 0.3 M 38 K 44 K 31 5717 205 MB - -
Trans. Int. 43 K 3939 4286 64 5290 9.2 MB - -
Total 95 M 12 M 2 M 183 54 K 23 GB 12 KB 571 sec
Enabling networked knowledge
14
Evaluation - Berlin SPARQL Benchmark
Characteristics Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12
# of Triple Patterns 9 7 9 16 7 8 11 10 7 7 3 7
# of Sources 3 4 4 3 4 3 3 4 3 3 3 3
# of Results 41 50 348 41 62 1983 5 10 1701 19656 570 41
Filters 
> 9 Patterns       
Negation 
LIMIT Modifier    
Order By Modifier   
DISTINCT Modifier          
REGEX Operator 
UNION Operator 
Enabling networked knowledge
15
Evaluation – Source Selection
Systems Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Avg
SAFE 8 10 13 16 15 13 15 16 7 7 9 7 11
FedX 9 13 16 24 20 14 16 19 15 17 9 16 16
Systems Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Avg
SAFE 0 0 0 0 0 0 0 0 0 0 0 0 0
FedX 36 28 40 64 48 40 44 40 21 21 9 21 35
• Sum of triple-pattern-wise sources selected for each query
• Number of SPARQL ASK requests used for source selection
Enabling networked knowledge
16
Evaluation - Source Selection Time
• Source Selection Time
Enabling networked knowledge
17
Evaluation - Query Execution Time
• Query Execution Time
Query times-out for FedX
Enabling networked knowledge
18
 Source Selection
 SPARQL SERVICE
 Using SPARQL ASK queries
 Using a catalog/index
 SAFE - Hybrid (catalog/index + ASK)
 Lightweight Cache
– RDF Cube Data Structure
– AccessPolicy
 Join Aware
 Excludes ineligible sources before actual query join
 Provenance – via RDF Named Graphs
 Self-contained Data Cubes
 Creator
 Location
 Date
 Access rights
SAFE – Highlights
Enabling networked knowledge
19
SAFE - Team
• Yasar Khan
• Muhammad Saleem
• Aftab Iqbal
• Muntazir Mehdi
• Aidan Hogan
• Panagiotis Hasapis
• Axel-Cyrille Ngonga Ngomo
• Stefan Decker
• Ratnesh Sahay
Thank You
http://linked2safety.hcls.deri.org:8080/SAFE-Demo/

Weitere ähnliche Inhalte

Was ist angesagt?

Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Alasdair Gray
 
Debunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsNeo4j
 
2010 06 ipaw_prv
2010 06 ipaw_prv2010 06 ipaw_prv
2010 06 ipaw_prvJun Zhao
 
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Ig Bittencourt
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables祺傑 林
 
2009 Dils Flyweb
2009 Dils Flyweb2009 Dils Flyweb
2009 Dils FlywebJun Zhao
 
The Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchThe Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchJeremy Leipzig
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02eswcsummerschool
 
Annotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonAnnotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonRobert Sanderson
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on AndroidEUCLID project
 
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsThe HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsAlasdair Gray
 
2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata2010 03 Lodoxf Openflydata
2010 03 Lodoxf OpenflydataJun Zhao
 
A Comparison Between Python APIs For RDF Processing
A Comparison Between Python APIs For RDF ProcessingA Comparison Between Python APIs For RDF Processing
A Comparison Between Python APIs For RDF Processinglucianb
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologiesMelanie Courtot
 
Bio ontologies and semantic technologies
Bio ontologies and semantic technologiesBio ontologies and semantic technologies
Bio ontologies and semantic technologiesProf. Wim Van Criekinge
 

Was ist angesagt? (19)

RDF data model
RDF data modelRDF data model
RDF data model
 
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
 
Sparql
SparqlSparql
Sparql
 
Eswc2018 wimu slides
Eswc2018 wimu slidesEswc2018 wimu slides
Eswc2018 wimu slides
 
Debunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative Facts
 
2010 06 ipaw_prv
2010 06 ipaw_prv2010 06 ipaw_prv
2010 06 ipaw_prv
 
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
 
GraphDB
GraphDBGraphDB
GraphDB
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
 
2009 Dils Flyweb
2009 Dils Flyweb2009 Dils Flyweb
2009 Dils Flyweb
 
The Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchThe Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational Research
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Annotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonAnnotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and Triannon
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsThe HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
 
2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata
 
A Comparison Between Python APIs For RDF Processing
A Comparison Between Python APIs For RDF ProcessingA Comparison Between Python APIs For RDF Processing
A Comparison Between Python APIs For RDF Processing
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies
 
Bio ontologies and semantic technologies
Bio ontologies and semantic technologiesBio ontologies and semantic technologies
Bio ontologies and semantic technologies
 

Ähnlich wie SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes

SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesSAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesRatnesh Sahay
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...David Peyruc
 
BDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBigData_Europe
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...Kathleen Jagodnik
 
Data Mapping in Clinical Trials
Data Mapping in Clinical TrialsData Mapping in Clinical Trials
Data Mapping in Clinical TrialsClinosolIndia
 
Neo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data SciencesSaama
 
Rich feeds for rescue, palms cyberinfrastructure integration stories
Rich feeds for rescue, palms cyberinfrastructure   integration storiesRich feeds for rescue, palms cyberinfrastructure   integration stories
Rich feeds for rescue, palms cyberinfrastructure integration storiesbdemchak
 
Overview of policies for security and data sharing
Overview of policies for security and data sharingOverview of policies for security and data sharing
Overview of policies for security and data sharingbdemchak
 
Rich feeds for rescue an integration story
Rich feeds for rescue   an integration storyRich feeds for rescue   an integration story
Rich feeds for rescue an integration storybdemchak
 
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science CentralCloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science CentralPaolo Missier
 
2016 Standardization of Laboratory Test Coding - PHI Conference
2016 Standardization of Laboratory Test Coding - PHI Conference2016 Standardization of Laboratory Test Coding - PHI Conference
2016 Standardization of Laboratory Test Coding - PHI ConferenceMegan Sawchuk
 
CDISC & Risk Based Monitoring to Compress Clinical Trial Duration
CDISC & Risk Based Monitoring to Compress Clinical Trial DurationCDISC & Risk Based Monitoring to Compress Clinical Trial Duration
CDISC & Risk Based Monitoring to Compress Clinical Trial DurationClinical Data Inc .
 
Meaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine researchMeaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine researchNolan Nichols
 
Evaluating Cloud vs On-Premises for NGS Clinical Workflows
Evaluating Cloud vs On-Premises for NGS Clinical WorkflowsEvaluating Cloud vs On-Premises for NGS Clinical Workflows
Evaluating Cloud vs On-Premises for NGS Clinical WorkflowsGolden Helix
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareInnovation Agency
 
Automating Clinical Trials in Pharmacology Units
Automating Clinical Trials in Pharmacology UnitsAutomating Clinical Trials in Pharmacology Units
Automating Clinical Trials in Pharmacology UnitsSGS
 

Ähnlich wie SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes (20)

SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesSAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 
BDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitation
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
 
Data Mapping in Clinical Trials
Data Mapping in Clinical TrialsData Mapping in Clinical Trials
Data Mapping in Clinical Trials
 
Neo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and Biomarkers
 
Meaningful use stage 3 - Nalashaa capabilities
Meaningful use stage 3 - Nalashaa capabilitiesMeaningful use stage 3 - Nalashaa capabilities
Meaningful use stage 3 - Nalashaa capabilities
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data Sciences
 
Rich feeds for rescue, palms cyberinfrastructure integration stories
Rich feeds for rescue, palms cyberinfrastructure   integration storiesRich feeds for rescue, palms cyberinfrastructure   integration stories
Rich feeds for rescue, palms cyberinfrastructure integration stories
 
Overview of policies for security and data sharing
Overview of policies for security and data sharingOverview of policies for security and data sharing
Overview of policies for security and data sharing
 
Rich feeds for rescue an integration story
Rich feeds for rescue   an integration storyRich feeds for rescue   an integration story
Rich feeds for rescue an integration story
 
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science CentralCloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
 
2016 Standardization of Laboratory Test Coding - PHI Conference
2016 Standardization of Laboratory Test Coding - PHI Conference2016 Standardization of Laboratory Test Coding - PHI Conference
2016 Standardization of Laboratory Test Coding - PHI Conference
 
Ncicbiit
NcicbiitNcicbiit
Ncicbiit
 
CDISC & Risk Based Monitoring to Compress Clinical Trial Duration
CDISC & Risk Based Monitoring to Compress Clinical Trial DurationCDISC & Risk Based Monitoring to Compress Clinical Trial Duration
CDISC & Risk Based Monitoring to Compress Clinical Trial Duration
 
Meaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine researchMeaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine research
 
Evaluating Cloud vs On-Premises for NGS Clinical Workflows
Evaluating Cloud vs On-Premises for NGS Clinical WorkflowsEvaluating Cloud vs On-Premises for NGS Clinical Workflows
Evaluating Cloud vs On-Premises for NGS Clinical Workflows
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
 
Automating Clinical Trials in Pharmacology Units
Automating Clinical Trials in Pharmacology UnitsAutomating Clinical Trials in Pharmacology Units
Automating Clinical Trials in Pharmacology Units
 

Mehr von Muhammad Saleem

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...Muhammad Saleem
 
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...Muhammad Saleem
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationMuhammad Saleem
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework Muhammad Saleem
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Muhammad Saleem
 
Fine-grained Evaluation of SPARQL Endpoint Federation Systems
Fine-grained Evaluation of SPARQL Endpoint Federation SystemsFine-grained Evaluation of SPARQL Endpoint Federation Systems
Fine-grained Evaluation of SPARQL Endpoint Federation SystemsMuhammad Saleem
 
SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016Muhammad Saleem
 
LSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetLSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetMuhammad Saleem
 
FEASIBLE-Benchmark-Framework-ISWC2015
FEASIBLE-Benchmark-Framework-ISWC2015FEASIBLE-Benchmark-Framework-ISWC2015
FEASIBLE-Benchmark-Framework-ISWC2015Muhammad Saleem
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataMuhammad Saleem
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataMuhammad Saleem
 
Linked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseLinked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseMuhammad Saleem
 

Mehr von Muhammad Saleem (14)

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
 
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
 
LargeRDFBench
LargeRDFBenchLargeRDFBench
LargeRDFBench
 
Extended LargeRDFBench
Extended LargeRDFBenchExtended LargeRDFBench
Extended LargeRDFBench
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
 
Fine-grained Evaluation of SPARQL Endpoint Federation Systems
Fine-grained Evaluation of SPARQL Endpoint Federation SystemsFine-grained Evaluation of SPARQL Endpoint Federation Systems
Fine-grained Evaluation of SPARQL Endpoint Federation Systems
 
SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016
 
LSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetLSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries Dataset
 
FEASIBLE-Benchmark-Framework-ISWC2015
FEASIBLE-Benchmark-Framework-ISWC2015FEASIBLE-Benchmark-Framework-ISWC2015
FEASIBLE-Benchmark-Framework-ISWC2015
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
 
Linked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseLinked Cancer Genome Atlas Database
Linked Cancer Genome Atlas Database
 

Kürzlich hochgeladen

Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样wsppdmt
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schscnajjemba
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...gajnagarg
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxParas Gupta
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxchadhar227
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 

Kürzlich hochgeladen (20)

Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 

SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes

  • 1. SAFE Policy Aware SPARQL Query Federation Over RDF Data Cubes Muhammad Saleem AKSW, University of Leipzig, Germany 09/02/2015 Paper Presented in SWAT4LS-2014, Berlin, Germany
  • 2. Enabling networked knowledge 2 Linked2Safety 1. Secured, semantically-interconnecting electronic health records and clinical trials system 2. Advancing clinical research using Semantic Web technologies 3. Inter-connect clinical data from CHUV (Switzerland), ZEINCRO (Greece), CING (Cyprus)
  • 3. Enabling networked knowledge 3 Patients’ anonymity Data Ownership & Privacy  Anonymised Clinical Data cubes  Insensitive clinical parameters without personal information  Access-Control Based Query Federation Safety First – Linked2Safety Ethical & Legal Aspects
  • 4. Enabling networked knowledge 4 The Problem return number of patients that have been administered the drug Insulin and exhibit BMI > 25 and Hypertension and Diabetes as adverse events Switzerland Cyprus Greece
  • 5. Enabling networked knowledge 5 The Problem Access Policy Framework Oya Clinical Researcher Expertise – Diabetes Requested Data Input Input Denies AccessGrants Access Query execution plan is affected
  • 6. Enabling networked knowledge 6 SAFE - Secure SPARQL Query Federation CHUV (Data Cubes) ZEINCRO (Data Cubes) CING (Data Cubes) RDF Data Cubes Index RDF Data Cubes RDF Data Cubes Access Policy Model Source Selection Access Policy Filter Query re- Writer SPARQL Query Results
  • 7. Enabling networked knowledge 7 SPARQL Query + User Info SAFE Source Selection Access Policy Filtering Query Re- writing SELECT ?diabetes ?bmi ?hypertension ?cases WHERE { ?observation a qb:Observation . ?observation l2s-dim:Diabetes ?diabetes. ?observation l2s-dim:BMI ?bmi. ?observation l2s-dim:Hypertension ?hypertension. ?observation sdmx-measure:Cases ?cases. } Oya Clinical Researcher Expertise – Diabetes
  • 8. Enabling networked knowledge 9 SPARQL Query + User Info SAFE – Source Selection Source Selection Access Policy Filtering Query Re- writing Triples Patterns ?observation l2s-dim:Diabetes ?diabetes. ?observation l2s-dim:BMI ?bmi. ?observation l2s-dim:Hypertension ?hypertension. ?observation sdmx-measure:Cases ?cases. Capable Sources {S1, S2, S3, } {S1, S2, S3} {S1, S2, S3} {S1, S2, S3, S4} S1={ } S4={ Smoking, Gender, Cases } INDEX Diabetes, BMI, Hypertension, Cases S2={ }Diabetes, BMI, Hypertension, Cases Diabetes, S3={ }Diabetes, BMI, Hypertension, HIV, Cases S4 Join Awareness
  • 9. Enabling networked knowledge 10 Access Policy Framework SPARQL Query + User Info SAFE – Access Policy Triple Pattern- based Source Selection Access Policy Filtering Query Re- writing Oya Clinical Researcher Expertise – Diabetes Requested Data S1 S2 S3 Input Input Denies AccessGrants Access S1 S2 S3
  • 10. Enabling networked knowledge 11 AP1 type Access_Policy AP1 applies_to {S1, S2} AP1 grants_access Read AP1 assigned_to Oya SPARQL Query + User Info SAFE – Access Policy Source Selection Access Policy Filtering Query Re- writing Oya type User Oya haslocation Galway Oya hasPurpose Perform p-value analysis Oya hasRole Clinical Researcher Oya hasDomain Diabetes ASK WHERE { ?accessPolicy a AccessPolicy. ?accessPolicy appliesToNamedGraph S1. ?accessPolicy :rantsAccess acl:Read_l2s, ?accessPolicy hasUser Oya. } • SPARQL Query • Example Access Policy grantsAccess
  • 11. Enabling networked knowledge 12 SPARQL Query + User Info  Graph Information will be added to the query triples  SELECT …. WHERE { GRAPH <S1> { …. } }  SELECT …. WHERE { GRAPH <S2> { …. } }  Sub queries sent to relevant sources  S1  S2  Integration of results obtained from each sources SAFE – Query Rewriting Source Selection Access Policy Filtering Query Re- writing Diabetes BMI Hypertension Cases 0 0 0 40 1 0 1 50 0 1 1 120 1 1 1 90 S1 S2
  • 12. Enabling networked knowledge 13 Evaluation - DataSets Dataset # triples # obs # sub # pred # obj # size # index size # index generation time Internal Dataset CHUV 0.8 M 96 K 96 K 36 88 31 MB - - CING 0.1 M 17 K 17 K 21 51 5 MB - - ZEINCRO 0.4 M 49 K 49 K 24 59 15 MB - - Total 1.3 M 162 K 162 K 81 198 51 MB 8 KB 10 sec External Dataset World Bank 77 M 10 M 10 M 58 40 K 19 GB - - IMF 18 M 1.8 M 1.8 M 30 3151 3.51 GB - - Eurostat 0.3 M 38 K 44 K 31 5717 205 MB - - Trans. Int. 43 K 3939 4286 64 5290 9.2 MB - - Total 95 M 12 M 2 M 183 54 K 23 GB 12 KB 571 sec
  • 13. Enabling networked knowledge 14 Evaluation - Berlin SPARQL Benchmark Characteristics Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 # of Triple Patterns 9 7 9 16 7 8 11 10 7 7 3 7 # of Sources 3 4 4 3 4 3 3 4 3 3 3 3 # of Results 41 50 348 41 62 1983 5 10 1701 19656 570 41 Filters  > 9 Patterns        Negation  LIMIT Modifier     Order By Modifier    DISTINCT Modifier           REGEX Operator  UNION Operator 
  • 14. Enabling networked knowledge 15 Evaluation – Source Selection Systems Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Avg SAFE 8 10 13 16 15 13 15 16 7 7 9 7 11 FedX 9 13 16 24 20 14 16 19 15 17 9 16 16 Systems Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Avg SAFE 0 0 0 0 0 0 0 0 0 0 0 0 0 FedX 36 28 40 64 48 40 44 40 21 21 9 21 35 • Sum of triple-pattern-wise sources selected for each query • Number of SPARQL ASK requests used for source selection
  • 15. Enabling networked knowledge 16 Evaluation - Source Selection Time • Source Selection Time
  • 16. Enabling networked knowledge 17 Evaluation - Query Execution Time • Query Execution Time Query times-out for FedX
  • 17. Enabling networked knowledge 18  Source Selection  SPARQL SERVICE  Using SPARQL ASK queries  Using a catalog/index  SAFE - Hybrid (catalog/index + ASK)  Lightweight Cache – RDF Cube Data Structure – AccessPolicy  Join Aware  Excludes ineligible sources before actual query join  Provenance – via RDF Named Graphs  Self-contained Data Cubes  Creator  Location  Date  Access rights SAFE – Highlights
  • 18. Enabling networked knowledge 19 SAFE - Team • Yasar Khan • Muhammad Saleem • Aftab Iqbal • Muntazir Mehdi • Aidan Hogan • Panagiotis Hasapis • Axel-Cyrille Ngonga Ngomo • Stefan Decker • Ratnesh Sahay Thank You http://linked2safety.hcls.deri.org:8080/SAFE-Demo/