SlideShare ist ein Scribd-Unternehmen logo
1 von 91
Downloaden Sie, um offline zu lesen
Dependencies
Making Ontology Based Data Access Work in Practice
Mariano Rodriguez-Muro and Diego Calvanese
{rodriguez,calvanese}@inf.unibz.it
KRDB Research Centre
Free University of Bozen Bolzano
May 11, 2011
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 1 / 33
The context
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 2 / 33
DL Ontologies
Description Logics:
• Formalisms for knowledge representation.
• Decidable fragments of FOL
• Base of OWL
• World is described by means of Concepts and Roles
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 3 / 33
DL Ontologies
Description Logics:
• Formalisms for knowledge representation.
• Decidable fragments of FOL
• Base of OWL
• World is described by means of Concepts and Roles
Ontologies
• Intentional knowledge: TBox T .
• Extensional knowledge: ABox A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 3 / 33
OBDA with DL-Lite
A family of light-weight ontology languages
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
OBDA with DL-Lite
A family of light-weight ontology languages
• DL-LiteF concepts
B := A | ∃R
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
OBDA with DL-Lite
A family of light-weight ontology languages
• DL-LiteF concepts
B := A | ∃R
• DL-LiteF roles
R := P | P−
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
OBDA with DL-Lite
A family of light-weight ontology languages
• DL-LiteF concepts
B := A | ∃R
• DL-LiteF roles
R := P | P−
• DL-LiteF TBoxes
B B | B ¬B | (funct R)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
OBDA with DL-Lite
A family of light-weight ontology languages
• DL-LiteF concepts
B := A | ∃R
• DL-LiteF roles
R := P | P−
• DL-LiteF TBoxes
B B | B ¬B | (funct R)
• DL-LiteF ABoxes
A(a) | R(a, b)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
Query Answering
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
Query Answering
TBox:
Man Person, Woman Person, Person ∃hasFather,
∃hasFather−
Person
ABox:
Man(mariano)
Queries:
q(x) ← Person(x), hasFather(x, y), Person(y)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
Query Answering
TBox:
Man Person, Woman Person, Person ∃hasFather,
∃hasFather−
Person
ABox:
Man(mariano)
Queries:
q(x) ← Person(x), hasFather(x, y), Person(y)
Problem: Compute the certain answers of Q, denoted cert(Q, O).
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
Query Answering
TBox:
Man Person, Woman Person, Person ∃hasFather,
∃hasFather−
Person
ABox:
Man(mariano)
Queries:
q(x) ← Person(x), hasFather(x, y), Person(y)
Problem: Compute the certain answers of Q, denoted cert(Q, O).
The promise
We can do this as efficiently as answering DB queries, also in the virtual
setting.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
Query Answering with PerfectRef (2005)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
Query Answering with PerfectRef (2005)
Query:
q(x) ← Person(x), hasFather(x, y), Person(y)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
Query Answering with PerfectRef (2005)
Query:
q(x) ← Person(x), hasFather(x, y), Person(y)
Reformulation:
q(x) ← Person(x), hasFather(x, y), Person(y)
q(x) ← Person(x), hasFather(x, y), hasFather(z, y)
q(x) ← Person(x), hasFather(x, y)
q(x) ← Person(x), Person(x)
q(x) ← Person(x)
q(x) ← Person(x), hasFather(x, y), Man(y)
q(x) ← Person(x), hasFather(x, y), Woman(y)
q(x) ← hasFather(x, m), hasFather(x, y), Person(y)
q(x) ← hasFather(x, m), hasFather(x, y), hasFather(z, y)
q(x) ← hasFather(x, m), hasFather(x, y)
q(x) ← hasFather(x, m), Person(x)
q(x) ← hasFather(x, m), hasFather(x, t)
q(x) ← hasFather(x, m)
q(x) ← hasFather(x, m), hasFather(x, y), Man(y)Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
Alternatives
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Too many unions, cannot execute!.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Too many unions, cannot execute!.
• PRESTO (Rosati et al., 2010)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Too many unions, cannot execute!.
• PRESTO (Rosati et al., 2010)
Better, eventually it breaks.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Too many unions, cannot execute!.
• PRESTO (Rosati et al., 2010)
Better, eventually it breaks.
• Combined Approach (Kontchakov et. al., 2010)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
Alternatives
• Improved version of PerfectRef (2007-2011)
• RQR (Urbina et, al. 2007)
Too many unions, cannot execute!.
• PRESTO (Rosati et al., 2010)
Better, eventually it breaks.
• Combined Approach (Kontchakov et. al., 2010)
Fast. But too much data and too much time.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
What can we do?
?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 8 / 33
Query Answering
It is not only about existential constants
Query:
q(x, y) ← Person(x), hasFather(x, y), Person(y)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 9 / 33
Query Answering
It is not only about existential constants
Query:
q(x, y) ← Person(x), hasFather(x, y), Person(y)
Reformulation:
q(x, y) ← Person(x), hasFather(x, y), Person(y)
q(x, y) ← Person(x), hasFather(x, y), hasFather(z, y)
q(x, y) ← Person(x), hasFather(x, y), Man(y)
q(x, y) ← Person(x), hasFather(x, y), Woman(y)
q(x, y) ← hasFather(x, m), hasFather(x, y), Person(y)
q(x, y) ← hasFather(x, m), hasFather(x, y), hasFather(z, y)
q(x, y) ← hasFather(x, m), hasFather(x, y), Man(y)
q(x, y) ← hasFather(x, m), hasFather(x, y), Woman(y)
q(x, y) ← Man(x), hasFather(x, y), Person(y)
q(x, y) ← Man(x), hasFather(x, y), hasFather(z, y)
q(x, y) ← Man(x), hasFather(x, y), Man(y)
q(x, y) ← Man(x), hasFather(x, y), Woman(y)
q(x, y) ← Woman(x), hasFather(x, y), Person(y)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 9 / 33
The full picture: Ontology Based Data
Access
SourceUser Source
User
Queries
Ontology
Mappings
Source
To deal with OBDA we need to consider:
• If in the backend we have RDBMSs, we cannot go beyond their
capabilities.
• All systems are composed by T , D = R, I , M.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 10 / 33
First Observation
Is my data complete?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
In any realistic scenario:
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
In any realistic scenario:
• We don’t use arbitrary sources;
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
In any realistic scenario:
• We don’t use arbitrary sources;
• Intersection of semantics is reflected in completeness (e.g., no need to
chase, expand or rewrite)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
In any realistic scenario:
• We don’t use arbitrary sources;
• Intersection of semantics is reflected in completeness (e.g., no need to
chase, expand or rewrite)
• This happens a lot!
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
First Observation
Is my data complete?
Completeness of A
The TBox sais: Manager Employee
In the ABox: all Managers are already employees.
In any realistic scenario:
• We don’t use arbitrary sources;
• Intersection of semantics is reflected in completeness (e.g., no need to
chase, expand or rewrite)
• This happens a lot!
Keyword
Redundancy
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
Second Observation
There are no ABoxes
THERE ARE NO ABOXES!
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 12 / 33
Second Observation
There are no ABoxes
THERE ARE NO ABOXES!
Any Ontology based query answering systems today:
• Uses relational DBs to store the ABox data;
• In such D, both, R and I can be manipulated;
• Implementors may choose any M for their system;
Opportunity
To complete an ABox we can do more than expansion.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 12 / 33
How to approach the problem
Two level approach
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
How to approach the problem
Two level approach
How to approach OBDA in practice?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
How to approach the problem
Two level approach
How to approach OBDA in practice?
• Efficient ways to deal with redundancy due to completeness.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
How to approach the problem
Two level approach
How to approach OBDA in practice?
• Efficient ways to deal with redundancy due to completeness.
• Efficient ways to complete (virtual) ABoxes.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
Contributions
Dealing with redundancy
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 14 / 33
Characterizing completeness
ABox Dependencies
Definition
An assertion B A B that restricts valid ABoxes.
Syntax B2 A B2
Semantics: A |= Manager A Employee if Manager(x)∈ A implies
Employee(x)∈ A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 15 / 33
Characterizing completeness
ABox Dependencies
Definition
An assertion B A B that restricts valid ABoxes.
Syntax B2 A B2
Semantics: A |= Manager A Employee if Manager(x)∈ A implies
Employee(x)∈ A.
ABox dependencies are fundamentally different than TBox assertions.
Think open world
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 15 / 33
Where to deal with redundancy?
Given a TBox T , an ABox A, a set of dependencies Σ and a query Q,
what do we do?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
Where to deal with redundancy?
Given a TBox T , an ABox A, a set of dependencies Σ and a query Q,
what do we do?
Available Options:
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
Where to deal with redundancy?
Given a TBox T , an ABox A, a set of dependencies Σ and a query Q,
what do we do?
Available Options:
• Optimize the query reformulation algorithm to deal with Σ.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
Where to deal with redundancy?
Given a TBox T , an ABox A, a set of dependencies Σ and a query Q,
what do we do?
Available Options:
• Optimize the query reformulation algorithm to deal with Σ.
• Optimize the TBox T with respect to Σ.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
When is an assertion redundant?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
When is an assertion redundant?
Direct Redundancy: Case 1
Let T be implied the following
hierarchy:
∃hasFather
Person
Human
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
When is an assertion redundant?
Direct Redundancy: Case 1
Let T be implied the following
hierarchy:
∃hasFather
Person
Human
Redundant if Σ is:
∃hasFather
Person
Human
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
When is an assertion redundant?
Direct Redundancy: Case 1
Let T be implied the following
hierarchy:
∃hasFather
Person
Human
Redundant if Σ is:
∃hasFather
Person
Human
Σ sais hasFather(mariano, ramon) ∈ A → Human(mariano) ∈ A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
When is an assertion redundant?
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
When is an assertion redundant?
Direct Redundancy: Case 2
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
When is an assertion redundant?
Direct Redundancy: Case 2
Let T be the following TBox:
Person
∃hasFather−
∃hasFather
Man
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
When is an assertion redundant?
Direct Redundancy: Case 2
Let T be the following TBox:
Person
∃hasFather−
∃hasFather
Man
Redundant if Σ is:
Person
∃hasFather−
∃hasFather
Man
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
When is an assertion redundant?
Direct Redundancy: Case 2
Let T be the following TBox:
Person
∃hasFather−
∃hasFather
Man
Redundant if Σ is:
Person
∃hasFather−
∃hasFather
Man
Σ sais Man(ramon) ∈ A → ∃a | hasFather(ramon, a ) ∧ Person(a ) ∈ A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
When is an assertion redundant?
Indirect Redundancy
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
When is an assertion redundant?
Indirect Redundancy
Let T be the following TBox:
Animal
Man Human
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
When is an assertion redundant?
Indirect Redundancy
Let T be the following TBox:
Animal
Man Human
Redundant if Σ is:
Animal
Man Human
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
When is an assertion redundant?
Indirect Redundancy
Let T be the following TBox:
Animal
Man Human
Redundant if Σ is:
Animal
Man Human
Σ sais Man(mariano) ∈ A then Animal(mariano) ∈ A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
Formalization: Redundancy
Given a TBox T and a set of dependencies Σ over T , the optimized version
of T w.r.t. Σ, denoted optim(T , Σ), is the set of inclusion assertions
{α ∈ sat(T ) | α is not redundant in sat(T ) w.r.t. sat(Σ)}
We can compute optim(T , Σ) in linear time.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 20 / 33
Contributions
Completing ABoxes
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 21 / 33
General considerations
OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with
D = R, I .
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
General considerations
OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with
D = R, I .
If we that V |= A A B, we check make sure that mappings for B include
all the data coming from the mappings of A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
General considerations
OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with
D = R, I .
If we that V |= A A B, we check make sure that mappings for B include
all the data coming from the mappings of A.
Trade-off:
• Degree of completeness (# of dependencies),
• Cost of the procedure
• Performance of Query answering.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
General considerations
OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with
D = R, I .
If we that V |= A A B, we check make sure that mappings for B include
all the data coming from the mappings of A.
Trade-off:
• Degree of completeness (# of dependencies),
• Cost of the procedure
• Performance of Query answering.
We can complete virtual ABoxes up to B ∃R without the need for new
data.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
Semantic Index for OBDA
General Idea
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
Semantic Index for OBDA
General Idea
• To encode the semantics of T in numeric indexes and ranges for
concept names and roles.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
Semantic Index for OBDA
General Idea
• To encode the semantics of T in numeric indexes and ranges for
concept names and roles.
• Store the ABox in the database using those indexes and ranges.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
Semantic Index for OBDA
General Idea
• To encode the semantics of T in numeric indexes and ranges for
concept names and roles.
• Store the ABox in the database using those indexes and ranges.
• Make mappings for the system that take the ranges into account.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
Semantic Index for OBDA
General Idea
• To encode the semantics of T in numeric indexes and ranges for
concept names and roles.
• Store the ABox in the database using those indexes and ranges.
• Make mappings for the system that take the ranges into account.
We can do this by using the implied hierarchy of T to generate the index
and ranges!
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
Semantic Index Example
T = {B A, C A, C D}
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Semantic Index Example
T = {B A, C A, C D}
A
B C
D
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Semantic Index Example
T = {B A, C A, C D}
1
A
B
2
C
3
4
D
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Semantic Index Example
T = {B A, C A, C D}
1
A
B
2
C
3
4
D
We create a table TC with constant and idx columns. To insert the data
we use the indexes. e.g., B(mariano) ∈ A then we put (mariano, 2) ∈ TC
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Semantic Index Example
T = {B A, C A, C D}
1, {(1, 3)}
A
B
2, {(2, 2)}
C
3, {(3, 3)}
4, {(3, 4)}
D
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Semantic Index Example
T = {B A, C A, C D}
1, {(1, 3)}
A
B
2, {(2, 2)}
C
3, {(3, 3)}
4, {(3, 4)}
D
We create the mappings using the ranges, e.g., SELECT constant
FROM TC WHERE IDX ≥ 1 AND IDX ≤ 3; A(constant)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
Experimentation I
The Resource Index features:
• Search over 22 document collections
• Semantics given by the hierarchies of 200 ontologies (SNOMED, GO)
Implementation in a nutshell:
(i) Understand documents with natural language processing and
annotate
Cervical Cancer( doc224 )
(ii) Expand the ABox
(iii) Pose queries that retrieve documents as
q(x) ← A1(x) ∧ · · · ∧ An(x)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 25 / 33
Experimentation II
The challenge:
• ≈ 3 million concepts and ≈ 2.5 million is-a assertions
• Split second responses
• 150 GB of data
• Expansion data: 1.5 TB
The experimentation data:
• Clinical Trials.gov (CT)
• 181 million assertion (≈ 14 GB of data, ≈ 140 GB when expanded.)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 26 / 33
Results
The query:
q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x)
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 27 / 33
Results
The query:
q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x)
Results:
• Traditional reformulation: Union of 467874 SQL SPJ queries;
• Semantic Index: 1 SQL; execution 3.582s (0.082s if warm); Time
to compute semantic index: 1 min; Size of data: +≈ 4 GB.
• ABox expansion: 1 SQL; executing 3s (0.6s if warm); Expansion
time ≈ 7 days; Size of data +≈ 126 GB.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 27 / 33
The Query
The query:
q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x)
SELECT DISTINCT r0.element_id as element_id
FROM
RESOURCE_INDEX.CT_ANN r0 JOIN RESOURCE_INDEX.CT_ANN r1
ON r0.element_id = r1.element_id
JOIN RESOURCE_INDEX.CT_ANN r2
ON r1.element_id = r2.element_id
WHERE
((r0.idx >= 1783559 AND r0.idx <= 1783657)) AND
((r1.idx >= 1782996 AND r1.idx <= 1783029)) AND
((r2.idx >= 1783115 AND r2.idx <= 1783253));
Standard SQL query efficient in ANY DBMS.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 28 / 33
Conclusions
Contributions
• We indicated that efficient OBDA requires to take into account more
than only T , A and Q.
• Provided means to deal with redundancy at the level of the TBox.
• We showed that expansion is not necessary that we can complete
ABoxes.
• We presented to efficient ways to complete ABoxes, one for the
general OBDA setting and one for the virtual setting.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 29 / 33
Conclusions
Contributions
• We indicated that efficient OBDA requires to take into account more
than only T , A and Q.
• Provided means to deal with redundancy at the level of the TBox.
• We showed that expansion is not necessary that we can complete
ABoxes.
• We presented to efficient ways to complete ABoxes, one for the
general OBDA setting and one for the virtual setting.
Future work
• Exploring more expressive languages.
• Exploring the RDFS/SPARQL setting.
• Handling updates of T and A.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 29 / 33
Extra examples
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 30 / 33
First Observation (cont.)
Mappings will introduce dependencies over ABoxes
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
First Observation (cont.)
Mappings will introduce dependencies over ABoxes
Let R be a DB schema with the relation schema employee with attributes
id, dept, and salary. Let M be the following mappings:
SELECT id,dept FROM employee ;q(id, dept) ← Employee(id) ∧
WORKS-FOR(id, dept)
SELECT id,dept FROM employee
WHERE salary > 1000
;q(id, dept) ← Manager(id)∧
MANAGES(id, dept)
Then for any instance I, if Manager(John) ∈ A we have that
Employee(John).
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
First Observation (cont.)
Mappings will introduce dependencies over ABoxes
Let R be a DB schema with the relation schema employee with attributes
id, dept, and salary. Let M be the following mappings:
SELECT id,dept FROM employee ;q(id, dept) ← Employee(id) ∧
WORKS-FOR(id, dept)
SELECT id,dept FROM employee
WHERE salary > 1000
;q(id, dept) ← Manager(id)∧
MANAGES(id, dept)
Then for any instance I, if Manager(John) ∈ A we have that
Employee(John).
This is an indicator of completeness of all ABoxes A for M and R, e.g., A
is complete w.r.t. Manager A Employee.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
Formalization: Chains
Let T be a TBox, B, C basic concepts, and Σ a set of dependencies over
T . A T -chain from B to C in T (resp., a Σ-chain from B to C in Σ) is a
sequence of concept inclusion assertions (Bi Bi )n
i=0 in T (resp., a
sequence of inclusion dependencies (Bi A Bi )n
i=0 in Σ), for some n ≥ 0,
such that:
1 B0 = B, Bn = C, and
2 for 1 ≤ i ≤ n, we have that Bi−1 and Bi are basic concepts s.t., either
(i) Bi−1 = Bi , or
(ii) Bi−1 = ∃R and Bi = ∃R−
, for some basic role R.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 32 / 33
Formalization: Redundancy
Let T be a TBox, B, C basic concepts, and Σ a set of dependencies. The
concept inclusion assertion B C is directly redundant in T w.r.t. Σ if
(i) Σ |= B A C and
(ii) for every T -chain (Bi Bi )n
i=0 with Bn = B in T , there is a Σ-chain
(Bi A Bi )n
i=0.
Then, B C is redundant in T w.r.t. Σ if
(a) it is directly redundant, or
(b) there exists B = B s.t.
(i) T |= B C,
(ii) B C is not redundant in T w.r.t. Σ, and
(iii) B B is directly redundant in T w.r.t. Σ.
Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 33 / 33

Weitere ähnliche Inhalte

Andere mochten auch

Mmarjori da oliverდდ
Mmarjori da oliverდდMmarjori da oliverდდ
Mmarjori da oliverდდninolip
 
Photo album
Photo albumPhoto album
Photo albumninolip
 
Sustainable Water Use and Infrastructure
Sustainable Water Use and InfrastructureSustainable Water Use and Infrastructure
Sustainable Water Use and InfrastructureKEvens
 
Fin de curso. CEIP Isaac Peral. Xuño 2014
Fin de curso. CEIP Isaac Peral. Xuño 2014Fin de curso. CEIP Isaac Peral. Xuño 2014
Fin de curso. CEIP Isaac Peral. Xuño 2014isaacperalferrol
 
SWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSSWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSMariano Rodriguez-Muro
 
Alumnado curso 2015 - 2016 - CEIP Isaac Peral
Alumnado curso 2015 - 2016 - CEIP Isaac PeralAlumnado curso 2015 - 2016 - CEIP Isaac Peral
Alumnado curso 2015 - 2016 - CEIP Isaac Peralisaacperalferrol
 

Andere mochten auch (15)

A nosa ludoteca
A nosa ludotecaA nosa ludoteca
A nosa ludoteca
 
Frases anton cortizas
Frases anton cortizasFrases anton cortizas
Frases anton cortizas
 
So i got an Arduino now what
So i got an Arduino now whatSo i got an Arduino now what
So i got an Arduino now what
 
Postais nadal 2013
Postais nadal 2013Postais nadal 2013
Postais nadal 2013
 
Mmarjori da oliverდდ
Mmarjori da oliverდდMmarjori da oliverდდ
Mmarjori da oliverდდ
 
Postales nadal 2012
Postales nadal 2012Postales nadal 2012
Postales nadal 2012
 
Photo album
Photo albumPhoto album
Photo album
 
Worms
WormsWorms
Worms
 
Sustainable Water Use and Infrastructure
Sustainable Water Use and InfrastructureSustainable Water Use and Infrastructure
Sustainable Water Use and Infrastructure
 
Cynefin, Kanban and Crash Test Dummies
Cynefin, Kanban and Crash Test DummiesCynefin, Kanban and Crash Test Dummies
Cynefin, Kanban and Crash Test Dummies
 
Fin de curso. CEIP Isaac Peral. Xuño 2014
Fin de curso. CEIP Isaac Peral. Xuño 2014Fin de curso. CEIP Isaac Peral. Xuño 2014
Fin de curso. CEIP Isaac Peral. Xuño 2014
 
SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2
 
SWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSSWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFS
 
SWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQLSWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQL
 
Alumnado curso 2015 - 2016 - CEIP Isaac Peral
Alumnado curso 2015 - 2016 - CEIP Isaac PeralAlumnado curso 2015 - 2016 - CEIP Isaac Peral
Alumnado curso 2015 - 2016 - CEIP Isaac Peral
 

Ähnlich wie AMW'11 dependencies-sem index-t-mappings

A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...Marko Rodriguez
 
Towards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebTowards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebJie Bao
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupalemmanuel_jamin
 
Exploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesExploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesXiaogang (Marshall) Ma
 
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologyFranz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologytaxonbytes
 
Ecdl2007ecologicallyinfluenced Ss
Ecdl2007ecologicallyinfluenced SsEcdl2007ecologicallyinfluenced Ss
Ecdl2007ecologicallyinfluenced SsR. John Robertson
 
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Seth Grimes
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...Seth Grimes
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextEric Kansa
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Cornelius Puschmann
 
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...taxonbytes
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1manujam
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...Armin Haller
 
Towards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsTowards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsRichard Littauer
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesPrateek Jain
 
Teaching & Learning with Technology TLT 2016
Teaching & Learning with Technology TLT 2016Teaching & Learning with Technology TLT 2016
Teaching & Learning with Technology TLT 2016Roy Clariana
 
Sociotechnical Architecture
Sociotechnical ArchitectureSociotechnical Architecture
Sociotechnical ArchitectureFlorian Stefan
 

Ähnlich wie AMW'11 dependencies-sem index-t-mappings (20)

Some Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBASome Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBA
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
 
Towards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebTowards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic Web
 
On Unified Stream Reasoning
On Unified Stream ReasoningOn Unified Stream Reasoning
On Unified Stream Reasoning
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupal
 
Exploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental SciencesExploring the Web of Data for Earth and Environmental Sciences
Exploring the Web of Data for Earth and Environmental Sciences
 
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologyFranz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
 
Ecdl2007ecologicallyinfluenced Ss
Ecdl2007ecologicallyinfluenced SsEcdl2007ecologicallyinfluenced Ss
Ecdl2007ecologicallyinfluenced Ss
 
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open Context
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)
 
Recommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenuRecommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenu
 
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
Towards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsTowards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in Linguistics
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
 
Teaching & Learning with Technology TLT 2016
Teaching & Learning with Technology TLT 2016Teaching & Learning with Technology TLT 2016
Teaching & Learning with Technology TLT 2016
 
Sociotechnical Architecture
Sociotechnical ArchitectureSociotechnical Architecture
Sociotechnical Architecture
 

Mehr von Mariano Rodriguez-Muro

SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingMariano Rodriguez-Muro
 
SWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaSWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaMariano Rodriguez-Muro
 
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsSWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsMariano Rodriguez-Muro
 
SWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLSWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLMariano Rodriguez-Muro
 
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...Mariano Rodriguez-Muro
 

Mehr von Mariano Rodriguez-Muro (20)

SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
SWT Lab 3
SWT Lab 3SWT Lab 3
SWT Lab 3
 
SWT Lab 5
SWT Lab 5SWT Lab 5
SWT Lab 5
 
SWT Lab 2
SWT Lab 2SWT Lab 2
SWT Lab 2
 
SWT Lab 1
SWT Lab 1SWT Lab 1
SWT Lab 1
 
SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1
 
SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mapping
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
SWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaSWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jena
 
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsSWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
 
SWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFSSWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFS
 
SWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLSWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQL
 
SWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - SesameSWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - Sesame
 
7 advanced uses of rdfs
7 advanced uses of rdfs7 advanced uses of rdfs
7 advanced uses of rdfs
 
5 rdfs
5 rdfs5 rdfs
5 rdfs
 
4 sw architectures and sparql
4 sw architectures and sparql4 sw architectures and sparql
4 sw architectures and sparql
 
4 sesame
4 sesame4 sesame
4 sesame
 
SWT Lecture Session 1 - Introduction
SWT Lecture Session 1 - IntroductionSWT Lecture Session 1 - Introduction
SWT Lecture Session 1 - Introduction
 
ontop: A tutorial
ontop: A tutorialontop: A tutorial
ontop: A tutorial
 
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
 

Kürzlich hochgeladen

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

AMW'11 dependencies-sem index-t-mappings

  • 1. Dependencies Making Ontology Based Data Access Work in Practice Mariano Rodriguez-Muro and Diego Calvanese {rodriguez,calvanese}@inf.unibz.it KRDB Research Centre Free University of Bozen Bolzano May 11, 2011 Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 1 / 33
  • 2. The context Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 2 / 33
  • 3. DL Ontologies Description Logics: • Formalisms for knowledge representation. • Decidable fragments of FOL • Base of OWL • World is described by means of Concepts and Roles Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 3 / 33
  • 4. DL Ontologies Description Logics: • Formalisms for knowledge representation. • Decidable fragments of FOL • Base of OWL • World is described by means of Concepts and Roles Ontologies • Intentional knowledge: TBox T . • Extensional knowledge: ABox A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 3 / 33
  • 5. OBDA with DL-Lite A family of light-weight ontology languages Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
  • 6. OBDA with DL-Lite A family of light-weight ontology languages • DL-LiteF concepts B := A | ∃R Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
  • 7. OBDA with DL-Lite A family of light-weight ontology languages • DL-LiteF concepts B := A | ∃R • DL-LiteF roles R := P | P− Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
  • 8. OBDA with DL-Lite A family of light-weight ontology languages • DL-LiteF concepts B := A | ∃R • DL-LiteF roles R := P | P− • DL-LiteF TBoxes B B | B ¬B | (funct R) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
  • 9. OBDA with DL-Lite A family of light-weight ontology languages • DL-LiteF concepts B := A | ∃R • DL-LiteF roles R := P | P− • DL-LiteF TBoxes B B | B ¬B | (funct R) • DL-LiteF ABoxes A(a) | R(a, b) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 4 / 33
  • 10. Query Answering Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
  • 11. Query Answering TBox: Man Person, Woman Person, Person ∃hasFather, ∃hasFather− Person ABox: Man(mariano) Queries: q(x) ← Person(x), hasFather(x, y), Person(y) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
  • 12. Query Answering TBox: Man Person, Woman Person, Person ∃hasFather, ∃hasFather− Person ABox: Man(mariano) Queries: q(x) ← Person(x), hasFather(x, y), Person(y) Problem: Compute the certain answers of Q, denoted cert(Q, O). Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
  • 13. Query Answering TBox: Man Person, Woman Person, Person ∃hasFather, ∃hasFather− Person ABox: Man(mariano) Queries: q(x) ← Person(x), hasFather(x, y), Person(y) Problem: Compute the certain answers of Q, denoted cert(Q, O). The promise We can do this as efficiently as answering DB queries, also in the virtual setting. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 5 / 33
  • 14. Query Answering with PerfectRef (2005) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
  • 15. Query Answering with PerfectRef (2005) Query: q(x) ← Person(x), hasFather(x, y), Person(y) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
  • 16. Query Answering with PerfectRef (2005) Query: q(x) ← Person(x), hasFather(x, y), Person(y) Reformulation: q(x) ← Person(x), hasFather(x, y), Person(y) q(x) ← Person(x), hasFather(x, y), hasFather(z, y) q(x) ← Person(x), hasFather(x, y) q(x) ← Person(x), Person(x) q(x) ← Person(x) q(x) ← Person(x), hasFather(x, y), Man(y) q(x) ← Person(x), hasFather(x, y), Woman(y) q(x) ← hasFather(x, m), hasFather(x, y), Person(y) q(x) ← hasFather(x, m), hasFather(x, y), hasFather(z, y) q(x) ← hasFather(x, m), hasFather(x, y) q(x) ← hasFather(x, m), Person(x) q(x) ← hasFather(x, m), hasFather(x, t) q(x) ← hasFather(x, m) q(x) ← hasFather(x, m), hasFather(x, y), Man(y)Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 6 / 33
  • 17. Alternatives Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 18. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 19. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Too many unions, cannot execute!. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 20. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Too many unions, cannot execute!. • PRESTO (Rosati et al., 2010) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 21. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Too many unions, cannot execute!. • PRESTO (Rosati et al., 2010) Better, eventually it breaks. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 22. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Too many unions, cannot execute!. • PRESTO (Rosati et al., 2010) Better, eventually it breaks. • Combined Approach (Kontchakov et. al., 2010) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 23. Alternatives • Improved version of PerfectRef (2007-2011) • RQR (Urbina et, al. 2007) Too many unions, cannot execute!. • PRESTO (Rosati et al., 2010) Better, eventually it breaks. • Combined Approach (Kontchakov et. al., 2010) Fast. But too much data and too much time. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 7 / 33
  • 24. What can we do? ? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 8 / 33
  • 25. Query Answering It is not only about existential constants Query: q(x, y) ← Person(x), hasFather(x, y), Person(y) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 9 / 33
  • 26. Query Answering It is not only about existential constants Query: q(x, y) ← Person(x), hasFather(x, y), Person(y) Reformulation: q(x, y) ← Person(x), hasFather(x, y), Person(y) q(x, y) ← Person(x), hasFather(x, y), hasFather(z, y) q(x, y) ← Person(x), hasFather(x, y), Man(y) q(x, y) ← Person(x), hasFather(x, y), Woman(y) q(x, y) ← hasFather(x, m), hasFather(x, y), Person(y) q(x, y) ← hasFather(x, m), hasFather(x, y), hasFather(z, y) q(x, y) ← hasFather(x, m), hasFather(x, y), Man(y) q(x, y) ← hasFather(x, m), hasFather(x, y), Woman(y) q(x, y) ← Man(x), hasFather(x, y), Person(y) q(x, y) ← Man(x), hasFather(x, y), hasFather(z, y) q(x, y) ← Man(x), hasFather(x, y), Man(y) q(x, y) ← Man(x), hasFather(x, y), Woman(y) q(x, y) ← Woman(x), hasFather(x, y), Person(y) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 9 / 33
  • 27. The full picture: Ontology Based Data Access SourceUser Source User Queries Ontology Mappings Source To deal with OBDA we need to consider: • If in the backend we have RDBMSs, we cannot go beyond their capabilities. • All systems are composed by T , D = R, I , M. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 10 / 33
  • 28. First Observation Is my data complete? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 29. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 30. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 31. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. In any realistic scenario: Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 32. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. In any realistic scenario: • We don’t use arbitrary sources; Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 33. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. In any realistic scenario: • We don’t use arbitrary sources; • Intersection of semantics is reflected in completeness (e.g., no need to chase, expand or rewrite) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 34. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. In any realistic scenario: • We don’t use arbitrary sources; • Intersection of semantics is reflected in completeness (e.g., no need to chase, expand or rewrite) • This happens a lot! Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 35. First Observation Is my data complete? Completeness of A The TBox sais: Manager Employee In the ABox: all Managers are already employees. In any realistic scenario: • We don’t use arbitrary sources; • Intersection of semantics is reflected in completeness (e.g., no need to chase, expand or rewrite) • This happens a lot! Keyword Redundancy Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 11 / 33
  • 36. Second Observation There are no ABoxes THERE ARE NO ABOXES! Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 12 / 33
  • 37. Second Observation There are no ABoxes THERE ARE NO ABOXES! Any Ontology based query answering systems today: • Uses relational DBs to store the ABox data; • In such D, both, R and I can be manipulated; • Implementors may choose any M for their system; Opportunity To complete an ABox we can do more than expansion. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 12 / 33
  • 38. How to approach the problem Two level approach Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
  • 39. How to approach the problem Two level approach How to approach OBDA in practice? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
  • 40. How to approach the problem Two level approach How to approach OBDA in practice? • Efficient ways to deal with redundancy due to completeness. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
  • 41. How to approach the problem Two level approach How to approach OBDA in practice? • Efficient ways to deal with redundancy due to completeness. • Efficient ways to complete (virtual) ABoxes. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 13 / 33
  • 42. Contributions Dealing with redundancy Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 14 / 33
  • 43. Characterizing completeness ABox Dependencies Definition An assertion B A B that restricts valid ABoxes. Syntax B2 A B2 Semantics: A |= Manager A Employee if Manager(x)∈ A implies Employee(x)∈ A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 15 / 33
  • 44. Characterizing completeness ABox Dependencies Definition An assertion B A B that restricts valid ABoxes. Syntax B2 A B2 Semantics: A |= Manager A Employee if Manager(x)∈ A implies Employee(x)∈ A. ABox dependencies are fundamentally different than TBox assertions. Think open world Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 15 / 33
  • 45. Where to deal with redundancy? Given a TBox T , an ABox A, a set of dependencies Σ and a query Q, what do we do? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
  • 46. Where to deal with redundancy? Given a TBox T , an ABox A, a set of dependencies Σ and a query Q, what do we do? Available Options: Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
  • 47. Where to deal with redundancy? Given a TBox T , an ABox A, a set of dependencies Σ and a query Q, what do we do? Available Options: • Optimize the query reformulation algorithm to deal with Σ. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
  • 48. Where to deal with redundancy? Given a TBox T , an ABox A, a set of dependencies Σ and a query Q, what do we do? Available Options: • Optimize the query reformulation algorithm to deal with Σ. • Optimize the TBox T with respect to Σ. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 16 / 33
  • 49. When is an assertion redundant? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
  • 50. When is an assertion redundant? Direct Redundancy: Case 1 Let T be implied the following hierarchy: ∃hasFather Person Human Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
  • 51. When is an assertion redundant? Direct Redundancy: Case 1 Let T be implied the following hierarchy: ∃hasFather Person Human Redundant if Σ is: ∃hasFather Person Human Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
  • 52. When is an assertion redundant? Direct Redundancy: Case 1 Let T be implied the following hierarchy: ∃hasFather Person Human Redundant if Σ is: ∃hasFather Person Human Σ sais hasFather(mariano, ramon) ∈ A → Human(mariano) ∈ A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 17 / 33
  • 53. When is an assertion redundant? Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
  • 54. When is an assertion redundant? Direct Redundancy: Case 2 Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
  • 55. When is an assertion redundant? Direct Redundancy: Case 2 Let T be the following TBox: Person ∃hasFather− ∃hasFather Man Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
  • 56. When is an assertion redundant? Direct Redundancy: Case 2 Let T be the following TBox: Person ∃hasFather− ∃hasFather Man Redundant if Σ is: Person ∃hasFather− ∃hasFather Man Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
  • 57. When is an assertion redundant? Direct Redundancy: Case 2 Let T be the following TBox: Person ∃hasFather− ∃hasFather Man Redundant if Σ is: Person ∃hasFather− ∃hasFather Man Σ sais Man(ramon) ∈ A → ∃a | hasFather(ramon, a ) ∧ Person(a ) ∈ A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 18 / 33
  • 58. When is an assertion redundant? Indirect Redundancy Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
  • 59. When is an assertion redundant? Indirect Redundancy Let T be the following TBox: Animal Man Human Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
  • 60. When is an assertion redundant? Indirect Redundancy Let T be the following TBox: Animal Man Human Redundant if Σ is: Animal Man Human Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
  • 61. When is an assertion redundant? Indirect Redundancy Let T be the following TBox: Animal Man Human Redundant if Σ is: Animal Man Human Σ sais Man(mariano) ∈ A then Animal(mariano) ∈ A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 19 / 33
  • 62. Formalization: Redundancy Given a TBox T and a set of dependencies Σ over T , the optimized version of T w.r.t. Σ, denoted optim(T , Σ), is the set of inclusion assertions {α ∈ sat(T ) | α is not redundant in sat(T ) w.r.t. sat(Σ)} We can compute optim(T , Σ) in linear time. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 20 / 33
  • 63. Contributions Completing ABoxes Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 21 / 33
  • 64. General considerations OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with D = R, I . Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
  • 65. General considerations OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with D = R, I . If we that V |= A A B, we check make sure that mappings for B include all the data coming from the mappings of A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
  • 66. General considerations OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with D = R, I . If we that V |= A A B, we check make sure that mappings for B include all the data coming from the mappings of A. Trade-off: • Degree of completeness (# of dependencies), • Cost of the procedure • Performance of Query answering. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
  • 67. General considerations OBDA systems have no ABoxes, instead virtual ABoxes V = D, M with D = R, I . If we that V |= A A B, we check make sure that mappings for B include all the data coming from the mappings of A. Trade-off: • Degree of completeness (# of dependencies), • Cost of the procedure • Performance of Query answering. We can complete virtual ABoxes up to B ∃R without the need for new data. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 22 / 33
  • 68. Semantic Index for OBDA General Idea Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
  • 69. Semantic Index for OBDA General Idea • To encode the semantics of T in numeric indexes and ranges for concept names and roles. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
  • 70. Semantic Index for OBDA General Idea • To encode the semantics of T in numeric indexes and ranges for concept names and roles. • Store the ABox in the database using those indexes and ranges. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
  • 71. Semantic Index for OBDA General Idea • To encode the semantics of T in numeric indexes and ranges for concept names and roles. • Store the ABox in the database using those indexes and ranges. • Make mappings for the system that take the ranges into account. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
  • 72. Semantic Index for OBDA General Idea • To encode the semantics of T in numeric indexes and ranges for concept names and roles. • Store the ABox in the database using those indexes and ranges. • Make mappings for the system that take the ranges into account. We can do this by using the implied hierarchy of T to generate the index and ranges! Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 23 / 33
  • 73. Semantic Index Example T = {B A, C A, C D} Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 74. Semantic Index Example T = {B A, C A, C D} A B C D Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 75. Semantic Index Example T = {B A, C A, C D} 1 A B 2 C 3 4 D Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 76. Semantic Index Example T = {B A, C A, C D} 1 A B 2 C 3 4 D We create a table TC with constant and idx columns. To insert the data we use the indexes. e.g., B(mariano) ∈ A then we put (mariano, 2) ∈ TC Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 77. Semantic Index Example T = {B A, C A, C D} 1, {(1, 3)} A B 2, {(2, 2)} C 3, {(3, 3)} 4, {(3, 4)} D Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 78. Semantic Index Example T = {B A, C A, C D} 1, {(1, 3)} A B 2, {(2, 2)} C 3, {(3, 3)} 4, {(3, 4)} D We create the mappings using the ranges, e.g., SELECT constant FROM TC WHERE IDX ≥ 1 AND IDX ≤ 3; A(constant) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 24 / 33
  • 79. Experimentation I The Resource Index features: • Search over 22 document collections • Semantics given by the hierarchies of 200 ontologies (SNOMED, GO) Implementation in a nutshell: (i) Understand documents with natural language processing and annotate Cervical Cancer( doc224 ) (ii) Expand the ABox (iii) Pose queries that retrieve documents as q(x) ← A1(x) ∧ · · · ∧ An(x) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 25 / 33
  • 80. Experimentation II The challenge: • ≈ 3 million concepts and ≈ 2.5 million is-a assertions • Split second responses • 150 GB of data • Expansion data: 1.5 TB The experimentation data: • Clinical Trials.gov (CT) • 181 million assertion (≈ 14 GB of data, ≈ 140 GB when expanded.) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 26 / 33
  • 81. Results The query: q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x) Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 27 / 33
  • 82. Results The query: q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x) Results: • Traditional reformulation: Union of 467874 SQL SPJ queries; • Semantic Index: 1 SQL; execution 3.582s (0.082s if warm); Time to compute semantic index: 1 min; Size of data: +≈ 4 GB. • ABox expansion: 1 SQL; executing 3s (0.6s if warm); Expansion time ≈ 7 days; Size of data +≈ 126 GB. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 27 / 33
  • 83. The Query The query: q(x) ← DNA Repair Gene(x) ∧ Antigen Gene(x) ∧ Cancer Gene(x) SELECT DISTINCT r0.element_id as element_id FROM RESOURCE_INDEX.CT_ANN r0 JOIN RESOURCE_INDEX.CT_ANN r1 ON r0.element_id = r1.element_id JOIN RESOURCE_INDEX.CT_ANN r2 ON r1.element_id = r2.element_id WHERE ((r0.idx >= 1783559 AND r0.idx <= 1783657)) AND ((r1.idx >= 1782996 AND r1.idx <= 1783029)) AND ((r2.idx >= 1783115 AND r2.idx <= 1783253)); Standard SQL query efficient in ANY DBMS. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 28 / 33
  • 84. Conclusions Contributions • We indicated that efficient OBDA requires to take into account more than only T , A and Q. • Provided means to deal with redundancy at the level of the TBox. • We showed that expansion is not necessary that we can complete ABoxes. • We presented to efficient ways to complete ABoxes, one for the general OBDA setting and one for the virtual setting. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 29 / 33
  • 85. Conclusions Contributions • We indicated that efficient OBDA requires to take into account more than only T , A and Q. • Provided means to deal with redundancy at the level of the TBox. • We showed that expansion is not necessary that we can complete ABoxes. • We presented to efficient ways to complete ABoxes, one for the general OBDA setting and one for the virtual setting. Future work • Exploring more expressive languages. • Exploring the RDFS/SPARQL setting. • Handling updates of T and A. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 29 / 33
  • 86. Extra examples Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 30 / 33
  • 87. First Observation (cont.) Mappings will introduce dependencies over ABoxes Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
  • 88. First Observation (cont.) Mappings will introduce dependencies over ABoxes Let R be a DB schema with the relation schema employee with attributes id, dept, and salary. Let M be the following mappings: SELECT id,dept FROM employee ;q(id, dept) ← Employee(id) ∧ WORKS-FOR(id, dept) SELECT id,dept FROM employee WHERE salary > 1000 ;q(id, dept) ← Manager(id)∧ MANAGES(id, dept) Then for any instance I, if Manager(John) ∈ A we have that Employee(John). Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
  • 89. First Observation (cont.) Mappings will introduce dependencies over ABoxes Let R be a DB schema with the relation schema employee with attributes id, dept, and salary. Let M be the following mappings: SELECT id,dept FROM employee ;q(id, dept) ← Employee(id) ∧ WORKS-FOR(id, dept) SELECT id,dept FROM employee WHERE salary > 1000 ;q(id, dept) ← Manager(id)∧ MANAGES(id, dept) Then for any instance I, if Manager(John) ∈ A we have that Employee(John). This is an indicator of completeness of all ABoxes A for M and R, e.g., A is complete w.r.t. Manager A Employee. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 31 / 33
  • 90. Formalization: Chains Let T be a TBox, B, C basic concepts, and Σ a set of dependencies over T . A T -chain from B to C in T (resp., a Σ-chain from B to C in Σ) is a sequence of concept inclusion assertions (Bi Bi )n i=0 in T (resp., a sequence of inclusion dependencies (Bi A Bi )n i=0 in Σ), for some n ≥ 0, such that: 1 B0 = B, Bn = C, and 2 for 1 ≤ i ≤ n, we have that Bi−1 and Bi are basic concepts s.t., either (i) Bi−1 = Bi , or (ii) Bi−1 = ∃R and Bi = ∃R− , for some basic role R. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 32 / 33
  • 91. Formalization: Redundancy Let T be a TBox, B, C basic concepts, and Σ a set of dependencies. The concept inclusion assertion B C is directly redundant in T w.r.t. Σ if (i) Σ |= B A C and (ii) for every T -chain (Bi Bi )n i=0 with Bn = B in T , there is a Σ-chain (Bi A Bi )n i=0. Then, B C is redundant in T w.r.t. Σ if (a) it is directly redundant, or (b) there exists B = B s.t. (i) T |= B C, (ii) B C is not redundant in T w.r.t. Σ, and (iii) B B is directly redundant in T w.r.t. Σ. Rodriguez-Muro and Calvanese (UNIBZ) Dependencies and OBDA May 11, 2011 33 / 33