A short intro to PDQ: Proof-driven Querying
Michael Benedikt 
with Julien Leblay, Efi Tsamoura, and Michael Vanden Boom
Background 
DBOnto: Semantics for a better world 
Exploit semantics of data: 
within a single source, among distributed sources, across data models 
• Enable new applications 
• Deliver better performance for current data-intensive tasks 
• Diminish effort in integrating complex data sources
Background 
Dimensions of Semantic Data 
[Figure axes: completeness of sources/source access model; target implementation; data model for queries and constraints]
Background 
Semantic Data Technology 
Semantic Web 
• RDF data model, description logic constraints 
• Inherently incomplete sources 
• Certain answer semantics 
• Wide range of target implementations
Background 
Semantic Data Technology 
Query Optimization with Constraints
• Relational data model and constraints 
• Complete information 
• Access via lookup indices in sources 
• Compile to plan language of DBMS
Background 
Semantic Data Technology 
Query Optimization with Constraints via Reformulation
• Relational data model and constraints 
• Complete sources 
• Compile to query language (e.g. SQL)
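As a rough illustration of reformulation with constraints, the sketch below runs a query and a simpler reformulation of it over a tiny database and compares the answers. Everything in it is an assumption made for illustration: the `emp` and `dept` tables, the sample rows, and the use of Python's built-in `sqlite3` module as a stand-in for a relational source. The reformulation is only valid because of the integrity constraint that every `emp.dept_id` is non-null and references an existing `dept.id`.

```python
import sqlite3


def run_both():
    """Run a query and its constraint-based reformulation; return both answers."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(
        """
        CREATE TABLE dept (id INTEGER PRIMARY KEY, title TEXT NOT NULL);
        CREATE TABLE emp (
            name TEXT NOT NULL,
            -- Integrity constraint: every employee points at an existing dept.
            dept_id INTEGER NOT NULL REFERENCES dept(id)
        );
        INSERT INTO dept VALUES (1, 'Databases'), (2, 'Logic');
        INSERT INTO emp VALUES ('Ada', 1), ('Kurt', 2), ('Edgar', 1);
        """
    )
    # Original query: names of employees that belong to some department.
    original = (
        "SELECT e.name FROM emp e JOIN dept d ON e.dept_id = d.id "
        "ORDER BY e.name"
    )
    # Reformulation: under the constraint above the join is redundant.
    reformulated = "SELECT name FROM emp ORDER BY name"
    answers = (
        conn.execute(original).fetchall(),
        conn.execute(reformulated).fetchall(),
    )
    conn.close()
    return answers


if __name__ == "__main__":
    original_rows, reformulated_rows = run_both()
    print(original_rows == reformulated_rows)
```

Without the constraint (for example, if `dept_id` could be null) the two queries could differ, which is why the constraints have to be part of the optimizer's input.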
Background 
Semantic Data Technology 
Query Rewriting with Exact Views
• Relational sources and constraints 
• Base data may not be accessible 
• Can still look for exact answers to queries 
• Compile to query language (e.g. SQL)
Background 
Semantic Data Technology 
Federated Querying Over Web-based Sources
• Model sources and constraints relationally 
• Complete information on subset of sources 
• Distributed sources with mix of access regimes 
• Compile to middleware plan
Background 
Long-term PDQ vision 
[Figure: the three dimensions (completeness of sources/source access model, target implementation, data model for queries and constraints), with PDQ marked]
Functionality 
PDQ: what it is today 
System for answering queries Q in the presence of semantic relationships and access restrictions on sources
Targets:
• Relational data model and constraints
• Sufficient accessible information assumption: there is sufficient accessible data to obtain the exact answers to the query Q
• Compilation into a “static plan” (reformulation, physical plan, middleware plan)
Unified framework for:
• Query Optimization/Reformulation with Constraints
• Querying with Materialized Views
• Federated Querying with Complete Information
Functionality 
PDQ: what it is 
Inputs to the PDQ planner:
• Metadata, including a description D of access to sources and integrity constraints C
• Cost information (e.g. cost function on plans)
• Query Q
Output of the PDQ planner:
• P_best: plan using the access model described by D, with minimal cost, giving the exact answer to Q for databases satisfying constraints C
PDQ runtime: executes plans on top of Web-based or local data sources
Under the hood 
PDQ: how it works (sort of) 
Key observation: Under the sufficient accessible information assumption on Q, C, D, there is always a “static plan” P_Q (e.g. a relational algebra query) that can be run to answer Q.
We can find such a P_Q by looking for a “proof that there is sufficient information to answer Q”.
• First main component: procedures to turn “proofs of answerability” into plans
• Proof-to-plan procedure works for extremely rich class of integrity constraints
• Adaptable to different target implementations (SQL query, physical plan, distributed plan…)
• These “proof-to-plan” procedures are coupled with a reasoning system for finding the proofs of answerability.
• Plug-in architecture: Chase procedure, Tableau-based FO theorem-prover, …
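To make the proof-to-plan idea concrete, here is a toy sketch, not PDQ's actual implementation. It assumes a hypothetical schema with a relation `Profinfo(eid, office, lname)` that can only be accessed by supplying `eid`, a freely accessible relation `Udirectory(eid, lname)`, and the constraint that every `Profinfo` tuple has a matching `Udirectory` tuple. A simplified chase saturates the query's atoms under the constraints; each step that marks a fact as reachable through an access method is recorded as one access command of the plan. Constraints are restricted here to single-atom rules without existential variables, which is far narrower than the class the slides refer to.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    relation: str
    terms: tuple


def chase(facts, constraints):
    """Saturate facts under rules body_rel(x...) -> head_rel(projection of x...)."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for body_rel, head_rel, positions in constraints:
            for fact in list(facts):
                if fact.relation != body_rel:
                    continue
                derived = Atom(head_rel, tuple(fact.terms[i] for i in positions))
                if derived not in facts:
                    facts.add(derived)
                    changed = True
    return facts


def proof_to_plan(query_atoms, constants, access, constraints):
    """Return a list of access commands if the query is answerable, else None.

    access maps each relation to the positions that must be supplied as input.
    """
    facts = chase(query_atoms, constraints)
    accessible = set(constants)  # values we are able to feed into accesses
    exposed = set()              # facts shown to be retrievable
    plan = []
    progress = True
    while progress:
        progress = False
        for fact in sorted(facts - exposed):
            inputs = tuple(fact.terms[i] for i in access[fact.relation])
            if all(term in accessible for term in inputs):
                exposed.add(fact)
                accessible.update(fact.terms)
                plan.append((fact.relation, inputs))  # one proof step, one access
                progress = True
    return plan if all(atom in exposed for atom in query_atoms) else None


if __name__ == "__main__":
    # Hypothetical query: is there a professor with last name "Smith"?
    query = [Atom("Profinfo", ("e", "o", "Smith"))]
    access = {"Udirectory": (), "Profinfo": (0,)}
    constraints = [("Profinfo", "Udirectory", (0, 2))]
    print(proof_to_plan(query, {"Smith"}, access, constraints))
    # prints [('Udirectory', ()), ('Profinfo', ('e',))]
```

Read as a plan, the result says: scan `Udirectory` first, then call `Profinfo` with each `eid` obtained. Dropping the constraint makes the function return `None`, because nothing then supplies the `eid` values that `Profinfo` requires.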
Under the hood 
PDQ: how it works in a bit more detail 
Inputs to the PDQ planner:
• Metadata, including a description D of access to sources and integrity constraints C
• Query Q
• Cost information (e.g. cost function on plans)
Components of the PDQ planner:
• Reasoning system for finding “proofs of answerability”
• Proof-to-Plan conversion
Under the hood 
PDQ: how it works, still more 
We can find a static plan P_Q getting the exact answer to Q by looking for a “proof that Q is answerable” and then applying a proof-to-plan procedure.
Last component: search strategy. We can find a good P_Q by searching for a proof that
1. witnesses that Q is answerable
2. generates a low-cost plan
Search is directed by proof goal and cost
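One way to picture a search directed by both proof goal and cost is a uniform-cost search over partial proofs, sketched below. This is an assumed simplification rather than PDQ's search algorithm: a state is the set of facts exposed so far, each step exposes one more fact through an access method, and the per-relation costs are made-up numbers. The relations `Profinfo`, `Udirectory` and `Staff` are hypothetical.

```python
import heapq
from itertools import count


def cheapest_plan(facts, goal, inputs_of, cost_of, constants):
    """Return (cost, plan) for the cheapest sequence of accesses exposing goal.

    facts: set of (relation, terms) pairs available to the proof.
    goal: set of facts that must be exposed for the query to be answered.
    inputs_of: relation -> positions that must be known before an access.
    cost_of: relation -> assumed cost of one access command.
    """
    tie = count()  # tie-breaker so the heap never compares sets
    frontier = [(0, next(tie), frozenset(), [])]
    seen = set()
    while frontier:
        cost, _, exposed, plan = heapq.heappop(frontier)
        if goal <= exposed:
            return cost, plan  # proof goal reached at minimal cost
        if exposed in seen:
            continue
        seen.add(exposed)
        known = set(constants)
        for _, terms in exposed:
            known.update(terms)
        for fact in facts - exposed:
            relation, terms = fact
            if all(terms[i] in known for i in inputs_of[relation]):
                heapq.heappush(
                    frontier,
                    (cost + cost_of[relation], next(tie),
                     exposed | {fact}, plan + [relation]),
                )
    return None  # no proof of answerability exists


if __name__ == "__main__":
    profinfo = ("Profinfo", ("e", "o", "Smith"))
    udirectory = ("Udirectory", ("e", "Smith"))
    staff = ("Staff", ("e",))
    result = cheapest_plan(
        facts={profinfo, udirectory, staff},
        goal={profinfo},
        inputs_of={"Profinfo": (0,), "Udirectory": (), "Staff": ()},
        cost_of={"Profinfo": 2, "Udirectory": 10, "Staff": 1},
        constants={"Smith"},
    )
    print(result)  # prints (3, ['Staff', 'Profinfo'])
```

Both `Udirectory` and `Staff` can supply the input needed by `Profinfo`, so two proofs exist; the search returns the one whose plan is cheaper under the assumed costs.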
Under the hood 
PDQ architecture
Status 
PDQ today and tomorrow 
• Theoretical basis given in PODS 2014 paper 
• Demonstration implemented over web services in VLDB 2014 
• Implementation generates SQL reformulation over relational sources (run on top of Postgres)
Moving forward:
• Pilot project beginning Oct 2014 to explore “native implementation” of PDQ on top of the plan language of the LogicBlox DBMS
• Large EPSRC-funded project 2015-2020 to explore diverse uses of PDQ
Status 
PDQ today and tomorrow 
[Figure: the three dimensions (completeness of sources/source access model, target implementation, data model for queries and constraints), with PDQ 2014 and PDQ 2020 marked]
Next Steps 
PDQ: Next Steps 
• More info at http://cs.ox.ac.uk/pdq 
• See the demo!
