SlideShare ist ein Scribd-Unternehmen logo
1 von 32
GBLENDER: Towards Blending Visual Query
 Formulation and Query Processing in Graph
                 Databases
Changjiu Jin et al. at SIGMOD 2010
Presented by: Abolfazl Asudeh

CSE 6339 – Spring 2013
Outline
       Motivation
       Goals and Contributions
       Preliminaries
       Indices
       Query Processing




    2                             4/12/2013
Motivation
       Formulating a graph(query)  “programming" skill




    3                                       4/12/2013
Motivation
       Graph matching  Subgraph Isomorphism  NP-
        Complete




    4                                      4/12/2013
Outline
       Motivation
       Goals and Contributions
       Preliminaries
       Indices
       Query Processing




    5                             4/12/2013
Goals and Contributions
       1. Produce a visual interface
           to formulate a query by clicking-and-dragging items




    6                                                4/12/2013
Goals and Contributions
       Improve System Response Time
       They blend Visual Query Construction and Query
        Processing
       Use the latency of Query production to process
        current part of query.
           Start query processing before the user hits the RUN
            button
       They assume user doesn’t make mistake during the
        query formulation (doesn’t UNDO)



    7                                                4/12/2013
Challenges
       How to mix query construction and evaluation with
        MINIMAL DISK ACCESS
       How to Index the data
       How to make the pre-fetch processing transparent
        from the user




    8                                        4/12/2013
Overview: Indexing
       action-aware frequent index (A2F)
           Use Preprocessing
       action-aware infrequent index (A2I)
           If the final query is infrequent, probe A2I




    9                                                     4/12/2013
Outline
    Motivation
    Goals and Contributions
    Preliminaries
    Indices
    Query Processing




    10                         4/12/2013
PRELIMINARIES
    Graph DB: A set of Graphs (V,E)



        Graph Fragment: a small sub-graph existing in
         graph databases or query graphs




    11                                        4/12/2013
Example: Fragment samples in a chemical
compound database




12                            4/12/2013
PRELIMINARIES: Frequent Fragment
    A fragment       is frequent if its support is not less than
      ∣ ∣
        ∣ ∣: the number of graphs in the data base
    e.g. if =0.1 and ∣ ∣=10000




    13                                            4/12/2013
PRELIMINARIES: Infrequent Fragment
    A fragment is frequent if its support is less than ∣
     ∣
    e.g. if =0.1 and ∣ ∣=10000




    14                                      4/12/2013
Discriminative Infrequent Fragment
    If all sub-graphs of a fragment are frequent but itself
     is infrequent




                             √
    15                                        4/12/2013
Outline
    Motivation
    Goals and Contributions
    Preliminaries
    Indices
    Query Processing




    16                         4/12/2013
Indexing
    Because of the visual interface structure, the query
     size is grown by one in each step.
    The indexing has to (given a list of graphs that
     satisfy the fragment ′ in Step ) to support efficient
     strategy for identifyingthe graphs that match the
     fragment ′′ (generated at Step + 1)




    17                                       4/12/2013
A2F index
    Being able to fit the matches in the memory ,
     Frequent indices are divide to Memory-Resident and
     Disk-Resident
    Smaller frequent fragments are processed more
     frequently in various visual queries
    Smaller fragments have more matches
    If |g|< (threshold) it is saved in memory (MF-index)
     otherwise it is saved in the disk (DF-index)




    18                                     4/12/2013
MF index structure - example




19                             4/12/2013
MF index structure - example




20                             4/12/2013
MF index structure - example




21                             4/12/2013
MF index structure - example




22                             4/12/2013
DF-Index




23         4/12/2013
DF-Index




24         4/12/2013
A2I index
    Just Index the discriminative infrequent graphs
    For other infrequent graphs use sub-graph
     isomorphism test over its discriminative infrequent




    25                                      4/12/2013
Outline
    Motivation
    Goals and Contributions
    Preliminaries
    Indices
    Query Processing




    26                         4/12/2013
GBlender Algorithm




27                   4/12/2013
example




28        4/12/2013
example




29        4/12/2013
example




30        4/12/2013
example




31        4/12/2013
Thank you




32          4/12/2013

Weitere ähnliche Inhalte

Ähnlich wie GBLENDER: Towards blending visual query formulation and query processing in graph databases

DotNetToscana: NoSQL Revolution - RavenDB
DotNetToscana: NoSQL Revolution - RavenDBDotNetToscana: NoSQL Revolution - RavenDB
DotNetToscana: NoSQL Revolution - RavenDBNicola Baldi
 
Information Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT CoimbatoreInformation Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT Coimbatoreveningstonk
 
New seven management tools
New seven management toolsNew seven management tools
New seven management toolsJavith Saleem
 
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docx
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docxhttphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docx
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docxpooleavelina
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...KDZ - Zentrum für Verwaltungsforschung
 
TDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWTDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWukc4
 
The two faces of sql parameter sniffing
The two faces of sql parameter sniffingThe two faces of sql parameter sniffing
The two faces of sql parameter sniffingIvo Andreev
 
Topic 12: NoSQL in Action
Topic 12: NoSQL in ActionTopic 12: NoSQL in Action
Topic 12: NoSQL in ActionZubair Nabi
 
Ads applications of ads
Ads  applications of adsAds  applications of ads
Ads applications of adsTech_MX
 
2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarellitruongthuthuy47
 
Aris 9 See the Future Today
Aris 9 See the Future TodayAris 9 See the Future Today
Aris 9 See the Future TodaySoftware AG
 
Quack Chat | Partitioning - Black Magic or Silver Bullet
Quack Chat | Partitioning - Black Magic or Silver BulletQuack Chat | Partitioning - Black Magic or Silver Bullet
Quack Chat | Partitioning - Black Magic or Silver BulletIDERA Software
 
A new approach for converging LVC simulation architectures
A new approach for converging LVC simulation architecturesA new approach for converging LVC simulation architectures
A new approach for converging LVC simulation architecturesJosé Ramón Martínez Salio
 
Drupal for Project Managers, Part 3: Launching
Drupal for Project Managers, Part 3: LaunchingDrupal for Project Managers, Part 3: Launching
Drupal for Project Managers, Part 3: LaunchingAcquia
 
Babok2 chapter9 daxko
Babok2 chapter9 daxko Babok2 chapter9 daxko
Babok2 chapter9 daxko Mudassir Iqbal
 

Ähnlich wie GBLENDER: Towards blending visual query formulation and query processing in graph databases (20)

DotNetToscana: NoSQL Revolution - RavenDB
DotNetToscana: NoSQL Revolution - RavenDBDotNetToscana: NoSQL Revolution - RavenDB
DotNetToscana: NoSQL Revolution - RavenDB
 
RavenDB
RavenDBRavenDB
RavenDB
 
Information Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT CoimbatoreInformation Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT Coimbatore
 
New seven management tools
New seven management toolsNew seven management tools
New seven management tools
 
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docx
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docxhttphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docx
httphps.orgdocumentspregnancy_fact_sheet.pdfhttpswww.docx
 
PGi Tableau
PGi TableauPGi Tableau
PGi Tableau
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
 
Data ware house
Data ware houseData ware house
Data ware house
 
TDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWTDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDW
 
Lesson02 database system architecture
Lesson02 database system architectureLesson02 database system architecture
Lesson02 database system architecture
 
The two faces of sql parameter sniffing
The two faces of sql parameter sniffingThe two faces of sql parameter sniffing
The two faces of sql parameter sniffing
 
NASA HDF/HDF-EOS Data Access Challenges
NASA HDF/HDF-EOS Data Access ChallengesNASA HDF/HDF-EOS Data Access Challenges
NASA HDF/HDF-EOS Data Access Challenges
 
Topic 12: NoSQL in Action
Topic 12: NoSQL in ActionTopic 12: NoSQL in Action
Topic 12: NoSQL in Action
 
Ads applications of ads
Ads  applications of adsAds  applications of ads
Ads applications of ads
 
2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli2 data warehouse life cycle golfarelli
2 data warehouse life cycle golfarelli
 
Aris 9 See the Future Today
Aris 9 See the Future TodayAris 9 See the Future Today
Aris 9 See the Future Today
 
Quack Chat | Partitioning - Black Magic or Silver Bullet
Quack Chat | Partitioning - Black Magic or Silver BulletQuack Chat | Partitioning - Black Magic or Silver Bullet
Quack Chat | Partitioning - Black Magic or Silver Bullet
 
A new approach for converging LVC simulation architectures
A new approach for converging LVC simulation architecturesA new approach for converging LVC simulation architectures
A new approach for converging LVC simulation architectures
 
Drupal for Project Managers, Part 3: Launching
Drupal for Project Managers, Part 3: LaunchingDrupal for Project Managers, Part 3: Launching
Drupal for Project Managers, Part 3: Launching
 
Babok2 chapter9 daxko
Babok2 chapter9 daxko Babok2 chapter9 daxko
Babok2 chapter9 daxko
 

Kürzlich hochgeladen

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Kürzlich hochgeladen (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

GBLENDER: Towards blending visual query formulation and query processing in graph databases

  • 1. GBLENDER: Towards Blending Visual Query Formulation and Query Processing in Graph Databases Changjiu Jin et al. at SIGMOD 2010 Presented by: Abolfazl Asudeh CSE 6339 – Spring 2013
  • 2. Outline  Motivation  Goals and Contributions  Preliminaries  Indices  Query Processing 2 4/12/2013
  • 3. Motivation  Formulating a graph(query)  “programming" skill 3 4/12/2013
  • 4. Motivation  Graph matching  Subgraph Isomorphism  NP- Complete 4 4/12/2013
  • 5. Outline  Motivation  Goals and Contributions  Preliminaries  Indices  Query Processing 5 4/12/2013
  • 6. Goals and Contributions  1. Produce a visual interface  to formulate a query by clicking-and-dragging items 6 4/12/2013
  • 7. Goals and Contributions  Improve System Response Time  They blend Visual Query Construction and Query Processing  Use the latency of Query production to process current part of query.  Start query processing before the user hits the RUN button  They assume user doesn’t make mistake during the query formulation (doesn’t UNDO) 7 4/12/2013
  • 8. Challenges  How to mix query construction and evaluation with MINIMAL DISK ACCESS  How to Index the data  How to make the pre-fetch processing transparent from the user 8 4/12/2013
  • 9. Overview: Indexing  action-aware frequent index (A2F)  Use Preprocessing  action-aware infrequent index (A2I)  If the final query is infrequent, probe A2I 9 4/12/2013
  • 10. Outline  Motivation  Goals and Contributions  Preliminaries  Indices  Query Processing 10 4/12/2013
  • 11. PRELIMINARIES  Graph DB: A set of Graphs (V,E)  Graph Fragment: a small sub-graph existing in graph databases or query graphs 11 4/12/2013
  • 12. Example: Fragment samples in a chemical compound database 12 4/12/2013
  • 13. PRELIMINARIES: Frequent Fragment  A fragment is frequent if its support is not less than ∣ ∣  ∣ ∣: the number of graphs in the data base  e.g. if =0.1 and ∣ ∣=10000 13 4/12/2013
  • 14. PRELIMINARIES: Infrequent Fragment  A fragment is frequent if its support is less than ∣ ∣  e.g. if =0.1 and ∣ ∣=10000 14 4/12/2013
  • 15. Discriminative Infrequent Fragment  If all sub-graphs of a fragment are frequent but itself is infrequent √ 15 4/12/2013
  • 16. Outline  Motivation  Goals and Contributions  Preliminaries  Indices  Query Processing 16 4/12/2013
  • 17. Indexing  Because of the visual interface structure, the query size is grown by one in each step.  The indexing has to (given a list of graphs that satisfy the fragment ′ in Step ) to support efficient strategy for identifyingthe graphs that match the fragment ′′ (generated at Step + 1) 17 4/12/2013
  • 18. A2F index  Being able to fit the matches in the memory , Frequent indices are divide to Memory-Resident and Disk-Resident  Smaller frequent fragments are processed more frequently in various visual queries  Smaller fragments have more matches  If |g|< (threshold) it is saved in memory (MF-index) otherwise it is saved in the disk (DF-index) 18 4/12/2013
  • 19. MF index structure - example 19 4/12/2013
  • 20. MF index structure - example 20 4/12/2013
  • 21. MF index structure - example 21 4/12/2013
  • 22. MF index structure - example 22 4/12/2013
  • 23. DF-Index 23 4/12/2013
  • 24. DF-Index 24 4/12/2013
  • 25. A2I index  Just Index the discriminative infrequent graphs  For other infrequent graphs use sub-graph isomorphism test over its discriminative infrequent 25 4/12/2013
  • 26. Outline  Motivation  Goals and Contributions  Preliminaries  Indices  Query Processing 26 4/12/2013
  • 28. example 28 4/12/2013
  • 29. example 29 4/12/2013
  • 30. example 30 4/12/2013
  • 31. example 31 4/12/2013
  • 32. Thank you 32 4/12/2013