SlideShare a Scribd company logo
1 of 7
1
RDF Needs Better Database
 Database science is all one, laws of physics
apply to all
 RDF is relational in spirit, with some extra
complications
 To realize RDF’s potential, it must meet the best
in databasing, no separation of communities and
no reinventing wheels
© 2013 OpenLink Software, All rights reserved.
2
Virtuoso 7 Brings the Best of an
Analytics DB to RDF
 Column store, 6 to 14 bytes a quad, depending
on data, 3x more compact than before
 Vectored execution Get the best of the metal,
take advantage of locality 3x the thread speed
over previous
 Intra-query parallelization everywhere
 Scale out with flexible deployment, more parallel
© 2013 OpenLink Software, All rights reserved.
3
State Of Play
 Star Schema Benchmark
 Single Server, 30G
 Virtuoso SQL: 8.9s
 MonetDB SQL: 18.9s
 Virtuoso SPARQL: 45.4s
 2 Node Cluster, 300G
 Virtuoso SQL: 43.4s
© 2013 OpenLink Software, All rights reserved.
4
State Of Play (continued)
 Run with RDF in Virtuoso and run circles around
many RDBMS
 Use task-specific schema for performance on the
level of the best in SQL analytics
 Gap between SPARQL (schema-less) and SQL
(schema-first) closing
 now 5x, going to 3x
 parity is reachable via self-tuning physical data layout
© 2013 OpenLink Software, All rights reserved.
5
Big data is not only queries
 Many things (e.g., graph analytics) cannot be
expressed as queries
 Data preparation and ETL pipeline
 Map reduce and
 Bulk Synchronous Processing (BSP) frameworks
 Virtuoso can do all that with parallel SQL
procedures and Java hosting
© 2013 OpenLink Software, All rights reserved.
6
Future convergence
 RDF is, in the end, about relations
 The URI, the Superkey, is the ultimate silo
breaker
 Get Schema-less flexibility, self-describing data,
and schema first performance, through adaptive
physical layout
 Open RDF data for use by the SQL tool chain
© 2013 OpenLink Software, All rights reserved.
6
Future convergence
 RDF is, in the end, about relations
 The URI, the Superkey, is the ultimate silo
breaker
 Get Schema-less flexibility, self-describing data,
and schema first performance, through adaptive
physical layout
 Open RDF data for use by the SQL tool chain
© 2013 OpenLink Software, All rights reserved.

More Related Content

What's hot

Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Spark Summit
 
Semantics, rdf and drupal
Semantics, rdf and drupalSemantics, rdf and drupal
Semantics, rdf and drupal
Gokul Nk
 
Big data Processing with Apache Spark & Scala
Big data Processing with Apache Spark & ScalaBig data Processing with Apache Spark & Scala
Big data Processing with Apache Spark & Scala
Edureka!
 
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
Teresa Giacomini
 

What's hot (20)

Spark intro iwomm_2017-07-19
Spark intro iwomm_2017-07-19Spark intro iwomm_2017-07-19
Spark intro iwomm_2017-07-19
 
Introduction to SparkR
Introduction to SparkRIntroduction to SparkR
Introduction to SparkR
 
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
 
Apache Spark 101
Apache Spark 101Apache Spark 101
Apache Spark 101
 
Semantics, rdf and drupal
Semantics, rdf and drupalSemantics, rdf and drupal
Semantics, rdf and drupal
 
Integrating Drupal with a Triple Store
Integrating Drupal with a Triple StoreIntegrating Drupal with a Triple Store
Integrating Drupal with a Triple Store
 
Drupal 7 and RDF
Drupal 7 and RDFDrupal 7 and RDF
Drupal 7 and RDF
 
Performance of Spark vs MapReduce
Performance of Spark vs MapReducePerformance of Spark vs MapReduce
Performance of Spark vs MapReduce
 
Sparkler - Spark Crawler
Sparkler - Spark Crawler Sparkler - Spark Crawler
Sparkler - Spark Crawler
 
Big data Processing with Apache Spark & Scala
Big data Processing with Apache Spark & ScalaBig data Processing with Apache Spark & Scala
Big data Processing with Apache Spark & Scala
 
Sparkler Presentation for Spark Summit East 2017
Sparkler Presentation for Spark Summit East 2017Sparkler Presentation for Spark Summit East 2017
Sparkler Presentation for Spark Summit East 2017
 
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
PostgreSQL Extension APIs are Changing the Face of Relational Databases | PGC...
 
Sponsored Talk @ PGConf APAC 2018 - Migrating Oracle to EDB Postgres Approach...
Sponsored Talk @ PGConf APAC 2018 - Migrating Oracle to EDB Postgres Approach...Sponsored Talk @ PGConf APAC 2018 - Migrating Oracle to EDB Postgres Approach...
Sponsored Talk @ PGConf APAC 2018 - Migrating Oracle to EDB Postgres Approach...
 
Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala
 
Spark for big data analytics
Spark for big data analyticsSpark for big data analytics
Spark for big data analytics
 
What No One Tells You About Writing a Streaming App: Spark Summit East talk b...
What No One Tells You About Writing a Streaming App: Spark Summit East talk b...What No One Tells You About Writing a Streaming App: Spark Summit East talk b...
What No One Tells You About Writing a Streaming App: Spark Summit East talk b...
 
Intro to Apache Spark
Intro to Apache SparkIntro to Apache Spark
Intro to Apache Spark
 
Apache spark
Apache sparkApache spark
Apache spark
 
RISELab:Enabling Intelligent Real-Time Decisions
RISELab:Enabling Intelligent Real-Time DecisionsRISELab:Enabling Intelligent Real-Time Decisions
RISELab:Enabling Intelligent Real-Time Decisions
 
Introduction to Apache Spark and MLlib
Introduction to Apache Spark and MLlibIntroduction to Apache Spark and MLlib
Introduction to Apache Spark and MLlib
 

Viewers also liked

Valentines Day Website Analysis
Valentines Day Website AnalysisValentines Day Website Analysis
Valentines Day Website Analysis
guesta66e6b
 

Viewers also liked (16)

Refuge Powerpoint
Refuge  PowerpointRefuge  Powerpoint
Refuge Powerpoint
 
Present continuous2
Present continuous2Present continuous2
Present continuous2
 
Present perfect for since
Present perfect for sincePresent perfect for since
Present perfect for since
 
Valentines Day Website Analysis
Valentines Day Website AnalysisValentines Day Website Analysis
Valentines Day Website Analysis
 
1 presentperfectindefinitepast
1 presentperfectindefinitepast1 presentperfectindefinitepast
1 presentperfectindefinitepast
 
1 presentperfectindefinitepast
1 presentperfectindefinitepast1 presentperfectindefinitepast
1 presentperfectindefinitepast
 
Second conditional
Second conditionalSecond conditional
Second conditional
 
Possesive 's
Possesive 'sPossesive 's
Possesive 's
 
GeoKnow: Making the Web an Exploratory Place for Spatial Data
GeoKnow: Making the Web an Exploratory Place for Spatial DataGeoKnow: Making the Web an Exploratory Place for Spatial Data
GeoKnow: Making the Web an Exploratory Place for Spatial Data
 
Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDF
 
Present continuous1
Present continuous1Present continuous1
Present continuous1
 
Possesive pronouns
Possesive pronounsPossesive pronouns
Possesive pronouns
 
First conditional
First conditionalFirst conditional
First conditional
 
Phaniendra_document_ece
Phaniendra_document_ecePhaniendra_document_ece
Phaniendra_document_ece
 
Guia 300 advanced
Guia 300 advancedGuia 300 advanced
Guia 300 advanced
 
Guia 305
Guia 305Guia 305
Guia 305
 

Similar to ESWC 2013 Panel - Semantic Technologies for Big Data Analytics: Opportunities and Challenges

Oracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overviewOracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overview
Paulo Fagundes
 
Comparative Study That Aims Rdf Processing For The Java Platform
Comparative Study That Aims Rdf Processing For The Java PlatformComparative Study That Aims Rdf Processing For The Java Platform
Comparative Study That Aims Rdf Processing For The Java Platform
Computer Science
 
Con8862 no sql, json and time series data
Con8862   no sql, json and time series dataCon8862   no sql, json and time series data
Con8862 no sql, json and time series data
Anuj Sahni
 

Similar to ESWC 2013 Panel - Semantic Technologies for Big Data Analytics: Opportunities and Challenges (20)

Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
 
GraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesGraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph Databases
 
Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop
 
Oracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overviewOracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overview
 
Solution Use Case Demo: The Power of Relationships in Your Big Data
Solution Use Case Demo: The Power of Relationships in Your Big DataSolution Use Case Demo: The Power of Relationships in Your Big Data
Solution Use Case Demo: The Power of Relationships in Your Big Data
 
Apache Spark 101 - Demi Ben-Ari - Panorays
Apache Spark 101 - Demi Ben-Ari - PanoraysApache Spark 101 - Demi Ben-Ari - Panorays
Apache Spark 101 - Demi Ben-Ari - Panorays
 
Comparative Study That Aims Rdf Processing For The Java Platform
Comparative Study That Aims Rdf Processing For The Java PlatformComparative Study That Aims Rdf Processing For The Java Platform
Comparative Study That Aims Rdf Processing For The Java Platform
 
Spark Workshop
Spark WorkshopSpark Workshop
Spark Workshop
 
Con8862 no sql, json and time series data
Con8862   no sql, json and time series dataCon8862   no sql, json and time series data
Con8862 no sql, json and time series data
 
OUG Scotland 2014 - NoSQL and MySQL - The best of both worlds
OUG Scotland 2014 - NoSQL and MySQL - The best of both worldsOUG Scotland 2014 - NoSQL and MySQL - The best of both worlds
OUG Scotland 2014 - NoSQL and MySQL - The best of both worlds
 
Oracle RAC 12c Release 2 - Overview
Oracle RAC 12c Release 2 - OverviewOracle RAC 12c Release 2 - Overview
Oracle RAC 12c Release 2 - Overview
 
Webcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond HadoopWebcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond Hadoop
 
Rollin onj Rubyv3
Rollin onj Rubyv3Rollin onj Rubyv3
Rollin onj Rubyv3
 
Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1) Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1)
 
No sql database
No sql databaseNo sql database
No sql database
 
RDBMS vs Hadoop vs Spark
RDBMS vs Hadoop vs SparkRDBMS vs Hadoop vs Spark
RDBMS vs Hadoop vs Spark
 
AWS Finland September Meetup - Using Amazon Neptune to build Fashion Knowledg...
AWS Finland September Meetup - Using Amazon Neptune to build Fashion Knowledg...AWS Finland September Meetup - Using Amazon Neptune to build Fashion Knowledg...
AWS Finland September Meetup - Using Amazon Neptune to build Fashion Knowledg...
 
Extending the R API for Spark with sparklyr and Microsoft R Server with Ali Z...
Extending the R API for Spark with sparklyr and Microsoft R Server with Ali Z...Extending the R API for Spark with sparklyr and Microsoft R Server with Ali Z...
Extending the R API for Spark with sparklyr and Microsoft R Server with Ali Z...
 
APACHE SPARK.pptx
APACHE SPARK.pptxAPACHE SPARK.pptx
APACHE SPARK.pptx
 

Recently uploaded

Recently uploaded (20)

Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System Strategy
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxBT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 

ESWC 2013 Panel - Semantic Technologies for Big Data Analytics: Opportunities and Challenges

  • 1. 1 RDF Needs Better Database  Database science is all one, laws of physics apply to all  RDF is relational in spirit, with some extra complications  To realize RDF’s potential, it must meet the best in databasing, no separation of communities and no reinventing wheels © 2013 OpenLink Software, All rights reserved.
  • 2. 2 Virtuoso 7 Brings the Best of an Analytics DB to RDF  Column store, 6 to 14 bytes a quad, depending on data, 3x more compact than before  Vectored execution Get the best of the metal, take advantage of locality 3x the thread speed over previous  Intra-query parallelization everywhere  Scale out with flexible deployment, more parallel © 2013 OpenLink Software, All rights reserved.
  • 3. 3 State Of Play  Star Schema Benchmark  Single Server, 30G  Virtuoso SQL: 8.9s  MonetDB SQL: 18.9s  Virtuoso SPARQL: 45.4s  2 Node Cluster, 300G  Virtuoso SQL: 43.4s © 2013 OpenLink Software, All rights reserved.
  • 4. 4 State Of Play (continued)  Run with RDF in Virtuoso and run circles around many RDBMS  Use task-specific schema for performance on the level of the best in SQL analytics  Gap between SPARQL (schema-less) and SQL (schema-first) closing  now 5x, going to 3x  parity is reachable via self-tuning physical data layout © 2013 OpenLink Software, All rights reserved.
  • 5. 5 Big data is not only queries  Many things (e.g., graph analytics) cannot be expressed as queries  Data preparation and ETL pipeline  Map reduce and  Bulk Synchronous Processing (BSP) frameworks  Virtuoso can do all that with parallel SQL procedures and Java hosting © 2013 OpenLink Software, All rights reserved.
  • 6. 6 Future convergence  RDF is, in the end, about relations  The URI, the Superkey, is the ultimate silo breaker  Get Schema-less flexibility, self-describing data, and schema first performance, through adaptive physical layout  Open RDF data for use by the SQL tool chain © 2013 OpenLink Software, All rights reserved.
  • 7. 6 Future convergence  RDF is, in the end, about relations  The URI, the Superkey, is the ultimate silo breaker  Get Schema-less flexibility, self-describing data, and schema first performance, through adaptive physical layout  Open RDF data for use by the SQL tool chain © 2013 OpenLink Software, All rights reserved.