stardog.com




    Stardog 1.1
 An Easier, Smarter,
Faster RDF Database
   Michael Grove, Clark & Parsia LLC
        mike@clarkparsia.com
 @mikegrovesoft, @stardog_db, @candp






        About C&P

• We build semantic technology tools
  for enterprise solutions

• Proud bootstrappers since 2005
• Offices in DC and Cambridge, MA
• Government & enterprise customers




  What is Stardog?

• a pure Java RDF database
• full-service, feature rich
• focus on query performance
• standards compliant
• scalable (up first, out next)





           History
• Development started summer 2010
• Stardog 0.5 alpha - 2 May 2011
• Stardog 1.0 final - 19 June 2012
 • Total of 32 releases, ~500 tickets,
   100s of emails on the mailing list

• Stardog 1.0.7 presently
• Stardog 1.1 real soon now...




Easier.






     What is easy?
• What’s “easy” in an RDF database?
 • Configuration
 • Maintenance
 • User Experience
 • i.e., rationally predictable
• Easier for whom? Not a simple
  question.





     Configuration
• Convention, not configuration
• “Quick Start” is the shortest page in
  the docs

• 4 steps to querying
• Predictable, sane defaults throughout
• Adapted to Java, Unix, Semtech cultures
 • Culture is key to convention
• Very good (!) documentation



      Maintenance
• Nothing is easier than doing nothing
 • RDF & OWL are ideally schema
    flexible

 • Job scheduler: search, indexes, etc.
 • Data migration tools since before 1.0
 • Multi-tenancy, online & offline DBs
 • Just add data...Automatic data
    quality*

• NoSQL == Anti-jobs program for DBAs




     Except that...
• Every DB has to be admin’d &
  maintained

• Matter of degree, not kind
• Stardog Enterprise Server Management
 • audit logging
 • JMX monitoring
 • web console
 • online backups (coming soon!)




  User Experience
• Client-server & Embeddable
• Jena, Sesame, SNARL, HTTP
• SPARQL query simplifications
• ACID transactions
• Idiomatic Java & Unix interfaces
 • Great CLI & shell…
 • Windows has gotten much better! :>
• Rich security model




Smarter.






  Okay...that’s BS.
• “Smarter” is marketing speak
• But Stardog 1.1 has a rich feature set
 • Reasoning, including user-defined rules (UDR)
 • Integrity Constraint Validation (ICV)
 • Semantic Search
 • Security
 • Spring
 • Linked Data Platform




        Reasoning
• OWL 2 DL, QL, EL, and RL
• Query-time, no materialization
• Only pay for what you eat
• Embarrassingly parallel in part
• Pellet 3 embedded for OWL 2 DL
  schema reasoning only

• Very flexible re: named graphs (NGs) & schemas
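
The "query-time, no materialization" point can be illustrated with a toy example. The sketch below is not Stardog's implementation; the schema, data, and function names are hypothetical. It answers a class query by expanding the query against the schema, leaving the stored facts untouched:

```python
# Toy illustration of query-time reasoning (no materialization).
# The schema, data, and function names are hypothetical.

SUBCLASS_OF = {            # child class -> parent class
    "Manager": "Employee",
    "Engineer": "Employee",
    "Employee": "Person",
}

TYPES = {                  # asserted type facts only; nothing inferred is stored
    "alice": "Manager",
    "bob": "Engineer",
    "carol": "Person",
}


def subclasses(cls):
    """Return cls plus every class that is transitively a subclass of it."""
    result = {cls}
    changed = True
    while changed:
        changed = False
        for child, parent in SUBCLASS_OF.items():
            if parent in result and child not in result:
                result.add(child)
                changed = True
    return result


def instances_of(cls):
    """Answer 'who is a cls?' by expanding the query, not the data."""
    wanted = subclasses(cls)
    return sorted(ind for ind, asserted in TYPES.items() if asserted in wanted)


if __name__ == "__main__":
    print(instances_of("Employee"))
    print(instances_of("Person"))
```

Only queries that ask for inferred answers do the extra expansion work, which is one way to read "only pay for what you eat".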




  User-defined Rules
• New in 1.1!
• Using SWRL syntax
 • Including all SWRL builtins
   • Which are also available to SPARQL
• Recently added a new-individual builtin
 • Create new individuals in your rules
 • Beware of non-termination!
• Executed at query time like everything else



             ICV?
• Integrity Constraint Validation
• Automated data quality
• Closed world semantics
• Transactional
• High-level & declarative
• ICs can be OWL, SWRL, or SPARQL




             Example...
Only employees who are US citizens can
work on projects that receive funding from a
US government agency.

Class:
  Project and
      (receivesFundsFrom some USGovAgency)
SubClassOf:
  inverse(worksOn) only
      (Employee and nationality value "US")

 More examples: http://stardog.com/docs/
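
To make the closed-world reading of this constraint concrete, here is a minimal sketch that checks it over a tiny in-memory dataset. It is an illustration only, not how Stardog evaluates constraints; the triples, the helper functions, and the individual names (`apollo`, `nasa`, `ann`, `raj`) are all hypothetical.

```python
# Closed-world check of the constraint above over a tiny in-memory dataset.
# All identifiers and data here are hypothetical.

TRIPLES = {
    ("apollo", "type", "Project"),
    ("apollo", "receivesFundsFrom", "nasa"),
    ("nasa", "type", "USGovAgency"),
    ("website", "type", "Project"),
    ("ann", "type", "Employee"),
    ("ann", "nationality", "US"),
    ("ann", "worksOn", "apollo"),
    ("raj", "type", "Employee"),
    ("raj", "nationality", "IN"),
    ("raj", "worksOn", "apollo"),
    ("raj", "worksOn", "website"),
}


def has(s, p, o):
    """True if the triple is explicitly present in the data."""
    return (s, p, o) in TRIPLES


def objects(s, p):
    return {o for (s2, p2, o) in TRIPLES if s2 == s and p2 == p}


def subjects(p, o):
    return {s for (s, p2, o2) in TRIPLES if p2 == p and o2 == o}


def violations():
    """Return sorted (worker, project) pairs that break the constraint."""
    found = []
    for project in subjects("type", "Project"):
        funded = any(has(funder, "type", "USGovAgency")
                     for funder in objects(project, "receivesFundsFrom"))
        if not funded:
            continue  # the constraint only applies to US-government-funded projects
        for worker in subjects("worksOn", project):
            ok = (has(worker, "type", "Employee")
                  and has(worker, "nationality", "US"))
            if not ok:
                found.append((worker, project))
    return sorted(found)


if __name__ == "__main__":
    print(violations())
```

Under closed-world semantics a worker with no recorded nationality would also be reported, because the absence of the fact counts as a violation rather than as unknown.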




  Semantic Search
• Uses Waldo, our deep adaptation of
  Lucene

• Text index from RDF literals
• Search for resources or literals
• Integrated with SPARQL query
  evaluation

• Auto-managed search indexes




          Security

• Rich security model
• Based on standard RBAC model
• Applies at database-level
• Will extend to Named Graphs in 1.x
• Easy CLI admin tools (& Java API)




             Spring
• Love it or not, Spring isn’t going away
• Support Batch, Data Import, etc.
• Open Source:
  http://github.com/clark-parsia/spring-stardog

• Developed by an early adopter who
  needed it; supported/maintained by
  C&P





      Linked Data
• Stardog fills a hole in our Linked
  Data Platform

• HTML5, pure JS, client side web
  framework (based on backbone.js)

• Linked Data publishing suite
• Stardog Linked Data Catalog:
  an enterprise Linked Data
  management app





Faster.






          Finally...
• Now we can talk about something
  that’s objective, context-free, and
  measurable

• Yes!
• But no…#include <std_disclaim.h>
 • Your data & your queries are the
   only things that really matter





       That said...
• Two de facto benchmarks for
  SPARQL:

 • BSBM, OLTP-style; measured in query
   mixes per hour (QMpH; 25 queries per mix)

 • SP2B, OLAP-style (torture test), set
   of queries within a timeout, T, at a
   data size D





                 SP2B
• Stardog completes SP2B at 5M,
  10M, and 25M (except q5a)

• No other RDF database completes >
  5M. (As of the most recent report.
  Things change.)

• Considerable performance
  differential

• Pushing this out to 100M+ in 1.x




                BSBM
• A throughput test, primarily. Not
  necessarily simple queries

• On a modest machine, with 255 clients and
  10M triples, we sustain 7M queries per hour
  (277k QMpH)

• At 100M triples, with 255 clients, we sustain
  3M queries per hour (125k QMpH)

• Among the top 2 or 3 RDF DBs for BSBM
  performance

• We will tackle BSBM BI next...
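
The two throughput figures on this slide are related by the size of a BSBM query mix. The sketch below assumes 25 queries per mix, which is the reading under which the QMpH numbers and the queries-per-hour numbers agree; the function name is hypothetical.

```python
# Convert BSBM query mixes per hour (QMpH) into individual queries per hour.
# Assumption: one query mix contains 25 queries.

QUERIES_PER_MIX = 25


def queries_per_hour(qmph):
    """Individual queries per hour implied by a QMpH figure."""
    return qmph * QUERIES_PER_MIX


if __name__ == "__main__":
    for label, qmph in [("10M triples", 277_000), ("100M triples", 125_000)]:
        print(f"{label}: {queries_per_hour(qmph):,} queries/hour")
```

The results, 6,925,000 and 3,125,000, round to the 7M and 3M queries per hour quoted above.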



      Data Loading
• Two indexing modes
• Triples-only indexing
 • Faster loading, slower NG query
 • Up to 250,000 triples per second
• Quads indexing
 • Slower loading, faster NG query
 • Up to 150,000 triples per second
• More improvements coming in the future
 • Customized RDF parser
 • Will look at user-defined index subsets
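
As a rough back-of-the-envelope aid, the peak rates above give a lower bound on load time for a dataset of a given size. This is a sketch under the assumption that the "up to" rates are sustained for the whole load, which real loads may not achieve; the names are hypothetical.

```python
# Best-case load-time estimate from the peak loading rates quoted above.
# Assumption: the peak rate holds for the entire load.

PEAK_TRIPLES_PER_SECOND = {
    "triples-only": 250_000,
    "quads": 150_000,
}


def min_load_seconds(n_triples, mode):
    """Lower bound on load time in seconds for the given indexing mode."""
    return n_triples / PEAK_TRIPLES_PER_SECOND[mode]


if __name__ == "__main__":
    for mode in PEAK_TRIPLES_PER_SECOND:
        seconds = min_load_seconds(100_000_000, mode)
        print(f"{mode}: at least {seconds:.0f} s for 100M triples")
```

For 100M triples this gives roughly 400 seconds in triples-only mode versus roughly 667 seconds in quads mode, the price paid for faster named graph queries.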




 What’s new in 1.1

• Aforementioned user-defined rules
• But most notably, SPARQL 1.1
 • Our most requested feature in a
   survey

• Oh, we also made it faster





         SPARQL 1.1
• Latest revision of the SPARQL query
  language
• Put off implementing until spec finalized
 • It’s still in flux, but we decided to go for it
• Adds useful new features to SPARQL
 • Aggregates, grouping, sub-query,
    negation

  • Oh, and the entailment regimes




        SPARQL 1.1
• Rewrite of query planner & engine for 1.0.5
 • Changes needed to support SPARQL 1.1
 • Tested by users for the past 3 releases
• With great power comes great responsibility...
 • New features are not without cost
 • Query planning & optimization more crucial
    than ever

  • Majority of development time




             Roadmap
1. Transitivity & equality
2. GeoSPARQL
3. Web Console
4. Statement identifiers
5. Stored procedures & database triggers
6. “Stardocs”: doc/blob storage & NLP analytics
7. Graph Traversals, Algorithms & query langs
8. Statistical inference & machine learning
9. Stardog 2.0: Distributed Cluster Super Cloud Thingie!




         Summary
 Easier.
Smarter.
 Faster.
Pick all three!








Thanks!







Licensing








           Feature Rich
•   Support for RDFS, OWL 2 profiles (EL, RL, QL) & OWL 2 DL
    via schema-only queries

•   Semantic Search

•   ICV

•   Transactions

•   Rich security model

•   Support for major APIs

    •   Jena & Sesame, and our own SNARL

    •   SPARQL HTTP protocol, Graph Store protocol

    •   Also includes a CLI & Shell environment


A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

Stardog 1.1: Easier, Smarter, Faster RDF Database

  • 1. stardog.com Stardog 1.1 An Easier, Smarter, Faster RDF Database Michael Grove, Clark & Parsia LLC mike@clarkparsia.com @mikegrovesoft, @stardog_db, @candp 1
  • 2. About C&P • We build semantic technology tools for enterprise solutions • Proud bootstrappers since 2005 • Offices in DC and Cambridge, MA • Government & enterprise customers
  • 3. What is Stardog? • a pure Java RDF database • full-service, feature rich • focus on query performance • standards compliant • scalable (up first, out next)
  • 4. History • Development started summer 2010 • Stardog 0.5 alpha - 2 May 2011 • Stardog 1.0 final - 19 June 2012 • Total of 32 releases, ~500 tickets, 100s of emails on the mailing list • Stardog 1.0.7 presently • Stardog 1.1 real soon now...
  • 6. What is easy? • What’s “easy” in an RDF database? • Configuration • Maintenance • User Experience • i.e., rationally predictable • Easier for whom? Not a simple question.
  • 7. Configuration • Convention, not configuration • “Quick Start” is shortest page in the docs • 4 steps to querying • Predictable, sane defaults throughout • Adapted to Java, Unix, Semtech cultures • Culture is key to convention • Very good (!) documentation
  • 8. Maintenance • Nothing is easier than doing nothing • RDF & OWL are ideally schema flexible • Job scheduler: search, indexes, etc. • Data migration tools since < 1.0 • Multi-tenancy, online & offline DBs • Just add data... Automatic data quality* • NoSQL == Anti-jobs program for DBAs
  • 9. Except that... • Every DB has to be admin’d & maintained • Matter of degree, not kind • Stardog Enterprise Server Management • audit logging • JMX monitoring • web console • online backups (coming soon!)
  • 10. User Experience • Client-server & Embeddable • Jena, Sesame, SNARL, HTTP • SPARQL query simplifications • ACID transactions • Idiomatic Java & Unix interfaces • Great CLI & shell… • Windows has gotten much better! :> • Rich security model
  • 12. Okay...that’s BS. • “Smarter” is market speak • But Stardog 1.1 has a rich feature set • Reasoning, including UDR • Integrity Constraint Validation (ICV) • Semantic Search • Security • Spring • Linked Data Platform
  • 13. Reasoning • OWL 2 DL, QL, EL, and RL • Query-time, no materialization • Only pay for what you eat • Embarrassingly parallel in part • Pellet 3 embedded for OWL 2 DL schema reasoning only • Very flexible re: NGs & schemas
  • 14. User-defined Rules • New in 1.1! • Using SWRL syntax • Including all SWRL builtins • Which are also available to SPARQL • Recently added new individual builtin • Create new individuals in your rules • Beware of non-termination! • Executed at query time like everything else
  • 15. ICV? • Integrity Constraint Validation • Automated data quality • Closed world semantics • Transactional • High-level & declarative • ICs can be OWL, SWRL, or SPARQL
  • 16. Example... Only employees who are US citizens can work on projects that receive funding from a US government agency. Class: Project and (receivesFundsFrom some USGovAgency) SubClassOf: inverse(worksOn) only (Employee and nationality value "US") More examples: http://stardog.com/docs/
  • 17. Semantic Search • Uses Waldo, our deep adaptation of Lucene • Text index from RDF literals • Search for resources or literals • Integrated with SPARQL query evaluation • Auto-managed search indexes
  • 18. Security • Rich security model • Based on standard RBAC model • Applies at database-level • Will extend to Named Graphs in 1.x • Easy CLI admin tools (& Java API)
  • 19. Spring • Love it or not, Spring isn’t going away • Support Batch, Data Import, etc. • Open Source: http://github.com/clark-parsia/spring-stardog • Developed by an early adopter who needed it; supported/maintained by C&P
  • 20. Linked Data • Stardog fills a hole in our Linked Data Platform • HTML5, pure JS, client side web framework (based on backbone.js) • Linked Data publishing suite • Stardog Linked Data Catalog... Enterprise Linked Data management app
  • 22. Finally... • Now we can talk about something that’s objective, context-free, and measurable • Yes! • But no… #include <std_disclaim.h> • Your data & your queries are the only things that really matter
  • 23. That said... • Two de facto benchmarks for SPARQL: • BSBM, OLTP-style, measured in query mixes per hour (QMpH; queries per hour = QMpH × 25) • SP2B, OLAP-style (torture test), set of queries within a timeout, T, at a data size D
  • 24. SP2B • Stardog completes SP2B at 5M, 10M, and 25M (except q5a) • No other RDF database completes > 5M. (As of the most recent report. Things change.) • Considerable performance differential • Pushing this out to 100M+ in 1.x
  • 25. BSBM • A throughput test, primarily. Not necessarily simple queries • On a modest machine, 255 clients, 10M triples, we sustain 7M queries per hour (277k QMpH) • At 100M triples, 255 clients, we sustain 3M queries per hour (125k QMpH) • Among the top 2 or 3 RDF DBs for BSBM performance • We will tackle BSBM BI next...
  • 26. Data Loading • Two indexing modes • Triples only indexing • Faster loading, slower NG query • Up to 250,000 triples per second • Quads indexing • Slower loading, faster NG query • Up to 150,000 triples per second • More improvements coming in the future • Customized RDF parser • Will look at user-defined index subsets
  • 27. What’s new in 1.1 • Aforementioned user-defined rules • But most notably, SPARQL 1.1 • Our most requested feature in a survey • Oh, we also made it faster
  • 28. SPARQL 1.1 • Latest revision of the SPARQL query language • Put off implementing until spec finalized • It’s still in flux, but we decided to go for it • Adds useful new features to SPARQL • Aggregates, grouping, sub-query, negation • Oh, and the entailment regimes
  • 29. SPARQL 1.1 • Rewrite of query planner & engine for 1.0.5 • Changes needed to support SPARQL 1.1 • Tested by users for the past 3 releases • With great power comes great responsibility... • New features are not without cost • Query planning & optimization more crucial than ever • Majority of development time
  • 30. Roadmap 1. Transitivity & equality 2. GeoSPARQL 3. Web Console 4. Statement identifiers 5. Stored procedures & database triggers 6. “Stardocs”: doc/blob storage & NLP analytics 7. Graph Traversals, Algorithms & query langs 8. Statistical inference & machine learning 9. Stardog 2.0: Distributed Cluster Super Cloud Thingie!
  • 31. Summary Easier. Smarter. Faster. Pick all three!
  • 34. Feature Rich • Support for RDFS, OWL 2 profiles (EL, RL, QL) & OWL 2 DL via schema-only queries • Semantic Search • ICV • Transactions • Rich security model • Support for major APIs • Jena & Sesame, and our own SNARL • SPARQL HTTP protocol, Graph Store protocol • Also includes a CLI & Shell environment
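
The performance figures on slides 23, 25 and 26 can be sanity-checked with a little arithmetic. The sketch below is not part of the deck. It assumes that the "25" on slide 23 is the number of queries in one BSBM query mix, and it uses a hypothetical 100M-triple dataset to turn the "up to" loading rates into best-case load times. The function names are made up for illustration.

```python
# Back-of-envelope check of the benchmark numbers quoted in the slides.
# Assumption: one BSBM query mix = 25 queries (the "25" on slide 23).
QUERIES_PER_MIX = 25


def queries_per_hour(qmph):
    """Convert query mixes per hour (QMpH) into individual queries per hour."""
    return qmph * QUERIES_PER_MIX


def load_seconds(triples, triples_per_second):
    """Best-case load time in seconds, since the slide rates are 'up to' figures."""
    return triples / triples_per_second


if __name__ == "__main__":
    # Slide 25: 277k QMpH at 10M triples, 125k QMpH at 100M triples.
    for label, qmph in [("10M triples", 277_000), ("100M triples", 125_000)]:
        print(f"{label}: {queries_per_hour(qmph):,} queries/hour")

    # Slide 26: loading rates per indexing mode, applied to a
    # hypothetical 100M-triple dataset (not a figure from the deck).
    dataset = 100_000_000
    for mode, rate in [("triples-only", 250_000), ("quads", 150_000)]:
        print(f"{mode}: at least {load_seconds(dataset, rate):.0f} s")
```

Under that assumption, 277k QMpH comes to about 6.9M queries per hour and 125k QMpH to about 3.1M, which matches the rounded "7M" and "3M" on slide 25.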