Here is the presentation from Warren Davidson, Director of Business Development, and Darren Wood, InfiniteGraph chief architect. The October 21, 2010 webinar, hosted by DBTA with InfiniteGraph and Riptano, covered new data technologies and how the NoSQL ("Not Only SQL") approach helps address some of the more complex application, scalability, and performance requirements of handling vast amounts of data, and of performing advanced analytics on those data volumes with greater ease and speed.
5. NoSQL Landscape
[Figure: NoSQL landscape chart with axes "Complexity" and "Performance"]
Key Value Stores: Voldemort – LinkedIn; Dynamo – Amazon
BigTable Clones: Cassandra – Facebook; HBase – Apache/Hadoop; Hypertable
Document databases: CouchDB – Apache; MongoDB
Graph Databases: Neo4j; HypergraphDB; AllegroGraph; Sones
Other labels on the chart: Social Network Analysis; Intelligence Community
6. Graph Databases
• A graph database is used to trace relationships among entities, most
commonly people, to any depth. Its characteristics are:
– Very simple, fixed schema
– Very complex data relationships
– Used to support complex associations among like entities.
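The characteristics above can be illustrated with a minimal in-memory sketch: one fixed node-and-edge schema, with relationships followed to a chosen depth. This is not InfiniteGraph's API. The `Graph` class and its methods are hypothetical, the names are borrowed from the example figure in this deck, and the edges and the `surname` attribute are assumed purely for illustration.

```python
from collections import deque


class Graph:
    """Hypothetical in-memory graph: nodes carry attributes, edges are undirected."""

    def __init__(self):
        self.attributes = {}  # node name -> dict of attributes
        self.edges = {}       # node name -> set of neighbouring node names

    def add_node(self, name, **attributes):
        self.attributes[name] = attributes
        self.edges.setdefault(name, set())

    def add_edge(self, a, b):
        # Store the relationship in both directions.
        self.edges.setdefault(a, set()).add(b)
        self.edges.setdefault(b, set()).add(a)

    def within_depth(self, start, max_depth):
        """Breadth-first search: map each node reachable within max_depth hops to its hop count."""
        hops = {start: 0}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            if hops[node] == max_depth:
                continue  # do not expand beyond the requested depth
            for neighbour in self.edges[node]:
                if neighbour not in hops:
                    hops[neighbour] = hops[node] + 1
                    queue.append(neighbour)
        return hops


# Assumed relationships, for illustration only.
g = Graph()
for person in ["John Jones", "Jane Jones-Smith", "Jim Smith", "Jeff Smith"]:
    g.add_node(person, surname=person.split()[-1])
g.add_edge("John Jones", "Jane Jones-Smith")
g.add_edge("Jane Jones-Smith", "Jim Smith")
g.add_edge("Jim Smith", "Jeff Smith")

print(g.within_depth("John Jones", 2))
```

Raising `max_depth` widens the trace without any change to the schema, which is the sense in which the schema stays simple while the relationships grow complex.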
[Figure: Meta-Model Instance Example (simplified). Callout labels: Node, Edge, Attribute(s). Nodes shown: John Jones, Jane Jones-Smith, Nancy Jones, Paul Jones, Doris Smith, Jim Smith, Jeff Smith.]
8. Graphs are everywhere
Enterprise and government 2.0, bio-engineering, gene sequencing, drug development…
LinkedIn, Facebook…
Social network analytics, social CRM…
Network analysis, complex BoM, predictive and real-time ISR, fraud detection and response…
9. Graph Databases – What’s so Different?
Darren Wood
Chief Architect, InfiniteGraph
Key-value pair stores have a simple interface – Put, Get and Delete
Voldemort is a distributed key-value storage system implemented as a fault tolerant hash table
Dynamo – a distributed storage system, highly available key-value store, fault tolerant
BigTable – fast and extremely large scale, built on the distributed Google File System; MapReduce – distributed parallel processing
Cassandra – structured key-value store, column-family-based data model, eventually consistent; distributed systems technology from Dynamo, data model from Google's BigTable
HBase –
Hypertable –
CouchDB – a document-oriented database that can be queried and indexed in a MapReduce fashion using JavaScript
MongoDB – document-oriented, more complex schema model than just key/value pairs, written in C++, uses MapReduce for processing
Neo4j –
HyperGraphDB –
AllegroGraph
Sones -
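
The Put, Get and Delete interface that the notes above attribute to key-value stores can be sketched in a few lines. This is a single-process illustration only, with none of the distribution or fault tolerance of Voldemort or Dynamo. The `KeyValueStore` class, its method names, and the sample key are hypothetical.

```python
class KeyValueStore:
    """Hypothetical in-memory store exposing only put, get and delete."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # Insert a new value or overwrite the existing one.
        self._data[key] = value

    def get(self, key, default=None):
        # Return the stored value, or `default` when the key is absent.
        return self._data.get(key, default)

    def delete(self, key):
        # Return True if the key existed and was removed, False otherwise.
        if key in self._data:
            del self._data[key]
            return True
        return False


store = KeyValueStore()
store.put("user:1", {"name": "John Jones"})
print(store.get("user:1"))
print(store.delete("user:1"))
print(store.get("user:1"))
```

The store never inspects the value, so any question about how records relate to one another has to be answered in application code. That is the contrast with the graph model, where relationships are stored as data.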