SlideShare a Scribd company logo
1 of 34
Download to read offline
+
NoSQL – Part 2
CAP Theorem & Column Oriented
Mohammad Sadegh Salehi
Dr.Baraani
Winter2015 Sheikh Bahaie
University
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
2
Winter 2015
Agenda
—Review NoSQL
—Dynamo and BigTable
—NoSQL Classification
—Key-value Stores
—Column Oriented
—Casandra
—Why Casandra
—Question
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
3
Winter 2015
What is NoSQL
review
 Stands for Not Only SQL
 Class of non-relational data storage systems
 Usually do not require a fixed table schema nor do
they use the concept of joins
 All NoSQL offerings relax one or more of the ACID
properties (will talk about the CAP theorem)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
4
Winter 2015
Dynamo and BigTable
 Three major papers were the seeds of the NoSQL
movement
• BigTable (Google)
• Dynamo (Amazon)
—Gossip protocol (discovery and error detection)
—Distributed key-value data store
—Eventual consistency
• CAP Theorem (discuss in a sec ..)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
5
Winter 2015
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
7
Winter 2015
What kinds of NoSQL
Review
 NoSQL solutions fall into two major areas:
• Key/Value or ‘the big hash table’.
—Amazon S3 (Dynamo)
—Voldemort
—Scalaris
• Schema-less which comes in multiple flavors, column-
based, document-based or graph-based.
—Cassandra (column-based)
—CouchDB (document-based)
—Neo4J (graph-based)
—HBase (column-based)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
8
Winter 2015
Key-Value Stores
 Extremely simple interface
• Data model: (key, value) pairs
• Operations:
—Insert(key,value),
—Fetch(key),
—Update(key),
—Delete(key).
 Implementation: efficiency, scalability, fault-
tolerance
• Records distributed to nodes based on key
• Replication
• Single-record transactions,“eventual consistency”
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
9
Winter 2015
Key-Value Data Stores
 Storing Session Information
 User Profiles, Preferences: Almost every user has
a unique userID as well as preferences such as
language, color, timezone, which products the
user has access to , and so on.
Suitable Use Cases
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
10
Winter 2015
Key-Value Data Stores
 As we want the shopping carts to be available
all the time, across browsers, machines, and
sessions, all the shopping information can be put
into value where the key is the userID
Shopping Cart Data
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
11
Winter 2015
Key-Value Data Stores
 Relationships among data
 Multi-operation Transactions
 Query by Data
 Operations by Sets
Not to Use
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
12
Winter 2015
Column-oriented
 Store data in column order
 Allow key-value pairs to be stored (and retrieved
on key) in a massively parallel system,
• Data model: families of attributes defined in a schema,
new attributes can be added,
• Storing principle: big hashed distributed tables,
• Properties: partitioning (horizontally and/or vertically),
high availability etc. completely transparent to
application,
Intro
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
13
Winter 2015
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
14
Winter 2015
Cassandra
 Apache Cassandra™ is a free
 Distributed…
 High performance…
 Extremely scalable…
 Fault tolerant (i.e. no single point of failure)…
 Post-relational database solution.
 Cassandra can serve as both real-time datastore and as a
read-intensive database.
 Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ...
Thrift
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
15
Winter 2015
Cassandra
Infographic
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
16
Winter 2015
Cassandra
 Originally developed at Facebook
 Follows the BigTable data model: column-oriented
 Uses the Dynamo Eventual Consistency model
 Written in Java
 Open-sourced and exists within the Apache family
 Uses Apache Thrift as it’s API
 Some of its myriad users:
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
17
Winter 2015
Cassandra
 keyspace: Usually the name of the application; e.g.,
'Twitter', 'Wordpress‘.
 column family: structure containing an unlimited
number of rows
• Simple
• Super (nested Column Families)
 column: a tuple with name, value and time stamp
• Each Column has
— Name
— Value
— Timestamp
 key: name of record
 super column: contains more columns
Data Model
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
18
Winter 2015
Cassandra – Data Model
keyspace
settings
column family
settings
column
name value timestamp
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
19
Winter 2015
Cassandra
Column Family & Super Column Family
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
20
Winter 2015
Cassandra
 Cassandra was designed with the understanding that
system/hardware failures can and do occur
 Peer-to-peer, distributed system
 All nodes the same
 Data partitioned among all nodes
in the cluster
 Custom data replication to ensure
fault tolerance
 Read/Write-anywhere design
Architecture Overview
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
21
Winter 2015
Cassandra
 Each node communicates with each other through the
Gossip protocol, which exchanges information across
the cluster every second,
 A commit log is used on each node to capture write
activity. Data durability is assured,
 Data also written to an in-memory
structure (memtable) and then to
disk once the memory structure is
full (an SStable).
Architecture Overview
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
22
Winter 2015
Why Cassandra?
 Gigabyte to Petabyte scalability
 Linear performance gains through adding nodes
 No single point of failure
 Easy replication / data distribution
 Multi-data center and Cloud capable
 No need for separate caching layer
 Tunable data consistency
 Flexible schema design
 Data Compression
 CQL language (like SQL)
 Support for key languages and platforms
 No need for special hardware or software
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
23
Winter 2015
Why Cassandra?
 Capable of comfortably scaling to petabytes
 New nodes = Linear performance increases
 Add new nodes online
Big Data Scalability
1
2
Double Throughput
Capabilities
1
2
3
4
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
24
Winter 2015
Why Cassandra?
 All nodes the same
 Customized replication affords tunable data redundancy
 Read/write from any node
 Can replicate data among different physical data center
racks
No Single Point of Failure
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
25
Winter 2015
Why Cassandra?
 Peer-to-peer architecture removes need for special
caching layer and the programming that goes with it
 The database cluster uses the memory from all
participating nodes to cache the data assigned to each
node
 No irregularities between a memory cache and database
are encountered
No Need for Caching Software
Database Server
Memcached Servers
Application Servers
Writes
Reads
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
26
Winter 2015
Why Cassandra?
 Uses Google’s Snappy data compression algorithm
 Compresses data on a per column family level
 Internal tests at DataStax show up to 80%+ compression
of raw data
 No performance penalty (and some increases in overall
performance due to less physical I/O)!
Data Compression
Portfolio Keyspace
Customer Column Family
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
27
Winter 2015
Why Cassandra?
 Very similar to RDBMS SQL syntax
 Create objects via DDL (e.g. CREATE…)
 Core DML commands supported: INSERT, UPDATE,
DELETE
 Query data with SELECT
CQL Language
Portfolio Keyspace
1
2
3
4
5
6
SELECT *
FROM USERS
WHERE STATE = ‘TX’;
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
28
Winter 2015
Comparison with MySQL
 MySQL > 50 GB Data
Writes Average : ~300 ms
Reads Average : ~350 ms
 Stats provided by Authors using facebook data.
 Cassandra > 50 GB Data
Writes Average : 0.12 ms
Reads Average : 15 ms
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
29
Winter 2015
Cassandra Tools
........DesktopnoSqlCassandra-sadeghnoSqlCassandra-sadegh.mp4
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
30
Winter 2015
Where to get Cassandra?
 Go to www.datastax.com
 DataStax makes free smart start installers available for
Cassandra that include:
• The most up-to-date Cassandra version that is production quality
• A version of DataStax OpsCenter, which is a visual, browser-
based management tool for managing and monitoring
Cassandra
• Drivers and connectors for popular development languages
• Same database and application
• Automatic configuration assistance for ensuring optimal
performance and setup for either stand-alone or cluster
implementations
• Getting Started Guide
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
31
Winter 2015
Where Can I Learn More?
www.datastax.com
 Free Online Documentation
 User/Customer Cas Studies
 Technical White Papers
 Software downloads
 Technical Articles
 User Forums
 Videos
 Tutorials
 FAQ’s
 Blogs
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
32
Winter 2015
Resources
Sites
 Cassandra
• http://cassandra.apache.org
 NoSQL News websites
• http://nosql.mypopescu.com
• http://www.nosqldatabases.com
 “a practical guide to noSQL”, Posted by Denise Miura on
March 17, 2011 at
• http://blogs.marklogic.com/2011/03/17/a-practical-
guide-to-nosql/
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
33
Winter 2015
Resources
Books
 “Cassandra The Definition Guide”, O'Reilly Media, nov2013
 “Cassandra Essential Toturial”, DataStax 2014
 “Professional NoSQL”, Wrox, 2011
 “NoSQL Distilled”, Martin Fowler, 2013
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
34
Winter 2015
Questions
+
Mohammad Sadegh Salehi
3adegh.ce@gmail.com
Thank You

More Related Content

What's hot

An Introduction to AWS
An Introduction to AWSAn Introduction to AWS
An Introduction to AWSIan Massingham
 
Terraform modules restructured
Terraform modules restructuredTerraform modules restructured
Terraform modules restructuredAmi Mahloof
 
Apache Spark overview
Apache Spark overviewApache Spark overview
Apache Spark overviewDataArt
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introductionsudhakara st
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerAmazon Web Services
 
Terraform modules and best-practices - September 2018
Terraform modules and best-practices - September 2018Terraform modules and best-practices - September 2018
Terraform modules and best-practices - September 2018Anton Babenko
 
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...Edureka!
 
Getting Started with Apache Spark on Kubernetes
Getting Started with Apache Spark on KubernetesGetting Started with Apache Spark on Kubernetes
Getting Started with Apache Spark on KubernetesDatabricks
 
What is NoSQL and CAP Theorem
What is NoSQL and CAP TheoremWhat is NoSQL and CAP Theorem
What is NoSQL and CAP TheoremRahul Jain
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Araf Karsh Hamid
 

What's hot (20)

An Introduction to AWS
An Introduction to AWSAn Introduction to AWS
An Introduction to AWS
 
Terraform modules restructured
Terraform modules restructuredTerraform modules restructured
Terraform modules restructured
 
Amazon SQS overview
Amazon SQS overviewAmazon SQS overview
Amazon SQS overview
 
Amazon Redshift
Amazon Redshift Amazon Redshift
Amazon Redshift
 
Introduction to helm
Introduction to helmIntroduction to helm
Introduction to helm
 
Apache Spark overview
Apache Spark overviewApache Spark overview
Apache Spark overview
 
Spark
SparkSpark
Spark
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introduction
 
Spark
SparkSpark
Spark
 
Spark
SparkSpark
Spark
 
Final terraform
Final terraformFinal terraform
Final terraform
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMaker
 
Terraform modules and best-practices - September 2018
Terraform modules and best-practices - September 2018Terraform modules and best-practices - September 2018
Terraform modules and best-practices - September 2018
 
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...
Pyspark Tutorial | Introduction to Apache Spark with Python | PySpark Trainin...
 
Introduction to Amazon DynamoDB
Introduction to Amazon DynamoDBIntroduction to Amazon DynamoDB
Introduction to Amazon DynamoDB
 
Dynamodb ppt
Dynamodb pptDynamodb ppt
Dynamodb ppt
 
Getting Started with Apache Spark on Kubernetes
Getting Started with Apache Spark on KubernetesGetting Started with Apache Spark on Kubernetes
Getting Started with Apache Spark on Kubernetes
 
Aurora Deep Dive | AWS Floor28
Aurora Deep Dive | AWS Floor28Aurora Deep Dive | AWS Floor28
Aurora Deep Dive | AWS Floor28
 
What is NoSQL and CAP Theorem
What is NoSQL and CAP TheoremWhat is NoSQL and CAP Theorem
What is NoSQL and CAP Theorem
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 

Viewers also liked

Data Modeling with Cassandra Column Families
Data Modeling with Cassandra Column FamiliesData Modeling with Cassandra Column Families
Data Modeling with Cassandra Column Familiesgdusbabek
 
Usergrid Overview
Usergrid OverviewUsergrid Overview
Usergrid Overviewusergrid
 
Open Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraOpen Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraEd Anuff
 
Cybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBCybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBSukumar Nayak
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Johnny Miller
 
How to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesHow to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesMohammed A. Imran
 
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...Apigee | Google Cloud
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...DataStax
 
Apresentação cassandra
Apresentação cassandraApresentação cassandra
Apresentação cassandraRichiely Paiva
 
DataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax Academy
 
NoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPNoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPAricelio Souza
 
NoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBNoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBRodrigo Hjort
 
Advanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAdvanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAhmed Yasir Khan
 
Key-Value Stores: a practical overview
Key-Value Stores: a practical overviewKey-Value Stores: a practical overview
Key-Value Stores: a practical overviewMarc Seeger
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databasesArangoDB Database
 

Viewers also liked (20)

Data Modeling with Cassandra Column Families
Data Modeling with Cassandra Column FamiliesData Modeling with Cassandra Column Families
Data Modeling with Cassandra Column Families
 
Usergrid Overview
Usergrid OverviewUsergrid Overview
Usergrid Overview
 
Open Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraOpen Source Mobile Backend on Cassandra
Open Source Mobile Backend on Cassandra
 
Docker and CloudStack
Docker and CloudStackDocker and CloudStack
Docker and CloudStack
 
CQL3 in depth
CQL3 in depthCQL3 in depth
CQL3 in depth
 
Cybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBCybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DB
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1
 
How to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesHow to find Zero day vulnerabilities
How to find Zero day vulnerabilities
 
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
 
Taming NoSQL with Spring Data
Taming NoSQL with Spring DataTaming NoSQL with Spring Data
Taming NoSQL with Spring Data
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
 
Apresentação cassandra
Apresentação cassandraApresentação cassandra
Apresentação cassandra
 
DataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clause
 
NoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPNoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAP
 
NoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBNoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDB
 
CloudStack Architecture
CloudStack ArchitectureCloudStack Architecture
CloudStack Architecture
 
Advanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAdvanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated Terrabiz
 
Key-Value Stores: a practical overview
Key-Value Stores: a practical overviewKey-Value Stores: a practical overview
Key-Value Stores: a practical overview
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databases
 

Similar to NoSQL Database- cassandra column Base DB

NewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDNewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDTony Rogerson
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGuang Xu
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftSnapLogic
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017Pratim Das
 
Dynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDataWorks Summit
 
Svccg nosql 2011_v4
Svccg nosql 2011_v4Svccg nosql 2011_v4
Svccg nosql 2011_v4Sid Anand
 
Data stores: beyond relational databases
Data stores: beyond relational databasesData stores: beyond relational databases
Data stores: beyond relational databasesJavier García Magna
 
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Databricks
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ IndixRajesh Muppalla
 
Cloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsCloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsAsis Mohanty
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSAmazon Web Services
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Amazon Web Services
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream AnalyticsMarco Parenzan
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Trivadis
 
Spark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark Summit
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsKeeyong Han
 
Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksAnyscale
 
Dynamodb Presentation
Dynamodb PresentationDynamodb Presentation
Dynamodb Presentationadvaitdeo
 

Similar to NoSQL Database- cassandra column Base DB (20)

NewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDNewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACID
 
Gcp data engineer
Gcp data engineerGcp data engineer
Gcp data engineer
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheet
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017
 
MySQL Cluster
MySQL ClusterMySQL Cluster
MySQL Cluster
 
Dynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the fly
 
Svccg nosql 2011_v4
Svccg nosql 2011_v4Svccg nosql 2011_v4
Svccg nosql 2011_v4
 
Data stores: beyond relational databases
Data stores: beyond relational databasesData stores: beyond relational databases
Data stores: beyond relational databases
 
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ Indix
 
Cloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsCloud Lambda Architecture Patterns
Cloud Lambda Architecture Patterns
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
 
Spark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with Spark
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data Analytics
 
Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with Databricks
 
Dynamodb Presentation
Dynamodb PresentationDynamodb Presentation
Dynamodb Presentation
 

More from sadegh salehi

Cloud intrusion detection System
Cloud intrusion detection SystemCloud intrusion detection System
Cloud intrusion detection Systemsadegh salehi
 
Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB  Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB sadegh salehi
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهsadegh salehi
 
Prototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی PrototypePrototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی Prototypesadegh salehi
 
Backup and recovery in oracle
Backup and recovery in oracleBackup and recovery in oracle
Backup and recovery in oraclesadegh salehi
 
مکتب کلبیان
مکتب کلبیانمکتب کلبیان
مکتب کلبیانsadegh salehi
 

More from sadegh salehi (9)

Cloud intrusion detection System
Cloud intrusion detection SystemCloud intrusion detection System
Cloud intrusion detection System
 
Fault prediction
Fault predictionFault prediction
Fault prediction
 
Integration test
Integration testIntegration test
Integration test
 
Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB  Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغه
 
Prototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی PrototypePrototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی Prototype
 
Backup and recovery in oracle
Backup and recovery in oracleBackup and recovery in oracle
Backup and recovery in oracle
 
Jame isfahan mosque
Jame isfahan mosqueJame isfahan mosque
Jame isfahan mosque
 
مکتب کلبیان
مکتب کلبیانمکتب کلبیان
مکتب کلبیان
 

Recently uploaded

US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionMebane Rash
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsSachinPawar510423
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction managementMariconPadriquez1
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncssuser2ae721
 

Recently uploaded (20)

Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of Action
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documents
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction management
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
 

NoSQL Database- cassandra column Base DB

  • 1. + NoSQL – Part 2 CAP Theorem & Column Oriented Mohammad Sadegh Salehi Dr.Baraani Winter2015 Sheikh Bahaie University
  • 2. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 2 Winter 2015 Agenda —Review NoSQL —Dynamo and BigTable —NoSQL Classification —Key-value Stores —Column Oriented —Casandra —Why Casandra —Question
  • 3. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 3 Winter 2015 What is NoSQL review  Stands for Not Only SQL  Class of non-relational data storage systems  Usually do not require a fixed table schema nor do they use the concept of joins  All NoSQL offerings relax one or more of the ACID properties (will talk about the CAP theorem)
  • 4. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 4 Winter 2015 Dynamo and BigTable  Three major papers were the seeds of the NoSQL movement • BigTable (Google) • Dynamo (Amazon) —Gossip protocol (discovery and error detection) —Distributed key-value data store —Eventual consistency • CAP Theorem (discuss in a sec ..)
  • 5. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 5 Winter 2015
  • 6. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 7 Winter 2015 What kinds of NoSQL Review  NoSQL solutions fall into two major areas: • Key/Value or ‘the big hash table’. —Amazon S3 (Dynamo) —Voldemort —Scalaris • Schema-less which comes in multiple flavors, column- based, document-based or graph-based. —Cassandra (column-based) —CouchDB (document-based) —Neo4J (graph-based) —HBase (column-based)
  • 7. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 8 Winter 2015 Key-Value Stores  Extremely simple interface • Data model: (key, value) pairs • Operations: —Insert(key,value), —Fetch(key), —Update(key), —Delete(key).  Implementation: efficiency, scalability, fault- tolerance • Records distributed to nodes based on key • Replication • Single-record transactions,“eventual consistency”
  • 8. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 9 Winter 2015 Key-Value Data Stores  Storing Session Information  User Profiles, Preferences: Almost every user has a unique userID as well as preferences such as language, color, timezone, which products the user has access to , and so on. Suitable Use Cases
  • 9. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 10 Winter 2015 Key-Value Data Stores  As we want the shopping carts to be available all the time, across browsers, machines, and sessions, all the shopping information can be put into value where the key is the userID Shopping Cart Data
  • 10. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 11 Winter 2015 Key-Value Data Stores  Relationships among data  Multi-operation Transactions  Query by Data  Operations by Sets Not to Use
  • 11. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 12 Winter 2015 Column-oriented  Store data in column order  Allow key-value pairs to be stored (and retrieved on key) in a massively parallel system, • Data model: families of attributes defined in a schema, new attributes can be added, • Storing principle: big hashed distributed tables, • Properties: partitioning (horizontally and/or vertically), high availability etc. completely transparent to application, Intro
  • 12. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 13 Winter 2015
  • 13. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 14 Winter 2015 Cassandra  Apache Cassandra™ is a free  Distributed…  High performance…  Extremely scalable…  Fault tolerant (i.e. no single point of failure)…  Post-relational database solution.  Cassandra can serve as both real-time datastore and as a read-intensive database.  Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ... Thrift
  • 14. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 15 Winter 2015 Cassandra Infographic
  • 15. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 16 Winter 2015 Cassandra  Originally developed at Facebook  Follows the BigTable data model: column-oriented  Uses the Dynamo Eventual Consistency model  Written in Java  Open-sourced and exists within the Apache family  Uses Apache Thrift as it’s API  Some of its myriad users:
  • 16. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 17 Winter 2015 Cassandra  keyspace: Usually the name of the application; e.g., 'Twitter', 'Wordpress‘.  column family: structure containing an unlimited number of rows • Simple • Super (nested Column Families)  column: a tuple with name, value and time stamp • Each Column has — Name — Value — Timestamp  key: name of record  super column: contains more columns Data Model
  • 17. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 18 Winter 2015 Cassandra – Data Model keyspace settings column family settings column name value timestamp
  • 18. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 19 Winter 2015 Cassandra Column Family & Super Column Family
  • 19. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 20 Winter 2015 Cassandra  Cassandra was designed with the understanding that system/hardware failures can and do occur  Peer-to-peer, distributed system  All nodes the same  Data partitioned among all nodes in the cluster  Custom data replication to ensure fault tolerance  Read/Write-anywhere design Architecture Overview
  • 20. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 21 Winter 2015 Cassandra  Each node communicates with each other through the Gossip protocol, which exchanges information across the cluster every second,  A commit log is used on each node to capture write activity. Data durability is assured,  Data also written to an in-memory structure (memtable) and then to disk once the memory structure is full (an SStable). Architecture Overview
  • 21. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 22 Winter 2015 Why Cassandra?  Gigabyte to Petabyte scalability  Linear performance gains through adding nodes  No single point of failure  Easy replication / data distribution  Multi-data center and Cloud capable  No need for separate caching layer  Tunable data consistency  Flexible schema design  Data Compression  CQL language (like SQL)  Support for key languages and platforms  No need for special hardware or software
  • 22. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 23 Winter 2015 Why Cassandra?  Capable of comfortably scaling to petabytes  New nodes = Linear performance increases  Add new nodes online Big Data Scalability 1 2 Double Throughput Capabilities 1 2 3 4
  • 23. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 24 Winter 2015 Why Cassandra?  All nodes the same  Customized replication affords tunable data redundancy  Read/write from any node  Can replicate data among different physical data center racks No Single Point of Failure
  • 24. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 25 Winter 2015 Why Cassandra?  Peer-to-peer architecture removes need for special caching layer and the programming that goes with it  The database cluster uses the memory from all participating nodes to cache the data assigned to each node  No irregularities between a memory cache and database are encountered No Need for Caching Software Database Server Memcached Servers Application Servers Writes Reads
  • 25. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 26 Winter 2015 Why Cassandra?  Uses Google’s Snappy data compression algorithm  Compresses data on a per column family level  Internal tests at DataStax show up to 80%+ compression of raw data  No performance penalty (and some increases in overall performance due to less physical I/O)! Data Compression Portfolio Keyspace Customer Column Family
  • 26. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 27 Winter 2015 Why Cassandra?  Very similar to RDBMS SQL syntax  Create objects via DDL (e.g. CREATE…)  Core DML commands supported: INSERT, UPDATE, DELETE  Query data with SELECT CQL Language Portfolio Keyspace 1 2 3 4 5 6 SELECT * FROM USERS WHERE STATE = ‘TX’;
  • 27. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 28 Winter 2015 Comparison with MySQL  MySQL > 50 GB Data Writes Average : ~300 ms Reads Average : ~350 ms  Stats provided by Authors using facebook data.  Cassandra > 50 GB Data Writes Average : 0.12 ms Reads Average : 15 ms
  • 28. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 29 Winter 2015 Cassandra Tools ........DesktopnoSqlCassandra-sadeghnoSqlCassandra-sadegh.mp4
  • 29. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 30 Winter 2015 Where to get Cassandra?  Go to www.datastax.com  DataStax makes free smart start installers available for Cassandra that include: • The most up-to-date Cassandra version that is production quality • A version of DataStax OpsCenter, which is a visual, browser- based management tool for managing and monitoring Cassandra • Drivers and connectors for popular development languages • Same database and application • Automatic configuration assistance for ensuring optimal performance and setup for either stand-alone or cluster implementations • Getting Started Guide
  • 30. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 31 Winter 2015 Where Can I Learn More? www.datastax.com  Free Online Documentation  User/Customer Cas Studies  Technical White Papers  Software downloads  Technical Articles  User Forums  Videos  Tutorials  FAQ’s  Blogs
  • 31. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 32 Winter 2015 Resources Sites  Cassandra • http://cassandra.apache.org  NoSQL News websites • http://nosql.mypopescu.com • http://www.nosqldatabases.com  “a practical guide to noSQL”, Posted by Denise Miura on March 17, 2011 at • http://blogs.marklogic.com/2011/03/17/a-practical- guide-to-nosql/
  • 32. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 33 Winter 2015 Resources Books  “Cassandra The Definition Guide”, O'Reilly Media, nov2013  “Cassandra Essential Toturial”, DataStax 2014  “Professional NoSQL”, Wrox, 2011  “NoSQL Distilled”, Martin Fowler, 2013
  • 33. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 34 Winter 2015 Questions