SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Peer-to-
peer
distributed
database
Handle
large
volumes of
data
Write and
read
throughpu
High
Avaiability
No special nodes
No cluster
masters
No slaves
No
elected
leaders
Data Partitioning
Shared nothing
architecture
Each node is
responsible for
part of the data
The act of
distributing data
across nodes is
referred to as
data
partitioning.
Consistent Hashing
Two main problems when trying to
distribute data efficiently
Determining a node on
which a specific piece of
data should reside on
Minimizing data
movement when adding
or removing nodes
Consistent Hashing
Map Cassandra row
keys to physical nodes
On average only k/n
keys need to be
remapped where k is
the number of keys and
n is the number of nodes
Data Replication
Storing copies of
data on multiple
nodes is referred
to as replication
Replication of
data ensures fault
tolerance and
reliability
Eventual Consistency
Ensure that data
is synchronized
across replicas
Theoretically
guarantees that,
all nodes/replicas
will eventually
return the last
updated value
Example : DNS
Tunable Consistency
Users can determine the
consistency level by tuning it
via read and write operations
Consistency Level
Configure the number
of replicas in a cluster
that must
acknowledge a read
or write operation
Before considering
the operation
successful
Gossip Protocol
Nodes discover
information about other
nodes by exchanging
state information about
themselves and other
nodes they know about
Not exchange information
with every other node
Merkle Tree
A hash tree which provides an efficient
way to find differences in data blocks
Leaves contain
hashes of
individual data
blocks
Parent nodes
contain hashes of
their respective
children
Enables efficient
way of finding
differences
between nodes.
SSTable
A Sorted String
Table (SSTable)
ordered immutable
key value map
Efficient way of
storing large sorted
data segments in a
file
Write Back Cache
Write operation is only
directed to the cache and
completion is immediately
confirmed
Memtable
Write back cache residing in
memory which has not been
flushed to disk yet.
Cassandra Keyspace
Similar to a schema
in the RDBMS
When defining a keyspace,
Specify a
replication strategy
Replication factor
i.e. the number of
nodes that the data
must be replicate too
Column Family
Analogous to the
concept of a
table in an
RDBMS
Think of a column
family as a map
of sorted map
A row in the map
provides access
to a set of
columns which is
represented by a
sorted map
Row Key
Also known as
the partition
key
Has a number
of columns
associated with
Is responsible
for determining
data
distribution
across a cluster.
Cassandra Cluster/Ring
Cluster Bootstrapping
All nodes
participating in a
cluster have the
same name as
Cluster
Seed nodes
Are used during
start up to help
discover all
participating nodes
Have no special
purpose other than
helping bootstrap
the cluster using
the gossip protocol
When a node starts
up it looks to its
seed list to obtain
information about
the other nodes in
the cluster
Cassandra Ring
Visualized as a ring
because it uses a
consistent hashing
algorithm to distribute
data
At start up each node is assigned a token
range
Determines its
position in the cluster
Range of data stored
by the node
Each node receives a
proportionate range
of the token ranges
to ensure that data is
spread evenly across
the ring
Cassandra Ring
Each node is
responsible for a
certain set of data
Which is determined by
the partitioner
Partitioner is a hash
function for computing
the resultant token for a
particular row key
This token is then used
to determine the node
which will store the first
replica
Partitioner
Murmur3Partitioner
(default)
RandomPartitioner ByteOrderedPartitioner.
Replication
According to the chosen
replication strategy
Simple Strategy
Places the
subsequent replicas
on the next node in a
clockwise manner
Network Topology
Strategy
Works well when
Cassandra is
deployed across data
centres
Placement
First replica for the
data is determined by
the partitioner
The placement of the
subsequent replicas is
determined by the
replication strategy.
Cassandra Write Path
Cassandra
is masterless
Clients can interface with
a Cassandra node using
Thrift
protocol
CQL
Cassandra Write Path
Cassandra Read Path
Similar to a write
operation
Client can connect
with any node in the
cluster
The chosen node is
called the
coordinator and is
responsible for
returning the
requested data
A row key must be
supplied for every
read operation
The coordinator
uses the row key to
determine the first
replica
The replication
strategy in
conjunction with the
replication factor is
used to determine
all other applicable
replicas
HTTPS://DZONE.COM/ARTICLES/INTRODUCTION-
APACHE-CASSANDRAS

Weitere ähnliche Inhalte

Was ist angesagt?

Simple Load Rebalancing For Distributed Hash Tables In Cloud
Simple Load Rebalancing For Distributed Hash Tables In CloudSimple Load Rebalancing For Distributed Hash Tables In Cloud
Simple Load Rebalancing For Distributed Hash Tables In CloudIOSR Journals
 
Apache cassandra an introduction
Apache cassandra  an introductionApache cassandra  an introduction
Apache cassandra an introductionShehaaz Saif
 
JPN1412 Transmission-Efficient Clustering Method for Wireless Sensor Networ...
JPN1412   Transmission-Efficient Clustering Method for Wireless Sensor Networ...JPN1412   Transmission-Efficient Clustering Method for Wireless Sensor Networ...
JPN1412 Transmission-Efficient Clustering Method for Wireless Sensor Networ...chennaijp
 
Cassandra advanced part-ll
Cassandra advanced part-llCassandra advanced part-ll
Cassandra advanced part-llachudhivi
 
Spanner - Google distributed database
Spanner - Google distributed databaseSpanner - Google distributed database
Spanner - Google distributed databaseAbhra Basak
 
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...Maciek Jozwiak
 
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor Networks
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor NetworksRedundant Actor Based Multi-Hole Healing System for Mobile Sensor Networks
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor NetworksEditor IJCATR
 
Using SweetSpotSimilarity for Solr Fulltext Indexing
Using SweetSpotSimilarity for Solr Fulltext IndexingUsing SweetSpotSimilarity for Solr Fulltext Indexing
Using SweetSpotSimilarity for Solr Fulltext IndexingJay Luker
 

Was ist angesagt? (12)

Simple Load Rebalancing For Distributed Hash Tables In Cloud
Simple Load Rebalancing For Distributed Hash Tables In CloudSimple Load Rebalancing For Distributed Hash Tables In Cloud
Simple Load Rebalancing For Distributed Hash Tables In Cloud
 
Apache cassandra an introduction
Apache cassandra  an introductionApache cassandra  an introduction
Apache cassandra an introduction
 
JPN1412 Transmission-Efficient Clustering Method for Wireless Sensor Networ...
JPN1412   Transmission-Efficient Clustering Method for Wireless Sensor Networ...JPN1412   Transmission-Efficient Clustering Method for Wireless Sensor Networ...
JPN1412 Transmission-Efficient Clustering Method for Wireless Sensor Networ...
 
GaianDB
GaianDBGaianDB
GaianDB
 
Cassandra advanced part-ll
Cassandra advanced part-llCassandra advanced part-ll
Cassandra advanced part-ll
 
Ch21
Ch21Ch21
Ch21
 
Spanner - Google distributed database
Spanner - Google distributed databaseSpanner - Google distributed database
Spanner - Google distributed database
 
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...
Google Spanner - Synchronously-Replicated, Globally-Distributed, Multi-Versio...
 
Cassandra20141009
Cassandra20141009Cassandra20141009
Cassandra20141009
 
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor Networks
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor NetworksRedundant Actor Based Multi-Hole Healing System for Mobile Sensor Networks
Redundant Actor Based Multi-Hole Healing System for Mobile Sensor Networks
 
9517cnc01
9517cnc019517cnc01
9517cnc01
 
Using SweetSpotSimilarity for Solr Fulltext Indexing
Using SweetSpotSimilarity for Solr Fulltext IndexingUsing SweetSpotSimilarity for Solr Fulltext Indexing
Using SweetSpotSimilarity for Solr Fulltext Indexing
 

Ähnlich wie Cassandra v1.0

Dynamo cassandra
Dynamo cassandraDynamo cassandra
Dynamo cassandraWu Liang
 
Cassandra & Python - Springfield MO User Group
Cassandra & Python - Springfield MO User GroupCassandra & Python - Springfield MO User Group
Cassandra & Python - Springfield MO User GroupAdam Hutson
 
Cassandra internals
Cassandra internalsCassandra internals
Cassandra internalsnarsiman
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraDataStax
 
5266732.ppt
5266732.ppt5266732.ppt
5266732.ppthothyfa
 
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS20. Parallel Databases in DBMS
20. Parallel Databases in DBMSkoolkampus
 
Cassandra advanced-I
Cassandra advanced-ICassandra advanced-I
Cassandra advanced-Iachudhivi
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelJenny Liu
 
Cassandra consistency
Cassandra consistencyCassandra consistency
Cassandra consistencyzqhxuyuan
 
Routing in Wireless Sensor Network
Routing in Wireless Sensor NetworkRouting in Wireless Sensor Network
Routing in Wireless Sensor NetworkAarthi Raghavendra
 
Cassandra for Sysadmins
Cassandra for SysadminsCassandra for Sysadmins
Cassandra for SysadminsNathan Milford
 
ADBS_parallel Databases in Advanced DBMS
ADBS_parallel Databases in Advanced DBMSADBS_parallel Databases in Advanced DBMS
ADBS_parallel Databases in Advanced DBMSchandugoswami
 
Tapestry
TapestryTapestry
TapestrySutha31
 
Java Abs Scalable Wireless Ad Hoc Network Simulation Using
Java Abs   Scalable Wireless Ad Hoc Network Simulation UsingJava Abs   Scalable Wireless Ad Hoc Network Simulation Using
Java Abs Scalable Wireless Ad Hoc Network Simulation Usingncct
 
Cassandra - Research Paper Overview
Cassandra - Research Paper OverviewCassandra - Research Paper Overview
Cassandra - Research Paper Overviewsameiralk
 

Ähnlich wie Cassandra v1.0 (20)

Dynamo cassandra
Dynamo cassandraDynamo cassandra
Dynamo cassandra
 
Cassandra
CassandraCassandra
Cassandra
 
Cassandra & Python - Springfield MO User Group
Cassandra & Python - Springfield MO User GroupCassandra & Python - Springfield MO User Group
Cassandra & Python - Springfield MO User Group
 
Cassandra no sql ecosystem
Cassandra no sql ecosystemCassandra no sql ecosystem
Cassandra no sql ecosystem
 
Cassandra internals
Cassandra internalsCassandra internals
Cassandra internals
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache Cassandra
 
5266732.ppt
5266732.ppt5266732.ppt
5266732.ppt
 
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS20. Parallel Databases in DBMS
20. Parallel Databases in DBMS
 
Cassandra advanced-I
Cassandra advanced-ICassandra advanced-I
Cassandra advanced-I
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in Parallel
 
Cassandra architecture
Cassandra architectureCassandra architecture
Cassandra architecture
 
C04511822
C04511822C04511822
C04511822
 
Cassandra consistency
Cassandra consistencyCassandra consistency
Cassandra consistency
 
ROUTING STRATEGIES IN DTN
ROUTING STRATEGIES IN DTNROUTING STRATEGIES IN DTN
ROUTING STRATEGIES IN DTN
 
Routing in Wireless Sensor Network
Routing in Wireless Sensor NetworkRouting in Wireless Sensor Network
Routing in Wireless Sensor Network
 
Cassandra for Sysadmins
Cassandra for SysadminsCassandra for Sysadmins
Cassandra for Sysadmins
 
ADBS_parallel Databases in Advanced DBMS
ADBS_parallel Databases in Advanced DBMSADBS_parallel Databases in Advanced DBMS
ADBS_parallel Databases in Advanced DBMS
 
Tapestry
TapestryTapestry
Tapestry
 
Java Abs Scalable Wireless Ad Hoc Network Simulation Using
Java Abs   Scalable Wireless Ad Hoc Network Simulation UsingJava Abs   Scalable Wireless Ad Hoc Network Simulation Using
Java Abs Scalable Wireless Ad Hoc Network Simulation Using
 
Cassandra - Research Paper Overview
Cassandra - Research Paper OverviewCassandra - Research Paper Overview
Cassandra - Research Paper Overview
 

Mehr von RastinKenarsari

Mehr von RastinKenarsari (7)

4 Management Styles to Strive
4 Management Styles to Strive4 Management Styles to Strive
4 Management Styles to Strive
 
4 Management Styles to Avoid
4 Management Styles to Avoid4 Management Styles to Avoid
4 Management Styles to Avoid
 
Openshift argo cd_v1_2
Openshift argo cd_v1_2Openshift argo cd_v1_2
Openshift argo cd_v1_2
 
All About Database v1.1
All About Database  v1.1All About Database  v1.1
All About Database v1.1
 
Tpm all you need v1.2
Tpm all you need v1.2Tpm all you need v1.2
Tpm all you need v1.2
 
Aws architect v1.0
Aws architect v1.0Aws architect v1.0
Aws architect v1.0
 
Amazon products v1-0
Amazon products v1-0Amazon products v1-0
Amazon products v1-0
 

Kürzlich hochgeladen

Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfproinshot.com
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfryanfarris8
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...software pro Development
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 

Kürzlich hochgeladen (20)

Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 

Cassandra v1.0

  • 1.
  • 3. Data Partitioning Shared nothing architecture Each node is responsible for part of the data The act of distributing data across nodes is referred to as data partitioning.
  • 4. Consistent Hashing Two main problems when trying to distribute data efficiently Determining a node on which a specific piece of data should reside on Minimizing data movement when adding or removing nodes
  • 5. Consistent Hashing Map Cassandra row keys to physical nodes On average only k/n keys need to be remapped where k is the number of keys and n is the number of nodes
  • 6. Data Replication Storing copies of data on multiple nodes is referred to as replication Replication of data ensures fault tolerance and reliability
  • 7. Eventual Consistency Ensure that data is synchronized across replicas Theoretically guarantees that, all nodes/replicas will eventually return the last updated value Example : DNS
  • 8. Tunable Consistency Users can determine the consistency level by tuning it via read and write operations
  • 9. Consistency Level Configure the number of replicas in a cluster that must acknowledge a read or write operation Before considering the operation successful
  • 10. Gossip Protocol Nodes discover information about other nodes by exchanging state information about themselves and other nodes they know about Not exchange information with every other node
  • 11. Merkle Tree A hash tree which provides an efficient way to find differences in data blocks Leaves contain hashes of individual data blocks Parent nodes contain hashes of their respective children Enables efficient way of finding differences between nodes.
  • 12. SSTable A Sorted String Table (SSTable) ordered immutable key value map Efficient way of storing large sorted data segments in a file
  • 13. Write Back Cache Write operation is only directed to the cache and completion is immediately confirmed Memtable Write back cache residing in memory which has not been flushed to disk yet.
  • 14. Cassandra Keyspace Similar to a schema in the RDBMS When defining a keyspace, Specify a replication strategy Replication factor i.e. the number of nodes that the data must be replicate too
  • 15. Column Family Analogous to the concept of a table in an RDBMS Think of a column family as a map of sorted map A row in the map provides access to a set of columns which is represented by a sorted map
  • 16. Row Key Also known as the partition key Has a number of columns associated with Is responsible for determining data distribution across a cluster.
  • 18. Cluster Bootstrapping All nodes participating in a cluster have the same name as Cluster Seed nodes Are used during start up to help discover all participating nodes Have no special purpose other than helping bootstrap the cluster using the gossip protocol When a node starts up it looks to its seed list to obtain information about the other nodes in the cluster
  • 19. Cassandra Ring Visualized as a ring because it uses a consistent hashing algorithm to distribute data At start up each node is assigned a token range Determines its position in the cluster Range of data stored by the node Each node receives a proportionate range of the token ranges to ensure that data is spread evenly across the ring
  • 20. Cassandra Ring Each node is responsible for a certain set of data Which is determined by the partitioner Partitioner is a hash function for computing the resultant token for a particular row key This token is then used to determine the node which will store the first replica Partitioner Murmur3Partitioner (default) RandomPartitioner ByteOrderedPartitioner.
  • 21. Replication According to the chosen replication strategy Simple Strategy Places the subsequent replicas on the next node in a clockwise manner Network Topology Strategy Works well when Cassandra is deployed across data centres Placement First replica for the data is determined by the partitioner The placement of the subsequent replicas is determined by the replication strategy.
  • 22. Cassandra Write Path Cassandra is masterless Clients can interface with a Cassandra node using Thrift protocol CQL
  • 24. Cassandra Read Path Similar to a write operation Client can connect with any node in the cluster The chosen node is called the coordinator and is responsible for returning the requested data A row key must be supplied for every read operation The coordinator uses the row key to determine the first replica The replication strategy in conjunction with the replication factor is used to determine all other applicable replicas