SlideShare ist ein Scribd-Unternehmen logo
1 von 43
Downloaden Sie, um offline zu lesen
My First 100 days with a Cassandra
Cluster
Presented by :
Gustavo René Antúnez DBA Team Lead
Carlos Rolo Cassandra MVP
September, 2015
2
Welcome to Cassandra Summit 2015
• 18	
  Years	
  of	
  Data	
  infrastructure	
  
management	
  consulting
• 200+	
  Top	
  brands
• 6000+	
  databases	
  under	
  
management
• Over	
  400	
  DBA’s,	
  in	
  35	
  countries	
  
• Top	
  5%	
  of	
  DBA	
  work	
  force,	
  9	
  
Oracle	
  ACE’s,	
  2	
  Microsoft	
  
MVP’s,	
  1	
  Cassandra	
  MVP	
  
• Oracle,	
  Microsoft,	
  MySQL,	
  
Datastax	
  partners,	
  Netezza,	
  
Hadoop	
  and	
  MongoDB	
  plus	
  
UNIX	
  Sysadmin	
  and	
  Oracle	
  apps
About Pythian
Where does René come from
– Oracle	
  DBA	
  
• Started	
  with	
  Version	
  9.2	
  in	
  2004	
  
– Speaker	
  at	
  Oracle	
  Open	
  World,	
  
Developers	
  Day	
  and	
  Collaborate	
  	
  
– APress	
  Q1	
  2016:	
  “Prac%cal	
  Data	
  
Refresh”	
  
– Movie	
  Fanatic	
  &	
  Music	
  Lover	
  
– Bringing	
  the	
  best	
  from	
  México	
  
(Mexihtli)	
  to	
  the	
  rest	
  of	
  the	
  world	
  
and	
  in	
  the	
  process	
  photographing	
  it	
  :)	
  
– rene-­‐ace.com	
  
– @rene_ace
4
Where does Carlos come
5
• Cassandra	
  Consultant	
  	
  
• First	
  contact	
  was	
  0.8	
  	
  
• Cassandra	
  MVP	
  &	
  DataStax	
  
Certified	
  Architect	
  	
  
• Lisbon	
  Cassandra	
  Meetup	
  	
  
• Passion	
  for	
  distributed	
  systems	
  	
  
• Loves	
  a	
  good	
  challenge	
  	
  
• Waterpolo	
  is	
  my	
  sport	
  	
  
• @cjrolo
How did you get to be a DBA
6
6th Happiest Job of 2015!
7
http://www.forbes.com/sites/susanadams/2014/03/20/the-happiest-and-unhappiest-jobs-in-2014/
Work-life
balance
Relationship with
boss and co-workers
Daily tasks
Job resources
Field will grow by
15% between
2012 and 2022
DBA can be the
key driver of
success
Happiest Job of 2034?
Oxford University: THE FUTURE OF EMPLOYMENT: HOW SUSCEPTIBLE ARE JOBS TO COMPUTERISATION?
• 47	
  percent	
  of	
  American	
  jobs	
  are	
  at	
  high	
  risk	
  of	
  being	
  taken	
  by	
  computers	
  
within	
  the	
  next	
  two	
  decades.	
  
– 1st	
  Wave	
  	
  
• Computers	
  will	
  start	
  replacing	
  people	
  in	
  especially	
  vulnerable	
  
fields	
  like	
  transportation/logistics,	
  production	
  labor,	
  and	
  
administrative	
  support.	
  
– 2nd	
  Wave	
  
• Dependent	
  upon	
  the	
  development	
  of	
  good	
  artificial	
  intelligence.	
  
This	
  could	
  next	
  put	
  jobs	
  in	
  management,	
  science	
  and	
  
engineering,	
  and	
  the	
  arts	
  at	
  risk.
8
What is Cassandra ?

• NoSQL	
  database,	
  developed	
  in	
  JavaOne	
  	
  
• Fully	
  distributed	
  DB	
  
• Meaning	
  that	
  there	
  is	
  no	
  master	
  DB,	
  
unlike	
  Oracle	
  or	
  MySQL.	
  
• Linearly	
  scalable	
  
• Based	
  on	
  2	
  core	
  technologies,	
  Google’s	
  Big	
  
Table	
  and	
  Amazon’s	
  Dynamo	
  
• 2	
  versions	
  of	
  Cassandra	
  
• Community	
  Edition.-­‐	
  This	
  is	
  distributed	
  
under	
  the	
  Apache™	
  License	
  
• Enterprise	
  Edition	
  .-­‐	
  This	
  is	
  distributed	
  by	
  
Datastax
9
≠
CAP	
  Theorem
• In	
  a	
  distributed	
  system	
  you	
  can	
  only	
  have	
  two	
  
out	
  of	
  the	
  following	
  three	
  guarantees	
  across	
  a	
  
write/read	
  pair:	
  
• Consistency.-­‐	
  A	
  read	
  is	
  guaranteed	
  to	
  
return	
  the	
  most	
  recent	
  write	
  for	
  a	
  given	
  
client.	
  
• Availability.-­‐A	
  non-­‐failing	
  node	
  will	
  return	
  
a	
  reasonable	
  response	
  within	
  a	
  
reasonable	
  amount	
  of	
  time	
  (no	
  error	
  or	
  
timeout).	
  
• Partition	
  Tolerance.-­‐The	
  system	
  will	
  
continue	
  to	
  function	
  when	
  network	
  
partitions	
  occur.
10
N1 N2
X X
N1 N2
N1 N2
What is Cassandra ?

What is Cassandra ?

• Cassandra	
  is	
  a	
  BASE	
  (Basically	
  
Available,	
  Soft	
  state,	
  Eventually	
  
consistent)	
  type	
  system
11
• Not	
  an	
  ACID	
  (Atomicity,	
  Consistency,	
  
Isolation,	
  Durability)	
  type	
  system	
  
It Can be as easy as …
• Start	
  your	
  machine	
  and	
  install	
  the	
  following:	
  
• ntp	
  (Packages	
  are	
  normally	
  ntp,	
  ntpdata	
  and	
  ntp-­‐
doc)	
  
• wget	
  (Unless	
  you	
  have	
  your	
  packages	
  copied	
  over	
  via	
  
other	
  means)	
  
• vim	
  (Or	
  your	
  favorite	
  text	
  editor)	
  
• Yum	
  Package	
  Management	
  	
  
• Root	
  or	
  sudo	
  access	
  to	
  the	
  install	
  machine	
  
• Latest	
  version	
  of	
  Oracle	
  Java	
  SE	
  Runtime	
  
Environment	
  (JRE)	
  8	
  (recommended)	
  or	
  OpenJDK	
  7.	
  
• Python	
  2.6+	
  (needed	
  if	
  installing	
  OpsCenter)
12
It Can be as easy as …
13
• Install	
  Cassandra.	
  
~$ sudo yum install dsc21-2.1.5-1 cassandra2.1.5-1
• Install	
  optional	
  utilities.	
  
~$ sudo yum install cassandra21-tools-2.1.5-1
• Start	
  Cassandra	
  service	
  
~$ sudo service cassandra stop
~$ sudo rm -rf /var/lib/cassandra/data/system/*
• In	
  the	
  cassandra-­‐rackdc.properties	
  file	
  
#	
  indicate	
  the	
  rack	
  and	
  dc	
  for	
  this	
  node	
  
dc=Pythian	
  
rack=RAC1	
  
~$ sudo service cassandra start
Where is everything in Cassandra?
14
Directories Description
/var/lib/cassandra Data	
  directories
/var/log/	
  cassandra Log	
  directory
/var/run/	
  cassandra Runtime	
  files
/usr/share/	
  cassandra Environment	
  settings
/usr/share/	
  cassandra/
lib
JAR	
  files
/usr/bin Optional	
  utilities,	
  such	
  as	
  sstablelevelreset,	
  
sstablerepairedset,	
  and	
  sstablesplit
/usr/bin Binary	
  files
/usr/sbin
/etc/cassandra Configuration	
  files
/etc/init.d Service	
  startup	
  script
/etc/security/	
  limits.d Cassandra	
  user	
  limits
/etc/default
/usr/share/	
  doc/
cassandra/examples
Sample	
  cassandra.yaml	
  files	
  for	
  stress	
  
testing
I come from this world…
12c	
  Version	
  
Architecture…
15
I come from this world…
Oracle…
16
101010
Online Redo
Log10100
Data Files Control Files
Segment
Database
Tablespace
Extent
Oracle data
block
Schema Data file
OS block
Logical
Datafile
Physical
Datafile
I come from this world…
17
RAC	
  -­‐	
  For	
  Node	
  Point	
  of	
  Failure
RAC Cluster
Node3Node2
ASM Disks
Node1
Public Network
Storage Network
ASM Network
CSS Network
ASM ASM ASM
DBB DBBDBB
Global	
  Data	
  Services	
  	
  
– Service Failover / Load Balancing
I come from this world…
18
Dataguard	
  -­‐	
  For	
  Failover
Primary
Standby
Far	
  Sync	
  
Instance
SYNC
ASYNC
Zero	
  data	
  loss	
  failover
Cassandra Architecture
Cassandra	
  Cluster
19
N1
Node
N2
Node
Rack	
  1
Datacenter	
  México
N3
Node
N4
Node
Rack	
  2
Datacenter	
  Portugal
One Ring to Rule them All
20
• The	
  total	
  amount	
  of	
  data	
  
managed	
  by	
  the	
  cluster	
  is	
  
represented	
  as	
  a	
  ring	
  
• Each	
  node	
  is	
  assigned	
  a	
  part	
  of	
  
the	
  database	
  to	
  hold	
  based	
  on	
  
each	
  table’s	
  primary	
  key.	
  
• To	
  guarantee	
  both	
  availability	
  
and	
  durability	
  multiple	
  nodes	
  will	
  
be	
  assigned	
  to	
  the	
  same	
  data.	
  
• There	
  is	
  no	
  master	
  node	
  all	
  
nodes	
  can	
  perform	
  all	
  operations
1
4
3
2
A-F,T-Z,M-S
G-L,A-F,T-Z
M-S,G-L,A-F
T-Z,M-S,G-L
Gossip
21
• Peer-­‐to-­‐peer	
  communication	
  
protocol	
  in	
  which	
  nodes	
  periodically	
  
exchange	
  state	
  information	
  	
  
• Runs	
  every	
  second	
  and	
  exchanges	
  
state	
  messages	
  with	
  up	
  to	
  three	
  
other	
  nodes	
  in	
  the	
  cluster	
  	
  
• Failure	
  detection	
  	
  
• It	
  determines	
  locally	
  from	
  
gossip	
  state	
  and	
  history	
  if	
  
another	
  node	
  in	
  the	
  system	
  is	
  
down	
  or	
  has	
  come	
  back	
  up.
Consistent Hashing
22
• A	
  hash	
  consists	
  of	
  one	
  or	
  more	
  
arithmetic	
  operations	
  on	
  a	
  piece	
  of	
  
data	
  	
  
• Common	
  way	
  of	
  load	
  balancing	
  across	
  
several	
  nodes	
  
• Hash	
  function	
  must	
  have	
  a	
  upper	
  and	
  
lower	
  bound	
  so	
  objects	
  can	
  be	
  
mapped	
  in	
  a	
  circle	
  
• Common	
  Hash	
  algorithms	
  
– Simple	
  checksums	
  
– Message	
  Digest	
  (MD5)	
  
– Secure	
  Hash	
  Algorithm	
  (SHA-­‐1/2)	
  
– MurmurHash
Partitioners
23
• Determines	
  how	
  data	
  is	
  
distributed	
  across	
  the	
  nodes	
  
in	
  the	
  cluster	
  	
  
• Function	
  for	
  deriving	
  a	
  token	
  
representing	
  a	
  row	
  from	
  its	
  
partition	
  key	
  
Cassandra	
  Offers:	
  
– Murmur3Partition	
  
– RandomPartitioner	
  
– ByteOrderedPartitioner
Virtual Nodes
24
• Solution	
  for	
  avoiding	
  calculating	
  
node	
  tokens	
  and	
  thinking	
  about	
  
the	
  cluster	
  size	
  before	
  hand	
  
• Each	
  node	
  has	
  multiple	
  virtual	
  
nodes	
  
• Each	
  node	
  virtual	
  node	
  own	
  a	
  
much	
  smaller	
  subset	
  of	
  data	
  
Coordinators
25
• Acts	
  as	
  a	
  proxy	
  between	
  the	
  
client	
  application	
  and	
  the	
  
nodes	
  that	
  own	
  the	
  data	
  
being	
  requestedAny	
  client	
  
request	
  can	
  be	
  sent	
  to	
  any	
  
node.
Snitch
26
• Is	
  responsible	
  for	
  keeping	
  all	
  
of	
  the	
  nodes	
  up	
  to	
  date	
  on	
  
what	
  node	
  has	
  what	
  data,	
  
what	
  nodes	
  are	
  currently	
  
down,	
  what	
  nodes	
  are	
  
bootstrapping,	
  etc.	
  	
  
• It	
  Interprets	
  the	
  topology
The	
  most	
  popular	
  are:	
  
– Gossiping	
  property	
  file	
  
snitch	
  
– EC2	
  Snitch	
  
– EC2	
  Multi-­‐region	
  snitch	
  
– Dynamic	
  Snitch
27
Logical database container
28
Data	
  is	
  Stored	
  in	
  Keyspaces
A CASSANDRA TABLE OR COLUMN FAMILY
29
Coordinator
Snitch
Commitlog	
  Writer
Mem	
  table	
  writer
Mem	
  Table	
  Flush	
  (Sstable	
  
writer)
Reader
Mem	
  tables
Bloom	
  Filters
Cassandra	
  Node
CommitLog
10100
SSTables
A CASSANDRA TABLE OR COLUMN FAMILY
30
• Consists	
  of	
  one	
  or	
  more	
  SStables	
  and	
  
0	
  or	
  more	
  MEMtables	
  
• SStable	
  stands	
  for	
  Sorted	
  String	
  Table.	
  	
  
• E.G.	
  all	
  of	
  the	
  Columns	
  in	
  the	
  
SStable	
  are	
  sorted	
  in	
  order	
  by	
  
key.	
  
• Each	
  SStable	
  consists	
  of	
  the	
  data	
  
table,	
  bloom	
  filter,	
  index	
  and	
  some	
  
other	
  minor	
  files.	
  
• SStables	
  are	
  immutable.	
  Once	
  written	
  
they	
  are	
  never	
  altered	
  only	
  read	
  and	
  
eventually	
  deleted
videogames-events-data-jb-1.db
videogames-events-filters-jb-1.db
videogames-events-index-jb-1.db
videogames-events-data-jb-2.db
videogames-events-filters-jb-2.db
videogames-events-index-jb-2.db
videogames-events-data-jb-3.db
videogames-events-filters-jb-3.db
videogames-events-index-jb-3.db
videogames-events-data-jb-4.db
videogames-events-filters-jb-4.db
videogames-events-index-jb-4.db
SStables	
  on	
  disk	
  
/var/lib/cassandra
REPLICATION FACTOR (RF) AND CONSISTENCY
31
• Replication	
  Factor	
  is	
  the	
  
number	
  of	
  copies	
  of	
  
columns	
  stored	
  in	
  the	
  ring	
  
• Replication	
  factor	
  should	
  
not	
  exceed	
  the	
  number	
  of	
  
nodes	
  in	
  the	
  cluster
– RF=1	
  is	
  one	
  copy	
  this	
  means	
  that	
  
the	
  data	
  for	
  each	
  column	
  is	
  stored	
  
only	
  once	
  in	
  the	
  ring.	
  
– RF=3	
  (default)	
  means	
  every	
  
column	
  stored	
  in	
  the	
  database	
  is	
  
stored	
  three	
  times.	
  
– Quorum	
  .-­‐	
  The	
  read	
  and	
  write	
  
must	
  be	
  acked/returned	
  from	
  a	
  
quorum	
  of	
  nodes.
REPLICATION FACTOR (RF) AND CONSISTENCY
32
• Consistency	
  
– When	
  write	
  or	
  read	
  is	
  
performed	
  the	
  application	
  can	
  
choose	
  to	
  wait	
  for	
  n	
  copies	
  of	
  
the	
  data	
  to	
  be	
  written	
  or	
  read	
  
this	
  is	
  referred	
  to	
  as	
  consistency	
  
of	
  n.	
  
– There	
  is	
  a	
  special	
  consistency	
  
value	
  called	
  quorum	
  which	
  
means	
  a	
  response	
  from	
  RF/2+1	
  
nodes	
  is	
  required.
HOW TO MAKE SURE WE DON’T LOOSE DATA
33
• Three	
  anti-­‐entropy	
  mechanisms	
  in	
  Cassandra	
  
1)	
  Hinted	
  handoff	
  
2)	
  Read	
  repair	
  	
  
3)	
  Repair
A.K.A.	
  Anti-­‐Entropy
WRITE PATH
34
COMPACTIONS
35
• SStables	
  are	
  immutable.	
  
• Deletes	
  and	
  updates	
  are	
  just	
  new	
  
writes	
  	
  
• SStables	
  are	
  merged	
  together	
  by	
  
partitioned	
  key.Old	
  obsolete	
  data	
  is	
  
discarded.	
  
• Lots	
  of	
  SStables	
  become	
  a	
  few.	
  
• Compaction	
  can	
  require	
  a	
  lot	
  of	
  
disk	
  space.	
  DO	
  NOT	
  LET	
  your	
  disks	
  
get	
  more	
  than	
  50%	
  full.	
  	
  
CQL - Cassandra Query Language
36
CQL	
  is	
  not	
  SQL
• Default	
  and	
  primary	
  interface	
  into	
  the	
  Cassandra	
  Database	
  (since	
  2.0)	
  
• Cassandra	
  does	
  not	
  support	
  joins	
  or	
  subqueries	
  
• Only	
  way	
  to	
  create	
  users	
  and	
  user	
  based	
  permissions	
  
• Very	
  similar:	
  
cqlsh> CREATE KEYSPACE sandbox WITH REPLICATION = { 'class' :
'NetworkTopologyStrategy', DC1 : 1};
cqlsh> USE sandbox;
cqlsh:sandbox>CREATE TABLE data (id uuid, data text, PRIMARY KEY (id));
cqlsh:sandbox> INSERT INTO data (id, data) values
(c37d661d-7e61-49ea-96a5-68c34e83db3a, 'testing');
cqlsh:sandbox> SELECT * FROM data;
37
38
Feature/Function	
   DSE/Cassandra Oracle	
  RDBMS	
  
Core architecture “Masterless”; peer-to-peer with
all nodes being the same
Traditional standalone
High availability Continuous availability with built
in redundancy and hardware
rack awareness in both single
and multiple data centers
Oracle Dataguard (for failover)
and Oracle RAC (Node SPOF)
GoldenGate
Data model Google Bigtable Relational/tabular
Data consistency model Tunable consistency (CAP
theorem consistency per
operation
Traditional ACID
Storage model Targeted directories with
separation
Tablespaces
Logical database
container
Keyspace Database
Backup/recovery Online, point-in-time restore Online, point-in-time restore
Enterprise management/
monitoring
DataStax OpsCenter Oracle Enterprise Manager
LESSONS LEARNED
39
• Understand	
  the	
  Data	
  Model	
  Differences	
  
• Hardware	
  Setup	
  does	
  Matter	
  
• Grep	
  the	
  logs	
  for	
  errors	
  and	
  warnings	
  
• Make	
  sure	
  each	
  node	
  is	
  created	
  properly	
  
• Know	
  your	
  tools	
  
• nodetool	
  utility	
  
• Cassandra	
  bulk	
  loader	
  (sstableloader)	
  
• jconsole/JavaVisualVM	
  
• Cassandra-­‐Stress	
  
• OpsCenter
40
FIT-ACER
• F – Focus (SLOW DOWN! Are you ready?)
• I – Identify server/DB name, time, authorization
• T – Type the command (do not hit enter yet)
• A – Assess the command (SPEND TIME HERE!)
• C – Check the server / database name again
• E – Execute the command
• R – Review and document the results
41
42
rene-ace.com
43
To contact us
sales@pythian.com
1-877-PYTHIAN
To follow us
http://www.pythian.com/blog
http://www.facebook.com/pages/The-Pythian-Group/163902527671
@pythian
http://www.linkedin.com/company/pythian
Thank you – Q&A

Weitere ähnliche Inhalte

Was ist angesagt?

Cassandra background-and-architecture
Cassandra background-and-architectureCassandra background-and-architecture
Cassandra background-and-architectureMarkus Klems
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016DataStax
 
Cassandra Summit 2014: Apache Cassandra Best Practices at Ebay
Cassandra Summit 2014: Apache Cassandra Best Practices at EbayCassandra Summit 2014: Apache Cassandra Best Practices at Ebay
Cassandra Summit 2014: Apache Cassandra Best Practices at EbayDataStax Academy
 
Intro to cassandra
Intro to cassandraIntro to cassandra
Intro to cassandraAaron Ploetz
 
The Cassandra Distributed Database
The Cassandra Distributed DatabaseThe Cassandra Distributed Database
The Cassandra Distributed DatabaseEric Evans
 
Cassandra and Spark
Cassandra and SparkCassandra and Spark
Cassandra and Sparknickmbailey
 
Understanding Cassandra internals to solve real-world problems
Understanding Cassandra internals to solve real-world problemsUnderstanding Cassandra internals to solve real-world problems
Understanding Cassandra internals to solve real-world problemsAcunu
 
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag JambhekarC* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag JambhekarDataStax Academy
 
M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentationEdward Capriolo
 
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...DataStax
 
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...DataStax
 
Signal Digital: The Skinny on Wide Rows
Signal Digital: The Skinny on Wide RowsSignal Digital: The Skinny on Wide Rows
Signal Digital: The Skinny on Wide RowsDataStax Academy
 
Instaclustr webinar 2017 feb 08 japan
Instaclustr webinar 2017 feb 08   japanInstaclustr webinar 2017 feb 08   japan
Instaclustr webinar 2017 feb 08 japanHiromitsu Komatsu
 
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...DataStax Academy
 
Presentation of Apache Cassandra
Presentation of Apache Cassandra Presentation of Apache Cassandra
Presentation of Apache Cassandra Nikiforos Botis
 
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...DataStax
 
Webinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache CassandraWebinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache CassandraDataStax
 
Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26Benoit Perroud
 
Migration Best Practices: From RDBMS to Cassandra without a Hitch
Migration Best Practices: From RDBMS to Cassandra without a HitchMigration Best Practices: From RDBMS to Cassandra without a Hitch
Migration Best Practices: From RDBMS to Cassandra without a HitchDataStax Academy
 

Was ist angesagt? (20)

Cassandra background-and-architecture
Cassandra background-and-architectureCassandra background-and-architecture
Cassandra background-and-architecture
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
 
Cassandra Summit 2014: Apache Cassandra Best Practices at Ebay
Cassandra Summit 2014: Apache Cassandra Best Practices at EbayCassandra Summit 2014: Apache Cassandra Best Practices at Ebay
Cassandra Summit 2014: Apache Cassandra Best Practices at Ebay
 
Intro to cassandra
Intro to cassandraIntro to cassandra
Intro to cassandra
 
The Cassandra Distributed Database
The Cassandra Distributed DatabaseThe Cassandra Distributed Database
The Cassandra Distributed Database
 
Cassandra and Spark
Cassandra and SparkCassandra and Spark
Cassandra and Spark
 
Understanding Cassandra internals to solve real-world problems
Understanding Cassandra internals to solve real-world problemsUnderstanding Cassandra internals to solve real-world problems
Understanding Cassandra internals to solve real-world problems
 
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag JambhekarC* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
 
M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentation
 
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
 
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...
Micro-batching: High-performance Writes (Adam Zegelin, Instaclustr) | Cassand...
 
Signal Digital: The Skinny on Wide Rows
Signal Digital: The Skinny on Wide RowsSignal Digital: The Skinny on Wide Rows
Signal Digital: The Skinny on Wide Rows
 
Cassandra
CassandraCassandra
Cassandra
 
Instaclustr webinar 2017 feb 08 japan
Instaclustr webinar 2017 feb 08   japanInstaclustr webinar 2017 feb 08   japan
Instaclustr webinar 2017 feb 08 japan
 
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
 
Presentation of Apache Cassandra
Presentation of Apache Cassandra Presentation of Apache Cassandra
Presentation of Apache Cassandra
 
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
 
Webinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache CassandraWebinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache Cassandra
 
Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26
 
Migration Best Practices: From RDBMS to Cassandra without a Hitch
Migration Best Practices: From RDBMS to Cassandra without a HitchMigration Best Practices: From RDBMS to Cassandra without a Hitch
Migration Best Practices: From RDBMS to Cassandra without a Hitch
 

Andere mochten auch

Cassandra and security
Cassandra and securityCassandra and security
Cassandra and securityBen Bromhead
 
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable CassandraCassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandraaaronmorton
 
Hardening cassandra for compliance or paranoia
Hardening cassandra for compliance or paranoiaHardening cassandra for compliance or paranoia
Hardening cassandra for compliance or paranoiazznate
 
Cassandra Performance Benchmark
Cassandra Performance BenchmarkCassandra Performance Benchmark
Cassandra Performance BenchmarkBigstep
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...DataStax Academy
 
Securing Cassandra The Right Way
Securing Cassandra The Right WaySecuring Cassandra The Right Way
Securing Cassandra The Right WayDataStax Academy
 
Inside Hulu's Data platform (BigDataCamp LA 2013)
Inside Hulu's Data platform (BigDataCamp LA 2013)Inside Hulu's Data platform (BigDataCamp LA 2013)
Inside Hulu's Data platform (BigDataCamp LA 2013)Prasan Samtani
 
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...Adrian Cockcroft
 
Cassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsCassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsJeff Jirsa
 
Cassandra Summit 2015 - A Change of Seasons
Cassandra Summit 2015 - A Change of SeasonsCassandra Summit 2015 - A Change of Seasons
Cassandra Summit 2015 - A Change of SeasonsEiti Kimura
 
AddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFSAddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFSDataStax Academy
 
Ficstar Software: Cassandra Installation to Optimization
Ficstar Software: Cassandra Installation to OptimizationFicstar Software: Cassandra Installation to Optimization
Ficstar Software: Cassandra Installation to OptimizationDataStax Academy
 
Your First 90 Days in Sales Management
Your First 90 Days in Sales ManagementYour First 90 Days in Sales Management
Your First 90 Days in Sales Managementdaleu
 
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...Helena Edelson
 
The First 90 days
The First 90 daysThe First 90 days
The First 90 daysGMR Group
 

Andere mochten auch (15)

Cassandra and security
Cassandra and securityCassandra and security
Cassandra and security
 
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable CassandraCassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
 
Hardening cassandra for compliance or paranoia
Hardening cassandra for compliance or paranoiaHardening cassandra for compliance or paranoia
Hardening cassandra for compliance or paranoia
 
Cassandra Performance Benchmark
Cassandra Performance BenchmarkCassandra Performance Benchmark
Cassandra Performance Benchmark
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
 
Securing Cassandra The Right Way
Securing Cassandra The Right WaySecuring Cassandra The Right Way
Securing Cassandra The Right Way
 
Inside Hulu's Data platform (BigDataCamp LA 2013)
Inside Hulu's Data platform (BigDataCamp LA 2013)Inside Hulu's Data platform (BigDataCamp LA 2013)
Inside Hulu's Data platform (BigDataCamp LA 2013)
 
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
 
Cassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsCassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For Operators
 
Cassandra Summit 2015 - A Change of Seasons
Cassandra Summit 2015 - A Change of SeasonsCassandra Summit 2015 - A Change of Seasons
Cassandra Summit 2015 - A Change of Seasons
 
AddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFSAddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFS
 
Ficstar Software: Cassandra Installation to Optimization
Ficstar Software: Cassandra Installation to OptimizationFicstar Software: Cassandra Installation to Optimization
Ficstar Software: Cassandra Installation to Optimization
 
Your First 90 Days in Sales Management
Your First 90 Days in Sales ManagementYour First 90 Days in Sales Management
Your First 90 Days in Sales Management
 
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
 
The First 90 days
The First 90 daysThe First 90 days
The First 90 days
 

Ähnlich wie Pythian: My First 100 days with a Cassandra Cluster

cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
RAC - The Savior of DBA
RAC - The Savior of DBARAC - The Savior of DBA
RAC - The Savior of DBANikhil Kumar
 
MySQL Cluster Scaling to a Billion Queries
MySQL Cluster Scaling to a Billion QueriesMySQL Cluster Scaling to a Billion Queries
MySQL Cluster Scaling to a Billion QueriesBernd Ocklin
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...DataStax Academy
 
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsLeveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsJulien Anguenot
 
Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideMohammed Fazuluddin
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaStreaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaHelena Edelson
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real WorldJeremy Hanna
 
Aruman Cassandra database
Aruman Cassandra databaseAruman Cassandra database
Aruman Cassandra databaseUmesh Dande
 
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...DataStax Academy
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena EdelsonStreaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena EdelsonSpark Summit
 
SMACK Stack 1.1
SMACK Stack 1.1SMACK Stack 1.1
SMACK Stack 1.1Joe Stein
 
An Introduction to Cassandra - Oracle User Group
An Introduction to Cassandra - Oracle User GroupAn Introduction to Cassandra - Oracle User Group
An Introduction to Cassandra - Oracle User GroupCarlos Juzarte Rolo
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservicesBigstep
 
Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Rahul Kumar
 
Cassandra
CassandraCassandra
Cassandraexsuns
 

Ähnlich wie Pythian: My First 100 days with a Cassandra Cluster (20)

BigData Developers MeetUp
BigData Developers MeetUpBigData Developers MeetUp
BigData Developers MeetUp
 
Apache cassandra
Apache cassandraApache cassandra
Apache cassandra
 
Devops kc
Devops kcDevops kc
Devops kc
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
RAC - The Savior of DBA
RAC - The Savior of DBARAC - The Savior of DBA
RAC - The Savior of DBA
 
MySQL Cluster Scaling to a Billion Queries
MySQL Cluster Scaling to a Billion QueriesMySQL Cluster Scaling to a Billion Queries
MySQL Cluster Scaling to a Billion Queries
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
 
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsLeveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
 
Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction Guide
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaStreaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and Akka
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
Aruman Cassandra database
Aruman Cassandra databaseAruman Cassandra database
Aruman Cassandra database
 
DataStax TechDay - Munich 2014
DataStax TechDay - Munich 2014DataStax TechDay - Munich 2014
DataStax TechDay - Munich 2014
 
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena EdelsonStreaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
 
SMACK Stack 1.1
SMACK Stack 1.1SMACK Stack 1.1
SMACK Stack 1.1
 
An Introduction to Cassandra - Oracle User Group
An Introduction to Cassandra - Oracle User GroupAn Introduction to Cassandra - Oracle User Group
An Introduction to Cassandra - Oracle User Group
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos
 
Cassandra
CassandraCassandra
Cassandra
 

Mehr von DataStax Academy

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftDataStax Academy
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseDataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraDataStax Academy
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsDataStax Academy
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingDataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackDataStax Academy
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready CassandraDataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonDataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First ClusterDataStax Academy
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with DseDataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraDataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseDataStax Academy
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraDataStax Academy
 

Mehr von DataStax Academy (20)

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph Database
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra Driver
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready Cassandra
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

Kürzlich hochgeladen

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Kürzlich hochgeladen (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Pythian: My First 100 days with a Cassandra Cluster

  • 1. My First 100 days with a Cassandra Cluster Presented by : Gustavo René Antúnez DBA Team Lead Carlos Rolo Cassandra MVP September, 2015
  • 3. • 18  Years  of  Data  infrastructure   management  consulting • 200+  Top  brands • 6000+  databases  under   management • Over  400  DBA’s,  in  35  countries   • Top  5%  of  DBA  work  force,  9   Oracle  ACE’s,  2  Microsoft   MVP’s,  1  Cassandra  MVP   • Oracle,  Microsoft,  MySQL,   Datastax  partners,  Netezza,   Hadoop  and  MongoDB  plus   UNIX  Sysadmin  and  Oracle  apps About Pythian
  • 4. Where does René come from – Oracle  DBA   • Started  with  Version  9.2  in  2004   – Speaker  at  Oracle  Open  World,   Developers  Day  and  Collaborate     – APress  Q1  2016:  “Prac%cal  Data   Refresh”   – Movie  Fanatic  &  Music  Lover   – Bringing  the  best  from  México   (Mexihtli)  to  the  rest  of  the  world   and  in  the  process  photographing  it  :)   – rene-­‐ace.com   – @rene_ace 4
  • 5. Where does Carlos come 5 • Cassandra  Consultant     • First  contact  was  0.8     • Cassandra  MVP  &  DataStax   Certified  Architect     • Lisbon  Cassandra  Meetup     • Passion  for  distributed  systems     • Loves  a  good  challenge     • Waterpolo  is  my  sport     • @cjrolo
  • 6. How did you get to be a DBA 6
  • 7. 6th Happiest Job of 2015! 7 http://www.forbes.com/sites/susanadams/2014/03/20/the-happiest-and-unhappiest-jobs-in-2014/ Work-life balance Relationship with boss and co-workers Daily tasks Job resources Field will grow by 15% between 2012 and 2022 DBA can be the key driver of success
  • 8. Happiest Job of 2034? Oxford University: THE FUTURE OF EMPLOYMENT: HOW SUSCEPTIBLE ARE JOBS TO COMPUTERISATION? • 47  percent  of  American  jobs  are  at  high  risk  of  being  taken  by  computers   within  the  next  two  decades.   – 1st  Wave     • Computers  will  start  replacing  people  in  especially  vulnerable   fields  like  transportation/logistics,  production  labor,  and   administrative  support.   – 2nd  Wave   • Dependent  upon  the  development  of  good  artificial  intelligence.   This  could  next  put  jobs  in  management,  science  and   engineering,  and  the  arts  at  risk. 8
  • 9. What is Cassandra ?
 • NoSQL  database,  developed  in  JavaOne     • Fully  distributed  DB   • Meaning  that  there  is  no  master  DB,   unlike  Oracle  or  MySQL.   • Linearly  scalable   • Based  on  2  core  technologies,  Google’s  Big   Table  and  Amazon’s  Dynamo   • 2  versions  of  Cassandra   • Community  Edition.-­‐  This  is  distributed   under  the  Apache™  License   • Enterprise  Edition  .-­‐  This  is  distributed  by   Datastax 9 ≠
  • 10. CAP  Theorem • In  a  distributed  system  you  can  only  have  two   out  of  the  following  three  guarantees  across  a   write/read  pair:   • Consistency.-­‐  A  read  is  guaranteed  to   return  the  most  recent  write  for  a  given   client.   • Availability.-­‐A  non-­‐failing  node  will  return   a  reasonable  response  within  a   reasonable  amount  of  time  (no  error  or   timeout).   • Partition  Tolerance.-­‐The  system  will   continue  to  function  when  network   partitions  occur. 10 N1 N2 X X N1 N2 N1 N2 What is Cassandra ?

  • 11. What is Cassandra ?
 • Cassandra  is  a  BASE  (Basically   Available,  Soft  state,  Eventually   consistent)  type  system 11 • Not  an  ACID  (Atomicity,  Consistency,   Isolation,  Durability)  type  system  
  • 12. It Can be as easy as … • Start  your  machine  and  install  the  following:   • ntp  (Packages  are  normally  ntp,  ntpdata  and  ntp-­‐ doc)   • wget  (Unless  you  have  your  packages  copied  over  via   other  means)   • vim  (Or  your  favorite  text  editor)   • Yum  Package  Management     • Root  or  sudo  access  to  the  install  machine   • Latest  version  of  Oracle  Java  SE  Runtime   Environment  (JRE)  8  (recommended)  or  OpenJDK  7.   • Python  2.6+  (needed  if  installing  OpsCenter) 12
  • 13. It Can be as easy as … 13 • Install  Cassandra.   ~$ sudo yum install dsc21-2.1.5-1 cassandra2.1.5-1 • Install  optional  utilities.   ~$ sudo yum install cassandra21-tools-2.1.5-1 • Start  Cassandra  service   ~$ sudo service cassandra stop ~$ sudo rm -rf /var/lib/cassandra/data/system/* • In  the  cassandra-­‐rackdc.properties  file   #  indicate  the  rack  and  dc  for  this  node   dc=Pythian   rack=RAC1   ~$ sudo service cassandra start
  • 14. Where is everything in Cassandra? 14 Directories Description /var/lib/cassandra Data  directories /var/log/  cassandra Log  directory /var/run/  cassandra Runtime  files /usr/share/  cassandra Environment  settings /usr/share/  cassandra/ lib JAR  files /usr/bin Optional  utilities,  such  as  sstablelevelreset,   sstablerepairedset,  and  sstablesplit /usr/bin Binary  files /usr/sbin /etc/cassandra Configuration  files /etc/init.d Service  startup  script /etc/security/  limits.d Cassandra  user  limits /etc/default /usr/share/  doc/ cassandra/examples Sample  cassandra.yaml  files  for  stress   testing
  • 15. I come from this world… 12c  Version   Architecture… 15
  • 16. I come from this world… Oracle… 16 101010 Online Redo Log10100 Data Files Control Files Segment Database Tablespace Extent Oracle data block Schema Data file OS block Logical Datafile Physical Datafile
  • 17. I come from this world… 17 RAC  -­‐  For  Node  Point  of  Failure RAC Cluster Node3Node2 ASM Disks Node1 Public Network Storage Network ASM Network CSS Network ASM ASM ASM DBB DBBDBB Global  Data  Services     – Service Failover / Load Balancing
  • 18. I come from this world… 18 Dataguard  -­‐  For  Failover Primary Standby Far  Sync   Instance SYNC ASYNC Zero  data  loss  failover
  • 19. Cassandra Architecture Cassandra  Cluster 19 N1 Node N2 Node Rack  1 Datacenter  México N3 Node N4 Node Rack  2 Datacenter  Portugal
  • 20. One Ring to Rule them All 20 • The  total  amount  of  data   managed  by  the  cluster  is   represented  as  a  ring   • Each  node  is  assigned  a  part  of   the  database  to  hold  based  on   each  table’s  primary  key.   • To  guarantee  both  availability   and  durability  multiple  nodes  will   be  assigned  to  the  same  data.   • There  is  no  master  node  all   nodes  can  perform  all  operations 1 4 3 2 A-F,T-Z,M-S G-L,A-F,T-Z M-S,G-L,A-F T-Z,M-S,G-L
  • 21. Gossip 21 • Peer-­‐to-­‐peer  communication   protocol  in  which  nodes  periodically   exchange  state  information     • Runs  every  second  and  exchanges   state  messages  with  up  to  three   other  nodes  in  the  cluster     • Failure  detection     • It  determines  locally  from   gossip  state  and  history  if   another  node  in  the  system  is   down  or  has  come  back  up.
  • 22. Consistent Hashing 22 • A  hash  consists  of  one  or  more   arithmetic  operations  on  a  piece  of   data     • Common  way  of  load  balancing  across   several  nodes   • Hash  function  must  have  a  upper  and   lower  bound  so  objects  can  be   mapped  in  a  circle   • Common  Hash  algorithms   – Simple  checksums   – Message  Digest  (MD5)   – Secure  Hash  Algorithm  (SHA-­‐1/2)   – MurmurHash
  • 23. Partitioners 23 • Determines  how  data  is   distributed  across  the  nodes   in  the  cluster     • Function  for  deriving  a  token   representing  a  row  from  its   partition  key   Cassandra  Offers:   – Murmur3Partition   – RandomPartitioner   – ByteOrderedPartitioner
  • 24. Virtual Nodes 24 • Solution  for  avoiding  calculating   node  tokens  and  thinking  about   the  cluster  size  before  hand   • Each  node  has  multiple  virtual   nodes   • Each  node  virtual  node  own  a   much  smaller  subset  of  data  
  • 25. Coordinators 25 • Acts  as  a  proxy  between  the   client  application  and  the   nodes  that  own  the  data   being  requestedAny  client   request  can  be  sent  to  any   node.
  • 26. Snitch 26 • Is  responsible  for  keeping  all   of  the  nodes  up  to  date  on   what  node  has  what  data,   what  nodes  are  currently   down,  what  nodes  are   bootstrapping,  etc.     • It  Interprets  the  topology The  most  popular  are:   – Gossiping  property  file   snitch   – EC2  Snitch   – EC2  Multi-­‐region  snitch   – Dynamic  Snitch
  • 27. 27
  • 28. Logical database container 28 Data  is  Stored  in  Keyspaces
  • 29. A CASSANDRA TABLE OR COLUMN FAMILY 29 Coordinator Snitch Commitlog  Writer Mem  table  writer Mem  Table  Flush  (Sstable   writer) Reader Mem  tables Bloom  Filters Cassandra  Node CommitLog 10100 SSTables
  • 30. A CASSANDRA TABLE OR COLUMN FAMILY 30 • Consists  of  one  or  more  SStables  and   0  or  more  MEMtables   • SStable  stands  for  Sorted  String  Table.     • E.G.  all  of  the  Columns  in  the   SStable  are  sorted  in  order  by   key.   • Each  SStable  consists  of  the  data   table,  bloom  filter,  index  and  some   other  minor  files.   • SStables  are  immutable.  Once  written   they  are  never  altered  only  read  and   eventually  deleted videogames-events-data-jb-1.db videogames-events-filters-jb-1.db videogames-events-index-jb-1.db videogames-events-data-jb-2.db videogames-events-filters-jb-2.db videogames-events-index-jb-2.db videogames-events-data-jb-3.db videogames-events-filters-jb-3.db videogames-events-index-jb-3.db videogames-events-data-jb-4.db videogames-events-filters-jb-4.db videogames-events-index-jb-4.db SStables  on  disk   /var/lib/cassandra
  • 31. REPLICATION FACTOR (RF) AND CONSISTENCY 31 • Replication  Factor  is  the   number  of  copies  of   columns  stored  in  the  ring   • Replication  factor  should   not  exceed  the  number  of   nodes  in  the  cluster – RF=1  is  one  copy  this  means  that   the  data  for  each  column  is  stored   only  once  in  the  ring.   – RF=3  (default)  means  every   column  stored  in  the  database  is   stored  three  times.   – Quorum  .-­‐  The  read  and  write   must  be  acked/returned  from  a   quorum  of  nodes.
  • 32. REPLICATION FACTOR (RF) AND CONSISTENCY 32 • Consistency   – When  write  or  read  is   performed  the  application  can   choose  to  wait  for  n  copies  of   the  data  to  be  written  or  read   this  is  referred  to  as  consistency   of  n.   – There  is  a  special  consistency   value  called  quorum  which   means  a  response  from  RF/2+1   nodes  is  required.
  • 33. HOW TO MAKE SURE WE DON’T LOOSE DATA 33 • Three  anti-­‐entropy  mechanisms  in  Cassandra   1)  Hinted  handoff   2)  Read  repair     3)  Repair A.K.A.  Anti-­‐Entropy
  • 35. COMPACTIONS 35 • SStables  are  immutable.   • Deletes  and  updates  are  just  new   writes     • SStables  are  merged  together  by   partitioned  key.Old  obsolete  data  is   discarded.   • Lots  of  SStables  become  a  few.   • Compaction  can  require  a  lot  of   disk  space.  DO  NOT  LET  your  disks   get  more  than  50%  full.    
  • 36. CQL - Cassandra Query Language 36 CQL  is  not  SQL • Default  and  primary  interface  into  the  Cassandra  Database  (since  2.0)   • Cassandra  does  not  support  joins  or  subqueries   • Only  way  to  create  users  and  user  based  permissions   • Very  similar:   cqlsh> CREATE KEYSPACE sandbox WITH REPLICATION = { 'class' : 'NetworkTopologyStrategy', DC1 : 1}; cqlsh> USE sandbox; cqlsh:sandbox>CREATE TABLE data (id uuid, data text, PRIMARY KEY (id)); cqlsh:sandbox> INSERT INTO data (id, data) values (c37d661d-7e61-49ea-96a5-68c34e83db3a, 'testing'); cqlsh:sandbox> SELECT * FROM data;
  • 37. 37
  • 38. 38 Feature/Function   DSE/Cassandra Oracle  RDBMS   Core architecture “Masterless”; peer-to-peer with all nodes being the same Traditional standalone High availability Continuous availability with built in redundancy and hardware rack awareness in both single and multiple data centers Oracle Dataguard (for failover) and Oracle RAC (Node SPOF) GoldenGate Data model Google Bigtable Relational/tabular Data consistency model Tunable consistency (CAP theorem consistency per operation Traditional ACID Storage model Targeted directories with separation Tablespaces Logical database container Keyspace Database Backup/recovery Online, point-in-time restore Online, point-in-time restore Enterprise management/ monitoring DataStax OpsCenter Oracle Enterprise Manager
  • 39. LESSONS LEARNED 39 • Understand  the  Data  Model  Differences   • Hardware  Setup  does  Matter   • Grep  the  logs  for  errors  and  warnings   • Make  sure  each  node  is  created  properly   • Know  your  tools   • nodetool  utility   • Cassandra  bulk  loader  (sstableloader)   • jconsole/JavaVisualVM   • Cassandra-­‐Stress   • OpsCenter
  • 40. 40
  • 41. FIT-ACER • F – Focus (SLOW DOWN! Are you ready?) • I – Identify server/DB name, time, authorization • T – Type the command (do not hit enter yet) • A – Assess the command (SPEND TIME HERE!) • C – Check the server / database name again • E – Execute the command • R – Review and document the results 41
  • 43. 43 To contact us sales@pythian.com 1-877-PYTHIAN To follow us http://www.pythian.com/blog http://www.facebook.com/pages/The-Pythian-Group/163902527671 @pythian http://www.linkedin.com/company/pythian Thank you – Q&A