SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Downloaden Sie, um offline zu lesen
Eddie	
  Garcia,	
  
VP	
  of	
  InfoSec	
  and	
  Services	
  
Gazzang,	
  Inc.	
  
I/O	
  Performance	
  tuning	
  for	
  Cassandra	
  
running	
  on	
  AWS	
  with	
  Gazzang	
  
Today’s	
  Agenda	
  
•  Tips	
  and	
  Tricks	
  to	
  achieve	
  high	
  performance	
  when	
  running	
  
Cassandra	
  on	
  AWS	
  
•  ConfiguraBon	
  tuning	
  for	
  Cassandra	
  
•  Tools	
  to	
  benchmark	
  raw	
  file	
  system	
  I/O	
  
•  AWS	
  available	
  AMIs	
  to	
  boost	
  performance	
  
•  Stress	
  tesBng	
  on	
  AWS	
  i2	
  HVM	
  instances	
  
•  Configuring	
  AWS	
  EC2	
  instances	
  with	
  SSDs	
  and	
  EBS	
  storage	
  
with	
  PIOPS	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 2
Performance	
  tuning	
  
• Tuning	
  at	
  every	
  layer	
  
– Tune	
  the	
  AWS	
  layer	
  
– Tune	
  the	
  Cassandra	
  layer	
  
– Tune	
  the	
  file	
  system	
  /	
  security	
  layer	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 3
 
Tune	
  the	
  AWS	
  layer	
  
	
  	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 4
Tune	
  the	
  AWS	
  layer	
  
•  i2	
  HVM	
  instances	
  will	
  provide	
  beNer	
  I/O	
  over	
  other	
  instance	
  
types	
  
•  i2	
  instances	
  will	
  support	
  SSD	
  TRIM	
  for	
  beNer	
  SDD	
  health	
  and	
  
performance	
  over	
  Bme	
  
•  Use	
  Amazon	
  Linux	
  distribuBon	
  AMI	
  or	
  kernel	
  version	
  3.8	
  and	
  
greater	
  for	
  higher	
  I/O	
  performance	
  
•  Use	
  Amazon	
  Linux	
  distribuBon	
  AMI	
  for	
  built-­‐in	
  SR-­‐IOV	
  (single	
  
root	
  I/O	
  virtualizaBon)	
  drivers	
  to	
  enable	
  higher	
  performance	
  
AWS	
  Enhanced	
  Networking	
  when	
  running	
  in	
  a	
  VPC	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 5
Amazon	
  Linux	
  AMI	
  Instance	
  Types	
  and	
  Sizes	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 6
http://aws.amazon.com/amazon-linux-ami/
Amazon	
  Linux	
  AMI	
  Instance	
  Types	
  and	
  Cost	
  
on-­‐demand	
  in	
  US	
  East	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 7
http://aws.amazon.com/ec2/pricing/
 
Tune	
  the	
  Cassandra	
  layer	
  
	
  	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 8
Tune	
  the	
  Cassandra	
  layer	
  
•  Follow	
  DataStax	
  published	
  Cassandra	
  best	
  pracBces	
  
hNp://www.datastax.com/documentaBon/cassandra/2.0/cassandra/install/installRecommendSe]ngs.html	
  
•  Data	
  directory	
  should	
  go	
  on	
  the	
  mounted	
  ephemeral	
  instance	
  
storage,	
  avoid	
  EBS	
  storage	
  for	
  maximum	
  I/O	
  performance	
  
•  IMPORTANT:	
  You	
  must	
  have	
  a	
  backup	
  strategy	
  when	
  using	
  
ephemeral,	
  for	
  example	
  using	
  S3	
  for	
  backups	
  
•  RAID-­‐0	
  (stripe)	
  of	
  SSDs	
  is	
  supported	
  but	
  Cassandra	
  also	
  does	
  a	
  
great	
  job	
  of	
  using	
  all	
  mounted	
  drives	
  without	
  RAID	
  
•  Scale	
  by	
  adding	
  smaller	
  instances	
  vs.	
  increasing	
  instance	
  size	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 9
Tune	
  the	
  Cassandra	
  layer	
  
•  Cassandra	
  writes	
  immutable	
  sstable	
  files	
  to	
  disk.	
  	
  It	
  then	
  
compacts	
  mulBple	
  sstables	
  into	
  1	
  larger	
  sstable	
  with	
  some	
  
cleanup	
  occurring	
  along	
  the	
  way	
  which	
  also	
  helps	
  TRIM	
  	
  
•  More	
  OS	
  memory	
  the	
  beNer,	
  on	
  read	
  the	
  sstables	
  are	
  cached	
  
as	
  normal	
  memory	
  mapped	
  file	
  loaded	
  into	
  OS	
  memory	
  
•  Increasing	
  the	
  JVM	
  heap	
  size	
  can	
  cause	
  performance	
  issues	
  for	
  
Cassandra	
  during	
  garbage	
  collecBon	
  “Death	
  by	
  Garbage	
  
CollecBon”	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 10
 
Tune	
  the	
  file	
  system	
  /	
  security	
  layer	
  
	
  	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 11
Tune	
  the	
  file	
  system	
  layer	
  
•  Format	
  the	
  file	
  system	
  with	
  ext4	
  vs	
  ext3	
  or	
  xfs	
  if	
  supported	
  by	
  
your	
  chosen	
  Linux	
  distribuBon	
  
•  Use	
  the	
  most	
  current	
  Linux	
  version	
  for	
  your	
  distribuBon,	
  many	
  
performance	
  fixes	
  are	
  supported	
  only	
  in	
  newer	
  kernels	
  
•  Use	
  IOZone	
  or	
  other	
  file	
  system	
  tests	
  before	
  and	
  ager	
  
configuraBons	
  to	
  benchmark	
  raw	
  file	
  I/O	
  before	
  loading	
  your	
  
Cassandra	
  data	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 12
Tune	
  the	
  file	
  security	
  layer	
  
•  Use	
  Block	
  Level	
  encrypBon	
  dedicaBng	
  enBre	
  SSD	
  volume	
  
•  Encrypt	
  the	
  cluster	
  before	
  loading	
  data	
  whenever	
  possible	
  
•  Use	
  systems	
  that	
  support	
  hardware	
  encrypBon	
  acceleraBon	
  
like	
  Intel	
  AES-­‐NI	
  hNp://aws.amazon.com/ec2/instance-­‐types	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 13
 
	
  
	
  
Test	
  and	
  measure	
  
	
  	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 14
Performance	
  TesJng	
  
•  When	
  tesBng	
  performance	
  reduce	
  the	
  number	
  of	
  variables	
  
that	
  can	
  affect	
  the	
  test	
  
–  Stopping	
  and	
  stopping	
  a	
  server	
  can	
  switch	
  your	
  instance	
  to	
  a	
  different	
  
host	
  with	
  different	
  performance	
  
–  Time	
  of	
  day	
  when	
  you	
  run	
  tests	
  can	
  affect	
  the	
  performance	
  
–  Eliminate	
  cached	
  in	
  memory	
  data	
  from	
  prior	
  tests	
  which	
  may	
  
contaminate	
  your	
  results	
  
–  Avoid	
  tesBng	
  on	
  systems	
  with	
  unknown	
  state	
  and	
  size	
  of	
  data	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 15
Cassandra	
  Test	
  Environment	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 16
Cassandra	
  
Stress	
  Client	
  
Cassandra	
  
Node	
  1	
  
Cassandra	
  
Node	
  2	
  
Cassandra	
  
Node	
  3	
  
Cassandra	
  
Node	
  4	
  
Cassandra	
  
Node	
  5	
  
Cassandra	
  
Node	
  6	
  
EBS	
  Clear	
  text	
  
EBS	
  4K	
  PIOPS	
  
SSD	
  Clear	
  text	
  
SSD	
  Encrypted	
  
IOZone Tests
Cassandra
Stress Tests
S3	
  
Backups	
  
Test	
  Environment	
  SpecificaJons	
  
Instance:	
  i2.2xlarge 	
   	
  	
  
AZ:	
  us-­‐east-­‐1a	
  
AMI	
  InformaBon:	
  amzn-­‐ami-­‐hvm-­‐2013.09.2.x86_64-­‐ebs	
  (ami-­‐e9a18d80)	
  
Linux	
  DistribuBon:	
  Amazon	
  Linux	
  AMI	
  release	
  2013.09	
  
Kernel	
  Version:	
  3.4.73-­‐64.112.amzn1.x86_64	
  
Drive	
  Layout:	
  
	
  	
  	
  	
  Filesystem	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Size	
  	
  Used	
  Avail	
  Use%	
  Mounted	
  on	
  
	
  	
  	
  	
  /dev/xvda1	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  7.9G	
  	
  1.8G	
  	
  6.1G	
  	
  23%	
  /	
   	
  (EBS	
  backed	
  for	
  tests,	
  ephemeral	
  is	
  beNer)	
  
	
  	
  	
  	
  tmpfs	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  30G	
  	
  	
  	
  	
  0	
  	
  	
  30G	
  	
  	
  0%	
  /dev/shm	
  
	
  	
  	
  	
  /dev/xvdb	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  734G	
  	
  197M	
  	
  697G	
  	
  	
  1%	
  /mount/ssd1	
   	
  (Cleartext	
  test	
  SSD)	
  
	
  	
  	
  	
  /dev/mapper/encrypted	
  734G	
  	
  	
  36G	
  	
  662G	
  	
  	
  6%	
  /encrypted	
   	
  (Encrypted	
  test	
  SSD)	
  
	
  
Cassandra	
  Stress	
  Client	
  –	
  m1.medium	
  
	
  
Cassandra	
  Cluster:	
  6	
  Nodes	
  
DataStax	
  enterprise:	
  dse-­‐libcassandra-­‐3.2.2-­‐1.noarch	
  
Cassandra:	
  version	
  1.2.12.2	
  	
  
Java	
  HotSpot(TM)	
  64-­‐Bit	
  Server	
  VM/1.6.0_45	
  
	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 17
IOZone	
  SSD	
  vs.	
  Non-­‐SSD	
  
IOZone	
  test	
  configuraBon	
  
Bme	
  iozone	
  -­‐ORa	
  -­‐s	
  163840	
  -­‐r	
  16384	
  
	
  	
  	
  	
  	
  Iozone:	
  Performance	
  Test	
  of	
  File	
  I/O	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Version	
  $Revision:	
  3.420	
  $	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Compiled	
  for	
  64	
  bit	
  mode.	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Build:	
  linux-­‐AMD64	
  
	
  
	
  	
  	
  	
  	
  OPS	
  Mode.	
  Output	
  is	
  in	
  operaBons	
  per	
  second.	
  
	
  	
  	
  	
  	
  Excel	
  chart	
  generaBon	
  enabled	
  
	
  	
  	
  	
  	
  Auto	
  Mode	
  
	
  	
  	
  	
  	
  File	
  size	
  set	
  to	
  163840	
  KB	
  
	
  	
  	
  	
  	
  Record	
  Size	
  16384	
  KB	
  
	
  	
  	
  	
  	
  Command	
  line	
  used:	
  iozone	
  -­‐ORa	
  -­‐s	
  163840	
  -­‐r	
  16384	
  
	
  	
  	
  	
  	
  Time	
  ResoluBon	
  =	
  0.000001	
  seconds.	
  
	
  	
  	
  	
  	
  Processor	
  cache	
  size	
  set	
  to	
  1024	
  Kbytes.	
  
	
  	
  	
  	
  	
  Processor	
  cache	
  line	
  size	
  set	
  to	
  32	
  bytes.	
  
	
  	
  	
  	
  	
  File	
  stride	
  size	
  set	
  to	
  17	
  *	
  record	
  size.	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 18
http://www.iozone.org/
Cassandra	
  Test	
  Environment	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 19
Cassandra	
  
Node	
  	
  
EBS	
  Clear	
  text	
  
EBS	
  4K	
  PIOPS	
  
encrypted	
  
SSD	
  
SSD	
  Encrypted	
  
IOZone Tests
real 1m6.360s
user 0m0.084s
sys 0m0.911s
real 0m15.223s
user 0m0.115s
sys 0m1.391s
real 0m9.951s
user 0m0.291s
sys 0m3.595s
Cassandra	
  stress	
  
The	
  cassandra-­‐stress	
  tool	
  
•  A	
  Java-­‐based	
  stress	
  tesBng	
  uBlity	
  for	
  benchmarking	
  and	
  load	
  tesBng	
  
a	
  Cassandra	
  cluster.	
  
•  The	
  binary	
  installaBon	
  of	
  the	
  tool	
  also	
  includes	
  a	
  daemon,	
  which	
  in	
  
larger-­‐scale	
  tesBng	
  can	
  prevent	
  potenBal	
  skews	
  in	
  the	
  test	
  results	
  by	
  
keeping	
  the	
  JVM	
  warm.	
  
•  Modes	
  of	
  operaBon:	
  
–  InserBng:	
  Loads	
  test	
  data.	
  
–  Reading:	
  Reads	
  test	
  data.	
  
–  Indexed	
  range	
  slicing:	
  Works	
  with	
  RandomParBBoner	
  on	
  indexed	
  tables.	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 20
http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/
toolsCStress_t.html
Current	
  Cassandra	
  stress	
  test	
  configuraJon	
  
•  Cassandra	
  stress	
  test	
  command	
  
–  <cassandra	
  home>/tools/bin/cassandra-­‐stress	
  -­‐l	
  3	
  -­‐o	
  insert	
  -­‐n	
  
100000000	
  -­‐i	
  1	
  -­‐e	
  ONE	
  -­‐c	
  10	
  -­‐d	
  <Cassandra	
  Node	
  IPs>	
  -­‐t	
  150	
  -­‐f	
  
T1.csv	
  &	
  
•  In	
  the	
  stress	
  test,	
  client	
  stress	
  test	
  nodes	
  1	
  –	
  3	
  will	
  target	
  two	
  
separate	
  Cassandra	
  nodes.	
  On	
  client	
  node	
  #4,	
  target	
  all	
  Cassandra	
  
nodes.	
  
–  Client#1	
  —>	
  CAS	
  1,	
  2	
  
–  Client#2	
  —>	
  CAS	
  3,	
  4	
  
–  Client#3	
  —>	
  CAS	
  5,	
  6	
  
–  Client#4	
  —>	
  CAS	
  1,	
  2,	
  3,	
  4,	
  5,	
  6	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 21
Cassandra	
  Test	
  Environment	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 22
Stress	
  	
  
Client	
  1	
  
Cassandra	
  
Node	
  1	
  
Cassandra	
  
Node	
  2	
  
Cassandra	
  
Node	
  3	
  
Cassandra	
  
Node	
  4	
  
Cassandra	
  
Node	
  5	
  
Cassandra	
  
Node	
  6	
  
SSD	
  Clear	
  text	
  
SSD	
  Encrypted	
  
Cassandra
Stress Tests
Stress	
  	
  
Client	
  2	
  
Stress	
  	
  
Client	
  3	
  
Stress	
  	
  
Client	
  4	
  
Benchmark	
  clear	
  text	
  vs	
  encrypted	
  inserts	
  (write)	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 23
Summary	
  
•  Test	
  in	
  your	
  environment	
  with	
  your	
  data,	
  results	
  will	
  vary	
  
greatly	
  on	
  OS,	
  HW	
  and	
  applicaBon	
  configuraBons	
  
–  Baseline	
  before	
  you	
  tune	
  
–  Tune	
  
–  Test	
  ager	
  tuning	
  
–  Measure	
  
–  Rinse	
  and	
  repeat	
  twice	
  
	
  
•  Security	
  and	
  Performance	
  are	
  not	
  mutually	
  exclusive,	
  
encrypBon	
  can	
  coexist	
  with	
  High	
  I/O	
  performance	
  
	
  
•  Do	
  your	
  homework,	
  configure	
  and	
  run	
  tests	
  that	
  map	
  to	
  your	
  
use	
  case	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 24
• Headquartered	
  in	
  AusBn,	
  Texas	
  
• Focus	
  on	
  securing	
  sensiBve	
  data	
  in	
  cloud	
  
and	
  big	
  data	
  environments	
  
• Enable	
  customers	
  to	
  meet	
  compliance	
  	
  
requirements	
  like	
  HIPAA,	
  PCI,	
  FIPS	
  and	
  
FERPA	
  
• SaBsfy	
  internal	
  security	
  mandates	
  
• Protect	
  valuable	
  client	
  informaBon	
  
About	
  Gazzang	
  
Gazzang	
  is	
  focused	
  on	
  data	
  at-­‐rest	
  encrypBon	
  
	
  
Security	
  in	
  the	
  cloud	
  is	
  a	
  layered	
  approach	
  
264/24/14 Gazzang - All rights reserved 2013
Data	
  in	
  process	
  (in	
  applicaJon)	
  
Data	
  at	
  rest	
  (storage)	
  
Data	
  in	
  transit	
  (SSL)	
  
and	
  key	
  management	
  
	
  
274/24/14 Gazzang - All rights reserved 2013
Security	
  in	
  the	
  cloud	
  is	
  a	
  layered	
  approach	
  
Data	
  in	
  process	
  (in	
  applicaJon)	
  
Data	
  at	
  rest	
  (storage)	
  
Data	
  in	
  transit	
  (SSL)	
  
Thank	
  you!	
  
Gazzang,	
  Inc	
  
www.gazzang.com	
  	
  
	
  
Eddie	
  Garcia	
  
VP	
  of	
  InfoSec	
  and	
  Services	
  
eddie.garcia@gazzang.com	
  
4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 28

Weitere ähnliche Inhalte

Was ist angesagt?

Cassandra and Solid State Drives
Cassandra and Solid State DrivesCassandra and Solid State Drives
Cassandra and Solid State Drives
Rick Branson
 
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
DataStax
 

Was ist angesagt? (20)

A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
 
Stabilizing Ceph
Stabilizing CephStabilizing Ceph
Stabilizing Ceph
 
AddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFSAddThis: Scaling Cassandra up and down into containers with ZFS
AddThis: Scaling Cassandra up and down into containers with ZFS
 
Load testing Cassandra applications
Load testing Cassandra applicationsLoad testing Cassandra applications
Load testing Cassandra applications
 
How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
 
Postgres in Amazon RDS
Postgres in Amazon RDSPostgres in Amazon RDS
Postgres in Amazon RDS
 
Cassandra and Solid State Drives
Cassandra and Solid State DrivesCassandra and Solid State Drives
Cassandra and Solid State Drives
 
Ceph Day Melbourne - Troubleshooting Ceph
Ceph Day Melbourne - Troubleshooting Ceph Ceph Day Melbourne - Troubleshooting Ceph
Ceph Day Melbourne - Troubleshooting Ceph
 
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
 
Ceph Day Melbourne - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Melbourne - Ceph on All-Flash Storage - Breaking Performance BarriersCeph Day Melbourne - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Melbourne - Ceph on All-Flash Storage - Breaking Performance Barriers
 
C* Summit 2013: Practice Makes Perfect: Extreme Cassandra Optimization by Alb...
C* Summit 2013: Practice Makes Perfect: Extreme Cassandra Optimization by Alb...C* Summit 2013: Practice Makes Perfect: Extreme Cassandra Optimization by Alb...
C* Summit 2013: Practice Makes Perfect: Extreme Cassandra Optimization by Alb...
 
Ceph Performance Profiling and Reporting
Ceph Performance Profiling and ReportingCeph Performance Profiling and Reporting
Ceph Performance Profiling and Reporting
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 
Ceph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance BarriersCeph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance Barriers
 
2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph
 
Ceph Day Melbourne - Walk Through a Software Defined Everything PoC
Ceph Day Melbourne - Walk Through a Software Defined Everything PoCCeph Day Melbourne - Walk Through a Software Defined Everything PoC
Ceph Day Melbourne - Walk Through a Software Defined Everything PoC
 
Journey to Stability: Petabyte Ceph Cluster in OpenStack Cloud
Journey to Stability: Petabyte Ceph Cluster in OpenStack CloudJourney to Stability: Petabyte Ceph Cluster in OpenStack Cloud
Journey to Stability: Petabyte Ceph Cluster in OpenStack Cloud
 
Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 

Ähnlich wie TechTalk v2.0 - Performance tuning Cassandra + AWS

Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red_Hat_Storage
 
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Community
 

Ähnlich wie TechTalk v2.0 - Performance tuning Cassandra + AWS (20)

Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
 
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
 
AWS Webcast - Achieving consistent high performance with Postgres on Amazon W...
AWS Webcast - Achieving consistent high performance with Postgres on Amazon W...AWS Webcast - Achieving consistent high performance with Postgres on Amazon W...
AWS Webcast - Achieving consistent high performance with Postgres on Amazon W...
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
 
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
 
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
 
Running Business Critical Workloads on AWS
Running Business Critical Workloads on AWS Running Business Critical Workloads on AWS
Running Business Critical Workloads on AWS
 
Enterprise Storage NAS - Dual Controller
Enterprise Storage NAS - Dual ControllerEnterprise Storage NAS - Dual Controller
Enterprise Storage NAS - Dual Controller
 
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
 
Load Testing Cassandra Applications (Ben Slater, Instaclustr) | C* Summit 2016
Load Testing Cassandra Applications (Ben Slater, Instaclustr) | C* Summit 2016Load Testing Cassandra Applications (Ben Slater, Instaclustr) | C* Summit 2016
Load Testing Cassandra Applications (Ben Slater, Instaclustr) | C* Summit 2016
 
Load Testing Cassandra Applications
Load Testing Cassandra Applications Load Testing Cassandra Applications
Load Testing Cassandra Applications
 
Tuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy WorkloadTuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy Workload
 
Running BSD on AWS
Running BSD on AWSRunning BSD on AWS
Running BSD on AWS
 
Running Microsoft and Oracle Stacks on Elastic Block Store (STG303) | AWS re:...
Running Microsoft and Oracle Stacks on Elastic Block Store (STG303) | AWS re:...Running Microsoft and Oracle Stacks on Elastic Block Store (STG303) | AWS re:...
Running Microsoft and Oracle Stacks on Elastic Block Store (STG303) | AWS re:...
 
Understanding Elastic Block Store Availability and Performance
Understanding Elastic Block Store Availability and PerformanceUnderstanding Elastic Block Store Availability and Performance
Understanding Elastic Block Store Availability and Performance
 
AWS Webcast - Introduction to RDS Low Admin High Perf DBS
AWS Webcast - Introduction to RDS Low Admin High Perf DBSAWS Webcast - Introduction to RDS Low Admin High Perf DBS
AWS Webcast - Introduction to RDS Low Admin High Perf DBS
 
Data Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWSData Scotland 2019: You can run SQL Server on AWS
Data Scotland 2019: You can run SQL Server on AWS
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon Redshift
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
 
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on CephBuild an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
 

Mehr von Pythian

Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous AvailabilityRamp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Pythian
 
Pdb my sql backup london percona live 2012
Pdb my sql backup   london percona live 2012Pdb my sql backup   london percona live 2012
Pdb my sql backup london percona live 2012
Pythian
 

Mehr von Pythian (9)

DB Engineering - From Antiquated to Engineer
DB Engineering - From Antiquated to EngineerDB Engineering - From Antiquated to Engineer
DB Engineering - From Antiquated to Engineer
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWS
 
MySQL administration in Amazon RDS
MySQL administration in Amazon RDSMySQL administration in Amazon RDS
MySQL administration in Amazon RDS
 
Maximizing SQL Reviews and Tuning with pt-query-digest
Maximizing SQL Reviews and Tuning with pt-query-digestMaximizing SQL Reviews and Tuning with pt-query-digest
Maximizing SQL Reviews and Tuning with pt-query-digest
 
Online Schema Changes for Maximizing Uptime
 Online Schema Changes for Maximizing Uptime Online Schema Changes for Maximizing Uptime
Online Schema Changes for Maximizing Uptime
 
MYSQL Patterns in Amazon - Make the Cloud Work For You
MYSQL Patterns in Amazon - Make the Cloud Work For YouMYSQL Patterns in Amazon - Make the Cloud Work For You
MYSQL Patterns in Amazon - Make the Cloud Work For You
 
MYSQL Query Anti-Patterns That Can Be Moved to Sphinx
MYSQL Query Anti-Patterns That Can Be Moved to SphinxMYSQL Query Anti-Patterns That Can Be Moved to Sphinx
MYSQL Query Anti-Patterns That Can Be Moved to Sphinx
 
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous AvailabilityRamp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
 
Pdb my sql backup london percona live 2012
Pdb my sql backup   london percona live 2012Pdb my sql backup   london percona live 2012
Pdb my sql backup london percona live 2012
 

Kürzlich hochgeladen

Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
chumtiyababu
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 

Kürzlich hochgeladen (20)

Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 

TechTalk v2.0 - Performance tuning Cassandra + AWS

  • 1. Eddie  Garcia,   VP  of  InfoSec  and  Services   Gazzang,  Inc.   I/O  Performance  tuning  for  Cassandra   running  on  AWS  with  Gazzang  
  • 2. Today’s  Agenda   •  Tips  and  Tricks  to  achieve  high  performance  when  running   Cassandra  on  AWS   •  ConfiguraBon  tuning  for  Cassandra   •  Tools  to  benchmark  raw  file  system  I/O   •  AWS  available  AMIs  to  boost  performance   •  Stress  tesBng  on  AWS  i2  HVM  instances   •  Configuring  AWS  EC2  instances  with  SSDs  and  EBS  storage   with  PIOPS   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 2
  • 3. Performance  tuning   • Tuning  at  every  layer   – Tune  the  AWS  layer   – Tune  the  Cassandra  layer   – Tune  the  file  system  /  security  layer   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 3
  • 4.   Tune  the  AWS  layer       4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 4
  • 5. Tune  the  AWS  layer   •  i2  HVM  instances  will  provide  beNer  I/O  over  other  instance   types   •  i2  instances  will  support  SSD  TRIM  for  beNer  SDD  health  and   performance  over  Bme   •  Use  Amazon  Linux  distribuBon  AMI  or  kernel  version  3.8  and   greater  for  higher  I/O  performance   •  Use  Amazon  Linux  distribuBon  AMI  for  built-­‐in  SR-­‐IOV  (single   root  I/O  virtualizaBon)  drivers  to  enable  higher  performance   AWS  Enhanced  Networking  when  running  in  a  VPC   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 5
  • 6. Amazon  Linux  AMI  Instance  Types  and  Sizes   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 6 http://aws.amazon.com/amazon-linux-ami/
  • 7. Amazon  Linux  AMI  Instance  Types  and  Cost   on-­‐demand  in  US  East   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 7 http://aws.amazon.com/ec2/pricing/
  • 8.   Tune  the  Cassandra  layer       4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 8
  • 9. Tune  the  Cassandra  layer   •  Follow  DataStax  published  Cassandra  best  pracBces   hNp://www.datastax.com/documentaBon/cassandra/2.0/cassandra/install/installRecommendSe]ngs.html   •  Data  directory  should  go  on  the  mounted  ephemeral  instance   storage,  avoid  EBS  storage  for  maximum  I/O  performance   •  IMPORTANT:  You  must  have  a  backup  strategy  when  using   ephemeral,  for  example  using  S3  for  backups   •  RAID-­‐0  (stripe)  of  SSDs  is  supported  but  Cassandra  also  does  a   great  job  of  using  all  mounted  drives  without  RAID   •  Scale  by  adding  smaller  instances  vs.  increasing  instance  size   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 9
  • 10. Tune  the  Cassandra  layer   •  Cassandra  writes  immutable  sstable  files  to  disk.    It  then   compacts  mulBple  sstables  into  1  larger  sstable  with  some   cleanup  occurring  along  the  way  which  also  helps  TRIM     •  More  OS  memory  the  beNer,  on  read  the  sstables  are  cached   as  normal  memory  mapped  file  loaded  into  OS  memory   •  Increasing  the  JVM  heap  size  can  cause  performance  issues  for   Cassandra  during  garbage  collecBon  “Death  by  Garbage   CollecBon”   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 10
  • 11.   Tune  the  file  system  /  security  layer       4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 11
  • 12. Tune  the  file  system  layer   •  Format  the  file  system  with  ext4  vs  ext3  or  xfs  if  supported  by   your  chosen  Linux  distribuBon   •  Use  the  most  current  Linux  version  for  your  distribuBon,  many   performance  fixes  are  supported  only  in  newer  kernels   •  Use  IOZone  or  other  file  system  tests  before  and  ager   configuraBons  to  benchmark  raw  file  I/O  before  loading  your   Cassandra  data   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 12
  • 13. Tune  the  file  security  layer   •  Use  Block  Level  encrypBon  dedicaBng  enBre  SSD  volume   •  Encrypt  the  cluster  before  loading  data  whenever  possible   •  Use  systems  that  support  hardware  encrypBon  acceleraBon   like  Intel  AES-­‐NI  hNp://aws.amazon.com/ec2/instance-­‐types   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 13
  • 14.       Test  and  measure       4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 14
  • 15. Performance  TesJng   •  When  tesBng  performance  reduce  the  number  of  variables   that  can  affect  the  test   –  Stopping  and  stopping  a  server  can  switch  your  instance  to  a  different   host  with  different  performance   –  Time  of  day  when  you  run  tests  can  affect  the  performance   –  Eliminate  cached  in  memory  data  from  prior  tests  which  may   contaminate  your  results   –  Avoid  tesBng  on  systems  with  unknown  state  and  size  of  data   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 15
  • 16. Cassandra  Test  Environment   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 16 Cassandra   Stress  Client   Cassandra   Node  1   Cassandra   Node  2   Cassandra   Node  3   Cassandra   Node  4   Cassandra   Node  5   Cassandra   Node  6   EBS  Clear  text   EBS  4K  PIOPS   SSD  Clear  text   SSD  Encrypted   IOZone Tests Cassandra Stress Tests S3   Backups  
  • 17. Test  Environment  SpecificaJons   Instance:  i2.2xlarge       AZ:  us-­‐east-­‐1a   AMI  InformaBon:  amzn-­‐ami-­‐hvm-­‐2013.09.2.x86_64-­‐ebs  (ami-­‐e9a18d80)   Linux  DistribuBon:  Amazon  Linux  AMI  release  2013.09   Kernel  Version:  3.4.73-­‐64.112.amzn1.x86_64   Drive  Layout:          Filesystem                        Size    Used  Avail  Use%  Mounted  on          /dev/xvda1                        7.9G    1.8G    6.1G    23%  /    (EBS  backed  for  tests,  ephemeral  is  beNer)          tmpfs                                    30G          0      30G      0%  /dev/shm          /dev/xvdb                          734G    197M    697G      1%  /mount/ssd1    (Cleartext  test  SSD)          /dev/mapper/encrypted  734G      36G    662G      6%  /encrypted    (Encrypted  test  SSD)     Cassandra  Stress  Client  –  m1.medium     Cassandra  Cluster:  6  Nodes   DataStax  enterprise:  dse-­‐libcassandra-­‐3.2.2-­‐1.noarch   Cassandra:  version  1.2.12.2     Java  HotSpot(TM)  64-­‐Bit  Server  VM/1.6.0_45     4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 17
  • 18. IOZone  SSD  vs.  Non-­‐SSD   IOZone  test  configuraBon   Bme  iozone  -­‐ORa  -­‐s  163840  -­‐r  16384            Iozone:  Performance  Test  of  File  I/O                            Version  $Revision:  3.420  $                      Compiled  for  64  bit  mode.                      Build:  linux-­‐AMD64              OPS  Mode.  Output  is  in  operaBons  per  second.            Excel  chart  generaBon  enabled            Auto  Mode            File  size  set  to  163840  KB            Record  Size  16384  KB            Command  line  used:  iozone  -­‐ORa  -­‐s  163840  -­‐r  16384            Time  ResoluBon  =  0.000001  seconds.            Processor  cache  size  set  to  1024  Kbytes.            Processor  cache  line  size  set  to  32  bytes.            File  stride  size  set  to  17  *  record  size.   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 18 http://www.iozone.org/
  • 19. Cassandra  Test  Environment   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 19 Cassandra   Node     EBS  Clear  text   EBS  4K  PIOPS   encrypted   SSD   SSD  Encrypted   IOZone Tests real 1m6.360s user 0m0.084s sys 0m0.911s real 0m15.223s user 0m0.115s sys 0m1.391s real 0m9.951s user 0m0.291s sys 0m3.595s
  • 20. Cassandra  stress   The  cassandra-­‐stress  tool   •  A  Java-­‐based  stress  tesBng  uBlity  for  benchmarking  and  load  tesBng   a  Cassandra  cluster.   •  The  binary  installaBon  of  the  tool  also  includes  a  daemon,  which  in   larger-­‐scale  tesBng  can  prevent  potenBal  skews  in  the  test  results  by   keeping  the  JVM  warm.   •  Modes  of  operaBon:   –  InserBng:  Loads  test  data.   –  Reading:  Reads  test  data.   –  Indexed  range  slicing:  Works  with  RandomParBBoner  on  indexed  tables.   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 20 http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/ toolsCStress_t.html
  • 21. Current  Cassandra  stress  test  configuraJon   •  Cassandra  stress  test  command   –  <cassandra  home>/tools/bin/cassandra-­‐stress  -­‐l  3  -­‐o  insert  -­‐n   100000000  -­‐i  1  -­‐e  ONE  -­‐c  10  -­‐d  <Cassandra  Node  IPs>  -­‐t  150  -­‐f   T1.csv  &   •  In  the  stress  test,  client  stress  test  nodes  1  –  3  will  target  two   separate  Cassandra  nodes.  On  client  node  #4,  target  all  Cassandra   nodes.   –  Client#1  —>  CAS  1,  2   –  Client#2  —>  CAS  3,  4   –  Client#3  —>  CAS  5,  6   –  Client#4  —>  CAS  1,  2,  3,  4,  5,  6   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 21
  • 22. Cassandra  Test  Environment   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 22 Stress     Client  1   Cassandra   Node  1   Cassandra   Node  2   Cassandra   Node  3   Cassandra   Node  4   Cassandra   Node  5   Cassandra   Node  6   SSD  Clear  text   SSD  Encrypted   Cassandra Stress Tests Stress     Client  2   Stress     Client  3   Stress     Client  4  
  • 23. Benchmark  clear  text  vs  encrypted  inserts  (write)   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 23
  • 24. Summary   •  Test  in  your  environment  with  your  data,  results  will  vary   greatly  on  OS,  HW  and  applicaBon  configuraBons   –  Baseline  before  you  tune   –  Tune   –  Test  ager  tuning   –  Measure   –  Rinse  and  repeat  twice     •  Security  and  Performance  are  not  mutually  exclusive,   encrypBon  can  coexist  with  High  I/O  performance     •  Do  your  homework,  configure  and  run  tests  that  map  to  your   use  case   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 24
  • 25. • Headquartered  in  AusBn,  Texas   • Focus  on  securing  sensiBve  data  in  cloud   and  big  data  environments   • Enable  customers  to  meet  compliance     requirements  like  HIPAA,  PCI,  FIPS  and   FERPA   • SaBsfy  internal  security  mandates   • Protect  valuable  client  informaBon   About  Gazzang  
  • 26. Gazzang  is  focused  on  data  at-­‐rest  encrypBon     Security  in  the  cloud  is  a  layered  approach   264/24/14 Gazzang - All rights reserved 2013 Data  in  process  (in  applicaJon)   Data  at  rest  (storage)   Data  in  transit  (SSL)  
  • 27. and  key  management     274/24/14 Gazzang - All rights reserved 2013 Security  in  the  cloud  is  a  layered  approach   Data  in  process  (in  applicaJon)   Data  at  rest  (storage)   Data  in  transit  (SSL)  
  • 28. Thank  you!   Gazzang,  Inc   www.gazzang.com       Eddie  Garcia   VP  of  InfoSec  and  Services   eddie.garcia@gazzang.com   4/24/14 © Gazzang, Inc. -- CONFIDENTIAL -- 28