SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
© DataStax, All Rights Reserved.
Apache Cassandra™
Choosing instances for success
1
Erick Ramirez
DataStax Engineering
@flightc
Welcome
• Your app in focus — reads vs writes, CPU vs RAM

• What IOPS? How much is enough?

• Are ephemeral disks evil?

• False economy — cheaper instances can cost you more

• A time to kill — they’re not your pets
© DataStax, All Rights Reserved.3
© DataStax, All Rights Reserved.4
© DataStax, All Rights Reserved.
https://academy.datastax.com
5
© DataStax, All Rights Reserved.
ONE SIZE DOES NOT

FIT ALL
6
Tailor to workload
• intimately understand your app

• reads vs writes

• CPU vs memory

• OLTP vs OLAP

• use case will dictate requirements
© DataStax, All Rights Reserved.
STORAGE OPTIONS
8
© DataStax, All Rights Reserved.
EBS gp2 SSDs
9
• general purpose EBS option

• persistent (durable)

• default volume for EC2 instances

• guaranteed 99% single-digit millisecond latency

• only pay for each GB (IOPS included)

• minimum 10K IOPS for production workloads
3 IOPS/GB (3K IOPS/TB)
Max 10K IOPS/vol
Max 160MB/s throughput/vol
1TB = $122/mo, $1474/yr
© DataStax, All Rights Reserved.
EBS io1 SSDs
10
• fastest available EBS option

• persistent (durable)

• for latency-sensitive OLTP workloads

• guaranteed 99.9%* single-digit millisecond latency

• provisioned IOPS are charged extra

• minimum 10K IOPS for production workloads

* read the fine print
Up to 50 IOPS/GB
Max 20K IOPS/vol
Max 320MB/s throughput/vol
1TB = $141/mo, $1695/yr
1K IOPS = $72/mo, $864/yr
© DataStax, All Rights Reserved.
#spoileralert

EPHEMERAL IS YOUR FRIEND
11
Ephemeral storage
• performance orders of magnitude better than EBS

• already included in instance costs, e.g. m3, c3, i3

• “physically” attached

• not durable across reboots but…
© DataStax, All Rights Reserved.
HELLO, CASSANDRA
13
What is Cassandra
• massively scalable NoSQL database

• fully distributed, no single-point-of-failure

• linear horizontal scaling
© DataStax, All Rights Reserved.
Why Cassandra
15
• all nodes are the same — no SPOF

• real-time, durable writes

• linear scaling on commodity servers

• real-time replication across data centres

• always on — no offline operation

• because you have a scale problem
© DataStax, All Rights Reserved.16
Replication across DCs
© DataStax, All Rights Reserved.
CHEAP INSTANCES

MAY BE COSTING YOU
17
© DataStax, All Rights Reserved.
Real example
18
• deployed on c4.4xlarge

• using EBS io1 with 3K PIOPS

• nodes dropping writes

• high read latencies
16 vCPU, 30GB RAM
Instance $ 5443
EBS io1 1TB $ 1695
PIOPS 3K $ 2592
————————
Annual cost $ 9730
© DataStax, All Rights Reserved.
Recommendation
19
• swap to i3.2xlarge

• 1.9TB NVMe SSDs included

• 3M IOPS, 16GB/s

• 60-70% cheaper than replaced i2.2xlarge
8 vCPU, 61GB RAM
Instance $ 4174
————————
Annual cost $ 4174
© DataStax, All Rights Reserved.
HORSES FOR COURSES
20
© DataStax, All Rights Reserved.
Use case - dev, light prod
21
• m3.large suitable

• entry-level load, testing-the-waters

• minimum 3 C* nodes with RF=3

• use CMS GC with 2GB heap
2 vCPU
7.5GB RAM
1 x 32GB SSD
$ 962/yr
© DataStax, All Rights Reserved.
Use case -

low prod volume
22
• m3.xlarge suitable

• JVM will perform better with the extra RAM

• min 3 C* nodes with RF=3

• use CMS GC with 8GB heap
4 vCPU
15GB RAM
1 x 40GB SSD
$ 1924/yr
© DataStax, All Rights Reserved.
Use case -

moderate prod volume
23
• c3.2xlarge recommended

• more diskspace, extra cores a bonus

• costs 50% more for 2x CPU and 4x diskspace

• min 3 C* nodes with RF=3

• use CMS GC with 8GB heap
8 vCPU
15GB RAM
2 x 80GB SSD
$ 2916/yr
© DataStax, All Rights Reserved.
Use case -

real prod volume
24
• i3.2xlarge recommended

• will handle all kinds of workloads including Analytics,
Graph and Search (Solr)

• min 3 C* nodes with RF=3

• use G1 GC with 24GB heap (32GB for Search nodes)
8 vCPU
61GB RAM
1.9TB NVMe SSD
$ 4174/yr
© DataStax, All Rights Reserved.
https://datastaxacademy.slack.com
25
© DataStax, All Rights Reserved.
Thank you
26

Weitere ähnliche Inhalte

Was ist angesagt?

10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
DevOpsDays Tel Aviv
 
Scaling Cassandra for Big Data
Scaling Cassandra for Big DataScaling Cassandra for Big Data
Scaling Cassandra for Big Data
DataStax Academy
 
TechTalk v2.0 - Performance tuning Cassandra + AWS
TechTalk v2.0 - Performance tuning Cassandra + AWSTechTalk v2.0 - Performance tuning Cassandra + AWS
TechTalk v2.0 - Performance tuning Cassandra + AWS
Pythian
 

Was ist angesagt? (20)

Cassandra Day Atlanta 2015: Diagnosing Problems in Production
Cassandra Day Atlanta 2015: Diagnosing Problems in ProductionCassandra Day Atlanta 2015: Diagnosing Problems in Production
Cassandra Day Atlanta 2015: Diagnosing Problems in Production
 
Seattle Cassandra Meetup - HasOffers
Seattle Cassandra Meetup - HasOffersSeattle Cassandra Meetup - HasOffers
Seattle Cassandra Meetup - HasOffers
 
Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...
Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...
Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...
 
DynamoDB at HasOffers
DynamoDB at HasOffers DynamoDB at HasOffers
DynamoDB at HasOffers
 
10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...
 
How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)
 
How We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and FasterHow We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and Faster
 
Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla
Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with ScyllaScylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla
Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla
 
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
 
Scylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native DatabaseScylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native Database
 
Scylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in Go
Scylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in GoScylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in Go
Scylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in Go
 
Scylla Summit 2016: Compose on Containing the Database
Scylla Summit 2016: Compose on Containing the DatabaseScylla Summit 2016: Compose on Containing the Database
Scylla Summit 2016: Compose on Containing the Database
 
Building Scalable, Real Time Applications for Financial Services with DataStax
Building Scalable, Real Time Applications for Financial Services with DataStaxBuilding Scalable, Real Time Applications for Financial Services with DataStax
Building Scalable, Real Time Applications for Financial Services with DataStax
 
Date-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series DataDate-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series Data
 
Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!
Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!
Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi Kivity
 
Scaling Cassandra for Big Data
Scaling Cassandra for Big DataScaling Cassandra for Big Data
Scaling Cassandra for Big Data
 
Performance tuning - A key to successful cassandra migration
Performance tuning - A key to successful cassandra migrationPerformance tuning - A key to successful cassandra migration
Performance tuning - A key to successful cassandra migration
 
TechTalk v2.0 - Performance tuning Cassandra + AWS
TechTalk v2.0 - Performance tuning Cassandra + AWSTechTalk v2.0 - Performance tuning Cassandra + AWS
TechTalk v2.0 - Performance tuning Cassandra + AWS
 

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success

M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentation
Edward Capriolo
 
stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4
Gaurav "GP" Pal
 
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-FinalSizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
Vigyan Jain
 
Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014
marvin herrera
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, Whiptail
Internet World
 

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success (20)

M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentation
 
Ceph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance BarriersCeph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance Barriers
 
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...
 
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and ChefDevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
 
stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4
 
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-FinalSizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final
 
AWS Webcast - Cost and Performance Optimization in Amazon RDS
AWS Webcast - Cost and Performance Optimization in Amazon RDSAWS Webcast - Cost and Performance Optimization in Amazon RDS
AWS Webcast - Cost and Performance Optimization in Amazon RDS
 
Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cache
 
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar AhmedPGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
 
Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016
Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016
Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016
 
Everyday I’m scaling... Cassandra
Everyday I’m scaling... CassandraEveryday I’m scaling... Cassandra
Everyday I’m scaling... Cassandra
 
Running & Scaling Large Elasticsearch Clusters
Running & Scaling Large Elasticsearch ClustersRunning & Scaling Large Elasticsearch Clusters
Running & Scaling Large Elasticsearch Clusters
 
Aerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data DemystifiedAerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data Demystified
 
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
 
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, Whiptail
 
505 kobal exadata
505 kobal exadata505 kobal exadata
505 kobal exadata
 
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket CacheAccelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket Cache
 
Presentation database on flash
Presentation   database on flashPresentation   database on flash
Presentation database on flash
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

CASSANDRA MEETUP - Choosing the right cloud instances for success

  • 1. © DataStax, All Rights Reserved. Apache Cassandra™ Choosing instances for success 1 Erick Ramirez DataStax Engineering @flightc
  • 2. Welcome • Your app in focus — reads vs writes, CPU vs RAM • What IOPS? How much is enough? • Are ephemeral disks evil? • False economy — cheaper instances can cost you more • A time to kill — they’re not your pets
  • 3. © DataStax, All Rights Reserved.3
  • 4. © DataStax, All Rights Reserved.4
  • 5. © DataStax, All Rights Reserved. https://academy.datastax.com 5
  • 6. © DataStax, All Rights Reserved. ONE SIZE DOES NOT FIT ALL 6
  • 7. Tailor to workload • intimately understand your app • reads vs writes • CPU vs memory • OLTP vs OLAP • use case will dictate requirements
  • 8. © DataStax, All Rights Reserved. STORAGE OPTIONS 8
  • 9. © DataStax, All Rights Reserved. EBS gp2 SSDs 9 • general purpose EBS option • persistent (durable) • default volume for EC2 instances • guaranteed 99% single-digit millisecond latency • only pay for each GB (IOPS included) • minimum 10K IOPS for production workloads 3 IOPS/GB (3K IOPS/TB) Max 10K IOPS/vol Max 160MB/s throughput/vol 1TB = $122/mo, $1474/yr
  • 10. © DataStax, All Rights Reserved. EBS io1 SSDs 10 • fastest available EBS option • persistent (durable) • for latency-sensitive OLTP workloads • guaranteed 99.9%* single-digit millisecond latency • provisioned IOPS are charged extra • minimum 10K IOPS for production workloads * read the fine print Up to 50 IOPS/GB Max 20K IOPS/vol Max 320MB/s throughput/vol 1TB = $141/mo, $1695/yr 1K IOPS = $72/mo, $864/yr
  • 11. © DataStax, All Rights Reserved. #spoileralert EPHEMERAL IS YOUR FRIEND 11
  • 12. Ephemeral storage • performance orders of magnitude better than EBS • already included in instance costs, e.g. m3, c3, i3 • “physically” attached • not durable across reboots but…
  • 13. © DataStax, All Rights Reserved. HELLO, CASSANDRA 13
  • 14. What is Cassandra • massively scalable NoSQL database • fully distributed, no single-point-of-failure • linear horizontal scaling
  • 15. © DataStax, All Rights Reserved. Why Cassandra 15 • all nodes are the same — no SPOF • real-time, durable writes • linear scaling on commodity servers • real-time replication across data centres • always on — no offline operation • because you have a scale problem
  • 16. © DataStax, All Rights Reserved.16 Replication across DCs
  • 17. © DataStax, All Rights Reserved. CHEAP INSTANCES MAY BE COSTING YOU 17
  • 18. © DataStax, All Rights Reserved. Real example 18 • deployed on c4.4xlarge • using EBS io1 with 3K PIOPS • nodes dropping writes • high read latencies 16 vCPU, 30GB RAM Instance $ 5443 EBS io1 1TB $ 1695 PIOPS 3K $ 2592 ———————— Annual cost $ 9730
  • 19. © DataStax, All Rights Reserved. Recommendation 19 • swap to i3.2xlarge • 1.9TB NVMe SSDs included • 3M IOPS, 16GB/s • 60-70% cheaper than replaced i2.2xlarge 8 vCPU, 61GB RAM Instance $ 4174 ———————— Annual cost $ 4174
  • 20. © DataStax, All Rights Reserved. HORSES FOR COURSES 20
  • 21. © DataStax, All Rights Reserved. Use case - dev, light prod 21 • m3.large suitable • entry-level load, testing-the-waters • minimum 3 C* nodes with RF=3 • use CMS GC with 2GB heap 2 vCPU 7.5GB RAM 1 x 32GB SSD $ 962/yr
  • 22. © DataStax, All Rights Reserved. Use case - low prod volume 22 • m3.xlarge suitable • JVM will perform better with the extra RAM • min 3 C* nodes with RF=3 • use CMS GC with 8GB heap 4 vCPU 15GB RAM 1 x 40GB SSD $ 1924/yr
  • 23. © DataStax, All Rights Reserved. Use case - moderate prod volume 23 • c3.2xlarge recommended • more diskspace, extra cores a bonus • costs 50% more for 2x CPU and 4x diskspace • min 3 C* nodes with RF=3 • use CMS GC with 8GB heap 8 vCPU 15GB RAM 2 x 80GB SSD $ 2916/yr
  • 24. © DataStax, All Rights Reserved. Use case - real prod volume 24 • i3.2xlarge recommended • will handle all kinds of workloads including Analytics, Graph and Search (Solr) • min 3 C* nodes with RF=3 • use G1 GC with 24GB heap (32GB for Search nodes) 8 vCPU 61GB RAM 1.9TB NVMe SSD $ 4174/yr
  • 25. © DataStax, All Rights Reserved. https://datastaxacademy.slack.com 25
  • 26. © DataStax, All Rights Reserved. Thank you 26