Orchestrating Cassandra with Kubernetes Operator and PaaSTA

Orchestrating Cassandra
with Kubernetes Operator &
PaaSTA
Percona Live Online
May 19, 2020

About
● Me: Raghavendra D Prabhu
○ rprabhu@yelp.com / t: @randomsurfer / me@rdprabhu.com
○ Snr Software Engineer, Yelp
● Team: Database Reliability Engineering (DRE)
● Where: London, UK & San Francisco, CA
● What: Databases at Yelp: MySQL, Cassandra, ZooKeeper

Yelp’s Mission
Connecting people with great
local businesses.

Overview
● Introduction
● Cassandra at Yelp
● Orchestration
● Cassandra Operator
● Opportunities and Challenges
● Conclusion

“ A distributed system is one in which the failure of a computer
you didn't even know existed can render your own computer
unusable. ”
- Leslie Lamport

Desired Traits of Distributed
Systems
Reliability
Scalability
Maintainability

Distributed Systems Fallacies
● The network is reliable
● Latency is zero
● Bandwidth is inﬁnite
● The network is secure
● Topology doesn't change
● One administrator
● Zero Transport cost
● Homogeneous network

● Distributed wide-column NoSQL datastore
● Leaderless / Multi-master
● Multi-region support
● Tunable consistency
● Write optimized
● Cloud-aware: gossip, failure detection, topology-aware, handoﬀs

C* at Yelp
● Both primary and derived data
● Use cases
● Deployed on Amazon Web Services (AWS)
○ EBS for Storage
● Smartstack for service discovery
● Automated schema management
● ZooKeeper-based cluster coordination

us-west-2 us-east-1
Yelp (EC2) Cassandra @ 100000 ft
Multi-region Cluster

Yelp (EC2) Cassandra @ 10000 ft

Velocity
● Churn
● Feature
● Scale
Safety
● Reliable
● Available
● Maintainable

Orchestration
● What is Orchestration
● Why Orchestrate
○ Reliability, Scalability, Maintainability
○ Sustainable management of constantly growing infrastructure
○ Automation of best practices and learnings
● Orchestration + Control Plane = FTW
○ = Cassandra Operator + PaaS + Kubernetes

Kubernetes / k8s
● Popular Open Source Container-based orchestration
● Actively developed
● Stateful and stateless applications
● Well-deﬁned building blocks for distributed systems
● Integrates into our PaaS
○ k8s: generic but extensible
● Organizes containers into pods

● Yelp PaaSTA: Stateless and Stateful Microservices on Kubernetes
● Few thousand microservices deployed and growing
● Hundreds of deployments every day
● Handles compute, storage and network abstractions
● Why PaaSTA
○ Uniform interface - deployment, restarts, rollbacks ...
○ Clusterman
○ Spot and statically-reserved ﬂeet
PaaSTA: Kubernetes at Yelp

Cassandra Operator
● Introduction
● Speciﬁcations
● Reconciliation
● Putting it together

C* Operator: Intro
● Developed by DRE team
● Controller Loop for Reconciliation
● Deﬁnes a custom resource for k8s
○ Statefulset, Container spec, Storage, Secrets and more
● “Big Red Button”
○ Stop for human takeover

C* Operator: Responsibilities
● Creating cluster from specs
● Scaling the cluster up and down
● Safe and Reliable Change Deployments
● Lifecycle Management
● Multi-region coordination
● Credential management
● Balance resource utilization

The Recipe aka the Cluster Spec

Cassandra Pod
● What is a pod
● Cassandra container + Sidecars
● Sidecar containers
○ HAcheck for Smartstack
○ Cron
○ Sensu
○ Change Data Capture (CDC) publisher
○ Metrics exporters

Storage aka State
● EBS for Cassandra
○ Clear separation of stateful and stateless
○ Quick healing upon underlying node failure
○ Dynamic Provisioning
○ “Compute follows Data”
○ Stripe cluster across AZs

us-west-2
us-east-1
Yelp Cassandra @ 100000 ft + Operator
Multi-region Cluster

Autoscaling
0
1x
2x
3x
Wed Thu Fri Sat Sun Mon Tue Wed Thu Fri Sat Sun
Holiday
Requests

Deployment Strategies
Dealing with
Immutability

Distributed Systems Fallacies
The network is reliable
Latency is zero
Bandwidth is inﬁnite
The network is secure
Topology doesn't change
One administrator
Zero Transport cost
Homogeneous network

Orchestration:
Building A Reliable System
Out of Unreliable Components.

@YelpEngineering
fb.com/YelpEngineers
engineeringblog.yelp.com
github.com/yelp

Credits
● Apache cassandra logo
● https://puppet.com/
● https://www.terraform.io/
● https://kubernetes.io/
● https://etcd.io/
● https://aws.amazon.com/architecture/icons/
● https://dataintensive.net/
● https://www.yelp.com/brand
● https://thenounproject.com/
● https://upload.wikimedia.org/wikipedia/commons/2/2e/Chicago_Symphony
_Orchestra_2005.jpg
● https://upload.wikimedia.org/wikipedia/commons/thumb/e/e0/Blank_US_m
ap_borders_labels.svg/1000px-Blank_US_map_borders_labels.svg.png

Orchestrating Cassandra with Kubernetes Operator and PaaSTA

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Orchestrating Cassandra with Kubernetes Operator and PaaSTA

Ähnlich wie Orchestrating Cassandra with Kubernetes Operator and PaaSTA (20)

Mehr von Raghavendra Prabhu

Mehr von Raghavendra Prabhu (19)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Orchestrating Cassandra with Kubernetes Operator and PaaSTA