Thoughts on consistency models

•

2 gefällt mir•1,209 views

rogerbodamer

Some thought on the cap theorem, tradeoffs etc. as presented to the melbourne mongodb user group.

Technologie

cap theorem

• Eric Brewer (ex-Inktomi)
• Proved by Lynch and Gilbert

cap theorem
It is impossible in the asynchrounous network
model to implement a read/write object
that garantuees the following properties:

- Availability
- Atomic consistency in fair transactions

Or: If the network is broken,
your database won’t work

AP vs CP

• Real choices are
• Available - Partition
• Consistent - Partion

AP
• Multiple Nodes participate in writes
• System will be Eventually Consistent
• Storage System guarantees if there are no
new updates, all reads will eventually
return the same, last updated value
Examples:
- DNS
- ASync replication
- MongoDB with Slave-OK
- Memcache

eventual consistency
Master

Slave Slave

Client Client

Asuming update 1,2,3,4,5
Client will expect 1,2,2,2,3,4,5,5,5

eventual consistency
Master

Slave Slave

Client Client

However, we could get this: 1,2,2,4,2,5

eventual consistency

• Monotonic read consistency
• Pin client to certain slave / app server
• Failover still fails

multi master
Dynamo model

R - number of servers to read from
W - number of servers to get response from
N - Replication Factor

R + W > N has nice properties

multi master
Example 1 Example 2
R + W <= N R +W > N
R=1 R=2 R =1
W=1 W=1 W=2
N=5 N=2 N=2
Possibly Stale Data ‘Consistent’ Data
Higher Availability

R +W > N
If R + W > N you can’t both
have fast local reads and writes

network write
possibilities
• deny all writes
• read fully consistent data
• allow writes on one side
• allow reads on other side (stale)
• allow writes on both sides
• give up consistency

multiple writer strategies
• Last one wins

• vector clocks

• Insert

• insert often means:

• if (!exist(x)) set(x)

• exist is hard to implement in eventually
consistent systems

delete
op1: set joe, age 40
op2: delete joe
op3: set joe, 41

- consider switching 2 and 3
- tombstone: remember delete and apply last op
wins

multiple writer strategies
• programmatic merge

• store ops instead of state

• replay operations

• did I get the last one ?

• Commutative operations

• conﬂict free

• anything that’s foldable

CP

• Sometimes we need global state
• Unique - constraints
• User registration
• ACL changes

Finally

uptime(CP + average developer)
>=
uptime(AP + average developer)

Where uptime is the system is up and non-buggy

Empfohlen

Deep Dive into Apache Kafkaconfluent

Introduction to Akka-Streamsdmantula

Building your own Distributed System The easy way - Cassandra Summit EU 2014Kévin LOVATO

Streaming and MessagingXin Wang

Using eBPF to Measure the k8s Cluster HealthScyllaDB

High-Performance Networking Using eBPF, XDP, and io_uringScyllaDB

Stateful stream processing with kafka and samzaGeorge Li

Keeping Latency Low and Throughput High with Application-level Priority Manag...ScyllaDB

Empfohlen

Deep Dive into Apache Kafkaconfluent

Introduction to Akka-Streamsdmantula

Building your own Distributed System The easy way - Cassandra Summit EU 2014Kévin LOVATO

Streaming and MessagingXin Wang

Using eBPF to Measure the k8s Cluster HealthScyllaDB

High-Performance Networking Using eBPF, XDP, and io_uringScyllaDB

Stateful stream processing with kafka and samzaGeorge Li

Keeping Latency Low and Throughput High with Application-level Priority Manag...ScyllaDB

When it Absolutely, Positively, Has to be There: Reliability Guarantees in Ka...confluent

Extreme HTTP Performance Tuning: 1.2M API req/s on a 4 vCPU EC2 InstanceScyllaDB

SignalFx Kafka Consumer OptimizationSignalFx

Whoops! I Rewrote It in RustScyllaDB

Get Lower Latency and Higher Throughput for Java ApplicationsScyllaDB

How to manage large amounts of data with akka streamsIgor Mielientiev

MySQL Multi-Master ReplicationMichael Naumov

Rust, Wright's Law, and the Future of Low-Latency SystemsScyllaDB

DB Latency Using DRAM + PMem in App Direct & Memory ModesScyllaDB

Rust Is Safe. But Is It Fast?ScyllaDB

Kafka At Scale in the Cloudconfluent

Rust kafka-5-2019-unskipGerard Klijs

Basics of Node.jsAlper Unal

Data Structures for High Resolution, Real-time Telemetry at ScaleScyllaDB

Vanquishing Latency Outliers in the Lightbits LightOS Software Defined Storag...ScyllaDB

Keeping MongoDB Data SafeTony Tam

Inter-process communication on steroidsRoberto Agostino Vitillo

Crimson: Ceph for the Age of NVMe and Persistent MemoryScyllaDB

Seastore: Next Generation Backing Store for CephScyllaDB

Thoughts on Transaction and Consistency Modelsiammutex

Consistency Models in New Generation Databasesiammutex

Consistency-New-Generation-DatabasesRoger Xia

Weitere ähnliche Inhalte

Was ist angesagt?

When it Absolutely, Positively, Has to be There: Reliability Guarantees in Ka...confluent

Extreme HTTP Performance Tuning: 1.2M API req/s on a 4 vCPU EC2 InstanceScyllaDB

SignalFx Kafka Consumer OptimizationSignalFx

Whoops! I Rewrote It in RustScyllaDB

Get Lower Latency and Higher Throughput for Java ApplicationsScyllaDB

How to manage large amounts of data with akka streamsIgor Mielientiev

MySQL Multi-Master ReplicationMichael Naumov

Rust, Wright's Law, and the Future of Low-Latency SystemsScyllaDB

DB Latency Using DRAM + PMem in App Direct & Memory ModesScyllaDB

Rust Is Safe. But Is It Fast?ScyllaDB

Kafka At Scale in the Cloudconfluent

Rust kafka-5-2019-unskipGerard Klijs

Basics of Node.jsAlper Unal

Data Structures for High Resolution, Real-time Telemetry at ScaleScyllaDB

Vanquishing Latency Outliers in the Lightbits LightOS Software Defined Storag...ScyllaDB

Keeping MongoDB Data SafeTony Tam

Inter-process communication on steroidsRoberto Agostino Vitillo

Crimson: Ceph for the Age of NVMe and Persistent MemoryScyllaDB

Seastore: Next Generation Backing Store for CephScyllaDB

Was ist angesagt? (19)

When it Absolutely, Positively, Has to be There: Reliability Guarantees in Ka...

Extreme HTTP Performance Tuning: 1.2M API req/s on a 4 vCPU EC2 Instance

SignalFx Kafka Consumer Optimization

Whoops! I Rewrote It in Rust

Get Lower Latency and Higher Throughput for Java Applications

How to manage large amounts of data with akka streams

MySQL Multi-Master Replication

Rust, Wright's Law, and the Future of Low-Latency Systems

DB Latency Using DRAM + PMem in App Direct & Memory Modes

Rust Is Safe. But Is It Fast?

Kafka At Scale in the Cloud

Rust kafka-5-2019-unskip

Basics of Node.js

Data Structures for High Resolution, Real-time Telemetry at Scale

Vanquishing Latency Outliers in the Lightbits LightOS Software Defined Storag...

Keeping MongoDB Data Safe

Inter-process communication on steroids

Crimson: Ceph for the Age of NVMe and Persistent Memory

Seastore: Next Generation Backing Store for Ceph

Ähnlich wie Thoughts on consistency models

Thoughts on Transaction and Consistency Modelsiammutex

Consistency Models in New Generation Databasesiammutex

Consistency-New-Generation-DatabasesRoger Xia

Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedInLinkedIn

Ch-7-Part-2-Distributed-System.pptxKabindra Koirala

Making the Most Out of ScyllaDB's Awesome Concurrency at OptimizelyScyllaDB

Seek and Destroy Kafka Under ReplicationHostedbyConfluent

Salvatore Sanfilippo – How Redis Cluster works, and why - NoSQL matters Barce...NoSQLmatters

NoSQL afternoon in Japan Kumofs & MessagePackSadayuki Furuhashi

NoSQL afternoon in Japan kumofs & MessagePackSadayuki Furuhashi

Scylla Summit 2016: Outbrain Case Study - Lowering Latency While Doing 20X IO...ScyllaDB

Call me maybe: Jepsen and flaky networksShalin Shekhar Mangar

Eventual Consistency @WalmartLabs with Kafka, Avro, SolrCloud and HadoopAyon Sinha

Replication, Durability, and Disaster RecoverySteven Francia

Disaggregated Networking - The Drivers, the Software & The High AvailabilityOpen Networking Summit

Distributed and concurrent programming with RabbitMQ and EventMachine Rails U...Paolo Negri

3.2 Streaming and Messaging振东刘

CPU Caches - Jamie Allenjaxconf

Cpu Cachesshinolajla

Highly concurrent yet natural programmingInfinit

Ähnlich wie Thoughts on consistency models (20)

Thoughts on Transaction and Consistency Models

Consistency Models in New Generation Databases

Consistency-New-Generation-Databases

Jay Kreps on Project Voldemort Scaling Simple Storage At LinkedIn

Ch-7-Part-2-Distributed-System.pptx

Making the Most Out of ScyllaDB's Awesome Concurrency at Optimizely

Seek and Destroy Kafka Under Replication

Salvatore Sanfilippo – How Redis Cluster works, and why - NoSQL matters Barce...

NoSQL afternoon in Japan Kumofs & MessagePack

NoSQL afternoon in Japan kumofs & MessagePack

Scylla Summit 2016: Outbrain Case Study - Lowering Latency While Doing 20X IO...

Call me maybe: Jepsen and flaky networks

Eventual Consistency @WalmartLabs with Kafka, Avro, SolrCloud and Hadoop

Replication, Durability, and Disaster Recovery

Disaggregated Networking - The Drivers, the Software & The High Availability

Distributed and concurrent programming with RabbitMQ and EventMachine Rails U...

3.2 Streaming and Messaging

CPU Caches - Jamie Allen

Cpu Caches

Highly concurrent yet natural programming

Mehr von rogerbodamer

Intro to MongoDB and datamodeling rogerbodamer

Thoughts on MongoDB Analyticsrogerbodamer

Mongo Web Apps: OSCON 2011rogerbodamer

Mongo db japanrogerbodamer

Deploymentrogerbodamer

Schema Design with MongoDBrogerbodamer

Mehr von rogerbodamer (6)

Intro to MongoDB and datamodeling

Thoughts on MongoDB Analytics

Mongo Web Apps: OSCON 2011

Mongo db japan

Deployment

Schema Design with MongoDB

Kürzlich hochgeladen

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

From Family Reminiscence to Scholarly Archive .Alan Dix

DevEX - reference for building teams, processes, and platformsSergiu Bodiu

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3

The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech

"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity

How to write a Business Continuity PlanDatabarracks

A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

WordPress Websites for Engineers: Elevate Your Brandgvaughan

Gen AI in Business - Global Trends Report 2024.pdfAddepto

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3

unit 4 immunoblotting technique complete.pptxBkGupta21

How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Kürzlich hochgeladen (20)

What's New in Teams Calling, Meetings and Devices March 2024

From Family Reminiscence to Scholarly Archive .

DevEX - reference for building teams, processes, and platforms

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

The Ultimate Guide to Choosing WordPress Pros and Cons

"Debugging python applications inside k8s environment", Andrii Soldatenko

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

Nell’iperspazio con Rocket: il Framework Web di Rust!

DMCC Future of Trade Web3 - Special Edition

Dev Dives: Streamline document processing with UiPath Studio Web

How to write a Business Continuity Plan

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

WordPress Websites for Engineers: Elevate Your Brand

Gen AI in Business - Global Trends Report 2024.pdf

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx

unit 4 immunoblotting technique complete.pptx

How AI, OpenAI, and ChatGPT impact business and software.

Streamlining Python Development: A Guide to a Modern Project Setup

Thoughts on consistency models

1. cap

2. cap theorem • Eric Brewer (ex-Inktomi) • Proved by Lynch and Gilbert

3. cap theorem It is impossible in the asynchrounous network model to implement a read/write object that garantuees the following properties: - Availability - Atomic consistency in fair transactions Or: If the network is broken, your database won’t work

4. AP vs CP • Real choices are • Available - Partition • Consistent - Partion

5. AP • Multiple Nodes participate in writes • System will be Eventually Consistent • Storage System guarantees if there are no new updates, all reads will eventually return the same, last updated value Examples: - DNS - ASync replication - MongoDB with Slave-OK - Memcache

6. eventual consistency Master Slave Slave Client Client Asuming update 1,2,3,4,5 Client will expect 1,2,2,2,3,4,5,5,5

7. eventual consistency Master Slave Slave Client Client However, we could get this: 1,2,2,4,2,5

8. eventual consistency • Monotonic read consistency • Pin client to certain slave / app server • Failover still fails

9. multi master Dynamo model R - number of servers to read from W - number of servers to get response from N - Replication Factor R + W > N has nice properties

10. multi master Example 1 Example 2 R + W <= N R +W > N R=1 R=2 R =1 W=1 W=1 W=2 N=5 N=2 N=2 Possibly Stale Data ‘Consistent’ Data Higher Availability

11. R +W > N If R + W > N you can’t both have fast local reads and writes

12. network partitions

13. trivial network partition

14. network write possibilities • deny all writes • read fully consistent data • allow writes on one side • allow reads on other side (stale) • allow writes on both sides • give up consistency

15. multiple writer strategies • Last one wins • vector clocks • Insert • insert often means: • if (!exist(x)) set(x) • exist is hard to implement in eventually consistent systems

16. delete op1: set joe, age 40 op2: delete joe op3: set joe, 41 - consider switching 2 and 3 - tombstone: remember delete and apply last op wins

17. multiple writer strategies • programmatic merge • store ops instead of state • replay operations • did I get the last one ? • Commutative operations • conﬂict free • anything that’s foldable

18. CP • Sometimes we need global state • Unique - constraints • User registration • ACL changes

19. Finally uptime(CP + average developer) >= uptime(AP + average developer) Where uptime is the system is up and non-buggy