A shallow look at all one needs to know when dealing with Distributed Systems, such as the CAP theorem, Harvest/Yield metrics, Partitioning vs. Replication, and Consensus Algorithms.
2. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
RUBEN TAN LONG ZHENG
▸ CTO of Neuroware (R1 Dot My Sdn Bhd)
▸ We Do Blockchain Stuff™
▸ Co-founder of Javascript Developers Malaysia
▸ Proud owner of 2 useless cats
▸ rubentan.com
▸ @roguejs
3. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
SESSION OVERVIEW
▸ Defining Distributed Systems
▸ Eight Fallacies Of Distributed Systems
▸ CAP Theorem
▸ Harvest / Yield
▸ Replication vs. Partitioning
▸ Consensus Algorithms
5. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS HAPPEN WHEN DEMAND OUTPACES YOUR INFRASTRUCTURE
6. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DEFINING DISTRIBUTED SYSTEMS
▸ Distributed System
▸ A bunch of processes in a networked environment
▸ Communicates by passing messages
▸ Observed as one single entity by outsiders
[Diagram: five interconnected nodes]
7. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DEFINING DISTRIBUTED SYSTEMS
▸ Centralized vs. Decentralized
▸ A topology for control
▸ Centralized distributed system - has an authoritative entity to ensure correctness
▸ Decentralized distributed system - no leader; every node operates independently
8. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DEFINING DISTRIBUTED SYSTEMS
▸ General Characteristics
▸ Networked - each node is connected in a network
▸ Independent Failure - each node can fail independently
▸ Concurrent - computation happens simultaneously across nodes
▸ No Global Clock - nodes cannot rely on a single shared clock
10. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: a few users (P) requesting the web directory from a single server backed by one database]
11. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: the user count grows sharply; the same single server and database now take many more requests]
12. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: a load balancer now spreads users across three servers, all sharing one database]
13. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: the same load-balanced topology; the single shared database remains]
14. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: the database tier is scaled out too, with three databases behind the three servers]
15. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
A PRACTICAL EXAMPLE - CONFERENCE DIRECTORY
[Diagram: the full topology - load balancer, three servers, three databases]
Congratulations, you now have a distributed system!
16. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DEFINING DISTRIBUTED SYSTEMS
▸ Why learn about distributed systems?
▸ Microservices - learn how to evaluate your topology
▸ Load planning - understand how to measure and plan for load
▸ Failure management - eliminate or mitigate single points of failure
▸ Evaluate products - understand which product to use and what exactly it brings to the table
18. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Reliable
▸ Latency Is Zero
▸ Bandwidth Is Infinite
▸ The Network Is Secure
▸ Topology Does Not Change
▸ There Is One Administrator
▸ Transport Cost Is Zero
▸ The Network Is Homogeneous
21. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Reliable
▸ Hardware failure
▸ Human error
▸ Datacenter/cloud failure
▸ DDoS
22. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Reliable
▸ Identify critical components and SPoFs
▸ Chaos Monkey at Netflix
▸ Monitor with heartbeats (see the sketch below)
▸ Simplify the failure model
▸ Watch out for shared state
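A minimal sketch of heartbeat monitoring, assuming Node 18+ (global fetch, AbortSignal.timeout) and hypothetical peer URLs with a made-up /health endpoint; real deployments use proper failure detectors, but the shape is the same:

```typescript
// Heartbeat-based failure detection (illustrative sketch).
// node-1/node-2 and the /health endpoint are invented for this example.
type PeerStatus = { url: string; lastSeen: number; alive: boolean };

const SUSPECT_AFTER_MS = 3000; // suspect a peer after 3s of silence

const peers: PeerStatus[] = [
  { url: "http://node-1:8080", lastSeen: Date.now(), alive: true },
  { url: "http://node-2:8080", lastSeen: Date.now(), alive: true },
];

async function pingOnce(peer: PeerStatus): Promise<void> {
  try {
    const res = await fetch(`${peer.url}/health`, {
      signal: AbortSignal.timeout(1000), // a heartbeat must be fast
    });
    if (res.ok) {
      peer.lastSeen = Date.now();
      peer.alive = true;
    }
  } catch {
    // One missed beat is not proof of death: it may just be the network.
  }
  if (Date.now() - peer.lastSeen > SUSPECT_AFTER_MS) {
    peer.alive = false; // suspect the peer and route traffic away from it
  }
}

setInterval(() => peers.forEach((p) => void pingOnce(p)), 1000);
```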
24. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Latency Is Zero
[Diagram: a user's request fans out to SERVER A (10ms), SERVER B (50ms), and SERVER C (100ms)]
25. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Latency Is Zero
▸ Identify potential race conditions
▸ Avoid sequential operations (see the sketch below)
▸ Plan timeouts to terminate long-blocking requests
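A sketch of both points in TypeScript (Node 18+ assumed; the three service URLs are invented): independent calls are issued concurrently instead of sequentially, and each is bounded by a timeout so one slow node cannot lock the whole request:

```typescript
// Bound every remote call with a timeout so it cannot block forever.
async function fetchWithTimeout(url: string, ms: number): Promise<unknown> {
  const res = await fetch(url, { signal: AbortSignal.timeout(ms) });
  if (!res.ok) throw new Error(`${url} responded ${res.status}`);
  return res.json();
}

async function buildPage(): Promise<void> {
  // Done sequentially this would cost 10ms + 50ms + 100ms;
  // done in parallel it costs roughly max(10, 50, 100)ms.
  const [a, b, c] = await Promise.all([
    fetchWithTimeout("http://server-a/profile", 200),
    fetchWithTimeout("http://server-b/feed", 200),
    fetchWithTimeout("http://server-c/ads", 200),
  ]);
  console.log(a, b, c);
}
```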
28. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Bandwidth Is Infinite
▸ Not as big a fallacy as others
▸ Made worse because more bandwidth is almost always immediately consumed
▸ Plan for unpredictable bandwidth
▸ Graceful degradation
30. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Secure
▸ The SWIFT network lost 81 million USD to a cyber heist in 2016
▸ LinkedIn was breached; more than 117 million accounts were compromised
31. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Secure
▸ Harden infrastructure as early as possible
▸ Adopt industry best practices on access control
▸ Plan for Byzantine faults, or at least detect them
33. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Topology Does Not Change
[Diagram: the original single server + database topology beside the later topology with a load balancer, three servers, and three databases]
34. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Topology Does Not Change
▸ Topology change is the most commonly encountered fallacy
▸ Small changes can cause massive paradigm shifts
▸ Crucial to understand distributed principles
37. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ There Is One Administrator
▸ Conflict between system administration and infrastructure design
▸ Access control can often cause unexpected failures
▸ System administrators have a different focus compared to software developers
▸ Think about management tools, software-defined networking, etc.
40. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ Transport Cost Is Zero
▸ Business decisions can become hard constraints
▸ More powerful hardware can yield minimal results
▸ The transport layer may incur additional resource costs
▸ Different protocols (TCP/UDP) have different performance tradeoffs
42. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Homogeneous
[Diagram: Linux and Windows machines on the same network]
43. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
DISTRIBUTED SYSTEMS FALLACIES
▸ The Network Is Homogeneous
▸ Avoid proprietary protocols/formats
▸ Focus on software/hardware that allows interoperability
▸ Not that big of a deal in the modern world
45. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
▸ Consistency - most up-to-date data upon request - from weak to strong
▸ Availability - able to respond to a request - from low to high
▸ Partition-tolerance - able to continue operating in the event of a network partition - mandatory
47. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
Strong Consistency + Partition Tolerant
48. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
Strong Consistency + Partition Tolerant
• Mission-critical systems
• Financial systems
49. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
High Availability + Partition Tolerant
50. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
High Availability + Partition Tolerant
• “Webscale” systems
• Most web service backends
52. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
Consistency + Availability: also known as NOT a distributed system
53. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
[CAP triangle: Consistency, Availability, Partition-tolerance]
Most systems are tuneable, aka a TRADEOFF
54. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
Also, there are no absolutes in the system
[Spectrum: Absolute Consistency ↔ Absolute Availability]
55. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
CAP THEOREM
Also, there are no absolutes in the system
[Spectrum: Absolute Consistency ↔ Absolute Availability]
• The CAP Theorem describes how a system acts when a network partition is encountered
• Understand what consistency and availability mean
• Sacrifice some consistency for more availability, or vice versa
57. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
▸ Harvest - the completeness of the response to a query
▸ Yield - the probability of completing a request
▸ A response to the CAP Theorem being widely misunderstood and misused
▸ Armando Fox, Eric A. Brewer - Harvest, Yield, and Scalable Tolerant Systems (1999)
▸ How much harvest/yield to sacrifice in the event of a network partition
58. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
▸ Harvest - the completeness of the response to a query
▸ Harvest = total data available / total data
▸ Harvest is an abstract idea - it depends on what you define as completeness
▸ Examples:
▸ Pagination on large datasets
▸ Returning a partial dataset on shard failure
▸ Returning less accurate search results on node failure
59. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
▸ Yield - the probability of completing a request
▸ Yield = total responses / total requests - a value from 0 to 1
▸ Example:
▸ Total responses = 999
▸ Total requests = 1000
▸ Yield = 0.999
▸ Yield is NOT uptime!
60. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
[Tradeoff: sacrifice harvest to gain yield]
• You can trade harvest for yield - Probabilistic Availability (see the sketch below)
• Examples
• Returning stale data
• Prioritising the most critical data
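A hedged sketch of trading harvest for yield: the query fans out to every shard, but the response is built from whatever arrived in time, so the request itself never fails. The shard URLs and the string[] result shape are assumptions for illustration:

```typescript
// Degrade harvest, preserve yield: answer with partial results.
async function searchAllShards(query: string, shards: string[]) {
  const results = await Promise.allSettled(
    shards.map((shard) =>
      fetch(`${shard}/search?q=${encodeURIComponent(query)}`, {
        signal: AbortSignal.timeout(500), // slow shards are dropped, not awaited
      }).then((res) => res.json() as Promise<string[]>)
    )
  );
  const ok = results.filter(
    (r): r is PromiseFulfilledResult<string[]> => r.status === "fulfilled"
  );
  return {
    hits: ok.flatMap((r) => r.value),
    harvest: ok.length / shards.length, // fraction of the data actually consulted
  };
}
```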
61. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
[Tradeoff: sacrifice yield to gain harvest]
• You can trade yield for harvest (see the sketch below)
• Examples
• Database transactional locks
• Returning an error on network failure instead
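The mirror-image sketch, trading yield for harvest: if any shard fails, the whole request fails, so every response that does go out is complete:

```typescript
// Preserve harvest, degrade yield: fail fast on any shard error.
async function searchStrict(query: string, shards: string[]): Promise<string[]> {
  const pages = await Promise.all(
    shards.map(async (shard) => {
      const res = await fetch(`${shard}/search?q=${encodeURIComponent(query)}`);
      if (!res.ok) throw new Error(`shard ${shard} failed`); // abort, don't degrade
      return (await res.json()) as string[];
    })
  );
  return pages.flat(); // full harvest, but yield drops whenever a shard is down
}
```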
62. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
HARVEST / YIELD
▸ Distributed systems can be evaluated by their decisions to reduce harvest or yield under network partitions
▸ Some architectures utilise different harvest/yield tradeoffs in individual components
▸ A better representation of the kind of tradeoffs one will make compared to the CAP Theorem
64. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING & REPLICATION
▸ Strategies - replication and partitioning are two different strategies for scaling a distributed system
▸ Partitioning - dividing data to improve yield during high loads
▸ Replication - creating redundant copies of data to improve harvest in the event of node failures
▸ Both strategies are used together in some combinations
65. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
▸ Partitioning - dividing data to improve yield during high loads
▸ Data can be divided using deterministic indexing strategies (see the sketch below)
▸ Examples:
▸ By geography (Asia, Europe, North America)
▸ By hash (3xf8ca8e, etc)
▸ By category (hot/cold data)
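A minimal sketch of deterministic indexing by hash, using Node's built-in crypto; the key and node count are illustrative. The same key always lands on the same node, so no lookup table is needed:

```typescript
import { createHash } from "crypto";

// Deterministic hash partitioning: key -> stable node index.
function partitionFor(key: string, nodeCount: number): number {
  const digest = createHash("sha256").update(key).digest();
  return digest.readUInt32BE(0) % nodeCount; // stable while nodeCount is fixed
}

console.log(partitionFor("Justice League", 6)); // always the same node
```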
67. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: a web server backed by a single node holding the entire dataset]
When load becomes greater than a single node can handle, we need to partition the data
68. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the web server now fans out to six nodes]
Each node contains a shard of the original dataset
69. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the six nodes are assigned key ranges A-D, E-H, I-M, N-Q, R-U, V-Z]
70. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the same sharded layout]
Search: “Justice League”
71. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the query “Justice League” is routed directly to the shard that owns its key range]
72. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the same sharded layout]
Consistent (deterministic) hashing is used so a query on a sharded dataset can be quickly mapped to the node that contains it
73. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the sharded layout, with a new empty node being added]
Bonus question: what happens when you need to add a new partition?
74. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING
[Diagram: the new node with an undetermined key range (?-?)]
What key range do you use? (A consistent-hashing sketch follows.)
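One common answer is consistent hashing, sketched below without the virtual nodes a production ring would add: nodes and keys share one hash space, a key belongs to the first node clockwise from its hash, and a new node claims only the arc between itself and its predecessor, so most keys stay put (a naive hash % nodeCount scheme would reshuffle nearly everything):

```typescript
import { createHash } from "crypto";

// Hash both node names and keys into the same 32-bit ring.
const hash32 = (s: string): number =>
  createHash("sha256").update(s).digest().readUInt32BE(0);

class Ring {
  private points: { h: number; node: string }[] = [];

  add(node: string): void {
    this.points.push({ h: hash32(node), node });
    this.points.sort((a, b) => a.h - b.h); // keep the ring ordered
  }

  // A key belongs to the first node at or after its hash (wrapping around).
  nodeFor(key: string): string {
    const h = hash32(key);
    const point = this.points.find((p) => p.h >= h) ?? this.points[0];
    return point.node;
  }
}

const ring = new Ring();
["node-1", "node-2", "node-3"].forEach((n) => ring.add(n));
console.log(ring.nodeFor("Justice League"));
ring.add("node-4"); // only keys on node-4's new arc move to it
```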
75. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
PARTITIONING & REPLICATION
▸ Partitioning does not improve system resilience against network partitions or node failures
▸ Replication - used in conjunction with partitioning to improve data redundancy
▸ However, as we replicate data, we improve read yield at the cost of write yield, if we care about strong consistency
76. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: a web server fanning out to six partitioned nodes]
77. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: a request is routed to one specific node]
A request requires data from a specific partition
78. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: the node holding that partition fails]
The node fails, and your data is gone
79. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: the partition is copied onto two more nodes]
Replicate to 2 more nodes to improve data redundancy
80. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: the same layout, now with replicas]
What happens when the data is updated?
81. BASIC DISTRIBUTED SYSTEMS PRINCIPLES
REPLICATION
[Diagram: the replicas now hold diverging copies]
Your data is now inconsistent. The solution is to implement a consensus algorithm amongst the replicas
83. CONSENSUS OVERVIEW
▸ Achieving Consensus = distributed system acting as one entity
▸ Consensus Problem = getting nodes in a distributed system to
agree on something (value, operation, etc)
▸ Common Examples
▸ Commit transactions to a database
▸ Synchronising clocks
▸ Replicating logs
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
84. REALITIES OF DISTRIBUTED SYSTEMS
▸ Distributed systems fail often (more often than you think)
▸ Development of distributed systems costs more
▸ Consensus/coordination is a hard problem
▸ Problems usually bigger than available memory
▸ Debugging a distributed system? Good luck
▸ Monitoring a distributed system? Good luck
▸ Learn to live with imperfections and partial availability
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
85. FAILURE MODES
▸ Fail-stop = a node dies
▸ Fail-recover = a node dies and comes back later (Jesus/Zombie)
▸ Byzantine = a node misbehaves
▸ The scary part? The symptoms are the same!
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
86. FLP IMPOSSIBILITY PROOF
▸ Michael J. Fischer, Nancy A. Lynch, and Michael S. Paterson
▸ Impossibility of Distributed Consensus with One Faulty Process (1985) - Dijkstra (dike-stra) Award (2001)
▸ In synchronous settings, it is possible to reach consensus at the cost of time
▸ Consensus is impossible in an asynchronous setting even when only 1 node may crash
▸ Why is this important? Because math > your arguments!
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
87. BYZANTINE GENERAL’S PROBLEM
▸ Originated from the Two Generals' Problem (1975)
▸ Explored in detail in the Leslie Lamport, Robert Shostak, and Marshall Pease paper: The Byzantine Generals Problem (1982)
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
90. BYZANTINE FAULT TOLERANCE
▸ Byzantine Fault
▸ Any fault that presents different symptoms to different observers (some generals attack, some generals retreat)
▸ Byzantine Failure
▸ The loss of a system service reliant on consensus due to a Byzantine Fault
▸ Byzantine Fault Tolerance
▸ A system that is resilient/tolerant of a Byzantine Fault
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
91. SOLVING THE CONSENSUS PROBLEM
▸ Strong consensus follows these properties:
▸ Termination - all nodes eventually decide on a value
▸ Agreement - all nodes decide on the same value
▸ Integrity - each node decides on at most 1 value, and this value must be one that was proposed
▸ Validity - if all correct nodes propose the same value, then all nodes decide on that value
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
93. 2 PHASE COMMIT
▸ Simplest consensus protocol
▸ Phase 1 - Proposal
▸ A node (called the coordinator) proposes a value to all other nodes, then gathers votes
▸ Phase 2 - Commit-or-abort
▸ The coordinator sends:
▸ Commit if all nodes voted yes - all nodes commit the new value
▸ Abort if 1 or more nodes voted no - all nodes abort the value
▸ (A coordinator sketch follows)
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
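A toy in-process sketch of the two phases; the Participant interface with its vote/commit/abort methods is invented for illustration, and there is no networking or crash recovery:

```typescript
// Two-phase commit, coordinator side (illustrative only).
interface Participant {
  vote(value: string): Promise<boolean>; // phase 1: yes/no
  commit(value: string): Promise<void>;  // phase 2: commit
  abort(): Promise<void>;                // phase 2: abort
}

async function twoPhaseCommit(value: string, nodes: Participant[]): Promise<boolean> {
  // Phase 1 - Proposal: send the value and gather votes.
  const votes = await Promise.all(nodes.map((n) => n.vote(value)));

  // Phase 2 - Commit-or-abort: commit only on a unanimous yes.
  if (votes.every(Boolean)) {
    await Promise.all(nodes.map((n) => n.commit(value)));
    return true;
  }
  await Promise.all(nodes.map((n) => n.abort()));
  return false; // at least one node voted no
}
```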
97. 2 PHASE COMMIT
▸ Agreement - every node accepts the value from the coordinator at phase 2 = YES
▸ Integrity - commit/abort originates from the coordinator = YES
▸ Termination - no loops in the steps, doesn’t run forever = YES
▸ Validity - all correct nodes accept the correct proposed value = YES
▸ Therefore, 2 phase commit fulfils the requirements of a consensus protocol
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
98. 2 PHASE COMMIT
▸ Blocking failure when the coordinator fails before sending the proposal to all nodes
[Diagram: the coordinator (COOR.) proposes a value to three nodes]
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
99. 2 PHASE COMMIT
▸ Blocking failure when the coordinator fails before sending the proposal to all nodes
[Diagram: a node receives the proposed value, votes yes, and is now waiting for the commit]
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
100. 2 PHASE COMMIT
▸ Blocking failure when the coordinator fails before sending the proposal to all nodes
[Diagram: the coordinator crashes… and a different coordinator comes in to propose a different value]
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
101. 2 PHASE COMMIT
▸ Blocking failure when the coordinator fails before sending the proposal to all nodes
[Diagram: the node cannot accept the new proposal because it is waiting on a commit, and cannot abort because the first coordinator might recover]
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
102. 2 PHASE COMMIT
▸ Guarantees safety, but not liveness
▸ Safety = all nodes agree on a value proposed by a node
▸ Liveness = the system should still be able to function when some nodes crash
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
103. 3 PHASE COMMIT
▸ Similar to 2 Phase Commit, with an extra phase (duh)
▸ Phase 1 - canCommit - same as 2PC; nodes reply with Yes
▸ Phase 2 - preCommit - similar to 2PC commit-or-abort, but nodes reply with ACK instead
▸ Phase 3 - doCommit - now the nodes commit
▸ Tolerant of node crashes, but not network partitions
▸ Won’t cover in detail
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
110. [Diagram: a coordinator and four nodes]
Guarantee 1: if ANY node receives a preCommit, we can safely assume that ALL nodes have replied YES, in which case they can safely assume that the value is agreed upon by all nodes
111. [Diagram: a coordinator and four nodes]
Guarantee 2: if ANY node receives a doCommit, we can safely assume that ALL nodes have replied ACK, in which case even if the coordinator fails, they can safely assume their commit is correct
112. PAXOS
▸ Presented by Leslie Lamport in The Part-Time Parliament (1998)
▸ Named after the legislature of the fictional Paxos civilisation
▸ Remains:
▸ The hardest to understand in theory
▸ The hardest to implement
▸ The closest we get to reaching ideal consensus
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
113. PAXOS
▸ Used in:
▸ Apache Zookeeper
▸ Google Chubby (BigTable)
▸ Google Spanner
▸ Apache Mesos
▸ Apache Cassandra
▸ etc
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
114. BASIC PAXOS
▸ Components:
▸ Proposers
▸ Proposes values to other nodes
▸ Acceptors
▸ Respond to proposers with votes
▸ Commits chosen value & decision state
▸ A server can run both a Proposer and an Acceptor
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
115. BASIC PAXOS
▸ Revolves around two important properties: proposal number and time
▸ Proposal numbers are unique, and a higher proposal number has priority over a lower one
▸ Proposal number needs to be persisted on each node
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
116. BASIC PAXOS
▸ Uses a two-phase approach:
▸ Broadcast Prepare
▸ Find out if there’s already a chosen value
▸ Block older proposals that have yet to be completed
▸ Broadcast Accept
▸ Ask acceptors to accept a value
▸ Impossible to have an algorithm that completes in one cycle
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
117. BASIC PAXOS
▸ Proposal Phase
▸ Proposer generates a proposal number p
▸ Proposer broadcasts p and a value v
▸ Acceptor checks whether p is higher than its min-p, and updates min-p if so
▸ Acceptor replies with any accepted-p and accepted-v
▸ Proposer waits for a majority (quorum) to reply
▸ If any reply carries an accepted-p, take the highest one and replace v with its accepted-v
▸ If no quorum, generate a new proposal, using accepted-p as a base
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
118. PAXOS
▸ Accept Phase
▸ Proposer sends p and v to all acceptors
▸ Acceptors check if p is lower than min-p, and ignore it if so; otherwise accepted-p = min-p = p and accepted-v = v, and they return the min-p
▸ Acceptors reply accepted or rejected
▸ If a majority accepted, terminate with v; otherwise, restart the Proposal Phase with a new p
▸ (An acceptor sketch follows)
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
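A single-process sketch of the acceptor side, using the slides' min-p / accepted-p / accepted-v names; plain numbers stand in for the slides' round.serverId proposal numbers, and all networking is omitted:

```typescript
// Basic Paxos acceptor state and handlers (illustrative only).
type Proposal = number; // stands in for round.serverId pairs like "1.1"

class Acceptor {
  minP: Proposal = 0;
  acceptedP: Proposal | null = null;
  acceptedV: string | null = null;

  // Proposal phase: promise to ignore anything below p,
  // and report any value already accepted.
  prepare(p: Proposal) {
    const promised = p > this.minP;
    if (promised) this.minP = p; // block older, incomplete proposals
    return { promised, acceptedP: this.acceptedP, acceptedV: this.acceptedV };
  }

  // Accept phase: reject stale proposals, otherwise record the value.
  accept(p: Proposal, v: string): boolean {
    if (p < this.minP) return false; // a newer proposal has been promised
    this.minP = p;
    this.acceptedP = p;
    this.acceptedV = v;
    return true;
  }
}
```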
119. PROPOSAL PHASE
S1 proposes X using proposal number 1.1, sending P1.1 with value X to A1 and A2
A1: min-P=0, acc-P=-, acc-V=-
A2: min-P=0, acc-P=-, acc-V=-
A3: min-P=0, acc-P=-, acc-V=-
120. PROPOSAL PHASE
Both A1 and A2 set their min-P, and reply with their (still empty) acc-P and acc-V
A1: min-P=1.1, acc-P=-, acc-V=-
A2: min-P=1.1, acc-P=-, acc-V=-
A3: min-P=0, acc-P=-, acc-V=-
121. PROPOSAL PHASE
S1 notices that the highest returned acc-P is not higher than its own P, so it keeps its own value
A1: min-P=1.1, acc-P=-, acc-V=-
A2: min-P=1.1, acc-P=-, acc-V=-
A3: min-P=0, acc-P=-, acc-V=-
122. COMMIT PHASE
S1 issues an accept command using the same proposal and value
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=1.1, acc-P=1.1, acc-V=X
123. PAXOS - MULTI PROPOSERS
▸ What if there were multiple proposers?
▸ Brace yourself, It’s Complicated™ (not really)
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
124. PROPOSAL PHASE
S1 proposed X using proposal number 1.1, and 2 out of 3 nodes have already accepted. S2 now proposes Y using proposal number 2.1
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
125. PROPOSAL PHASE
A3 will return null for acc-P and acc-V…
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
126. PROPOSAL PHASE
…but A2 will return an acc-P of 1.1, and an acc-V of X
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
127. PROPOSAL PHASE
What is the highest acc-P? 1.1
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
128. PROPOSAL PHASE
S2 changes its value to X, and sends it back as a commit
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
129. COMMIT PHASE
S2 sends P2.1 with value X to all three acceptors
A1: min-P=1.1, acc-P=1.1, acc-V=X
A2: min-P=1.1, acc-P=1.1, acc-V=X
A3: min-P=0, acc-P=-, acc-V=-
130. COMMIT PHASE
All three acceptors accept P2.1 with value X
A1: min-P=2.1, acc-P=2.1, acc-V=X
A2: min-P=2.1, acc-P=2.1, acc-V=X
A3: min-P=2.1, acc-P=2.1, acc-V=X
All values are in sync now
133. BASIC PAXOS
▸ This is BASIC Paxos: 2PC with a twist (quorum)
▸ It has vulnerabilities!
▸ Best of 2PC (safety), with strong liveness
▸ Most consensus algorithms are variants of Paxos
▸ Forms the basis of distributed consensus research
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
134. CLOSING…
▸ Basic Paxos is not Byzantine Fault Tolerant, but more advanced variants can be (e.g. PBFT)
▸ It is a challenge to create a consensus protocol (termination, agreement, validity) that is Byzantine Fault Tolerant
▸ Further developments: Multi-Paxos, Raft, Byzantine Fault Tolerant Paxos, etc…
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
135. BITCOIN CONSENSUS
▸ Why do you need to know this?
▸ Bitcoin
▸ Litecoin
▸ Dogecoin
▸ etc
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
136. BITCOIN CONSENSUS
▸ Requirements:
▸ Anybody can access the ledger
▸ Anybody can modify the ledger
▸ Everybody must have the same truth
▸ Nobody exerts sole authority over the truth
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
137. [Diagram: a chain of blocks, BLK-1 through BLK-4, each holding a list of transactions (T) and timestamped to consecutive hours on 2017 Feb 23]
138. [Diagram: the same chain, BLK-1 through BLK-4]
1 - Each block contains a list of transactions
2 - Each block contains a “hash” of its parent, the previous block
3 - Each block is timestamped to a specific time
139. [Diagram: the same chain, plus a pool of pending transactions T1-T8]
New transactions arrive into a memory pool
140. [Diagram: MINER-1, MINER-2, and MINER-3 each assemble the pooled transactions into candidate blocks BLK-X, BLK-Y, and BLK-Z]
All miners receive these transactions via gossip, and collect them into blocks
141. [Diagram: each miner repeatedly hashes its candidate block]
Miners hash the block and race to match the hash against a pattern. This pattern has a difficulty that roughly determines the number of hashes required to solve it. (A toy mining loop follows.)
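A toy mining loop using Node's crypto; requiring a run of leading zero hex digits stands in for Bitcoin's real numeric target comparison, and the block string is made up:

```typescript
import { createHash } from "crypto";

// Proof of work: vary a nonce until the block hash matches the pattern.
function mine(blockData: string, difficulty: number): { nonce: number; hash: string } {
  const target = "0".repeat(difficulty); // more zeros = exponentially more hashes
  for (let nonce = 0; ; nonce++) {
    const hash = createHash("sha256").update(blockData + nonce).digest("hex");
    if (hash.startsWith(target)) return { nonce, hash }; // solved: broadcast it
  }
}

// ~16^4 ≈ 65k hashes on average at difficulty 4.
console.log(mine("BLK-5|prev-hash|T1,T3,T5", 4));
```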
142. [Diagram: MINER-1 finds a matching hash]
A match is found! Broadcast the solution, hash, and block
143. [Diagram: BLK-5 is appended to the chain at hour 5; MINER-1 collects the reward ($$$) while MINER-2 and MINER-3 get nothing]
144. [Diagram: two competing versions of BLK-5 appear at hour 5]
So… what if 2 blocks are discovered at the same time?
145. [Diagram: the chain forks into BLK-5A and BLK-5B]
Fork it. The next block chooses the branch that has the most work
146. [Diagram: MINER-1 builds on BLK-5A]
5A it is!
147. [Diagram: BLK-6 at hour 6 extends BLK-5A]
All future blocks will only choose the longest chain, so 5B is orphaned
148. [Diagram: the transactions from BLK-5B drop back into the pool]
Transactions in 5B eventually get returned to the mempool, to be included in a different block
149. [Diagram: a single chain, BLK-1 through BLK-6]
Consensus achieved, one single version of truth!
150. BITCOIN CONSENSUS
▸ Achieves consensus through proof of work
▸ An economic solution to a distributed problem
▸ Expensive to attack, even when attack vectors are known
▸ Nodes are incentivised to play nice
▸ Great basis for cryptocurrencies
▸ Tradeoffs
▸ Limited number of transactions per second
▸ Improvements limited by politics
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
151. CLOSING…
▸ Understanding distributed models will level up your perspective when developing
▸ Gives you the tools to evaluate technologies to see if they fit your problems
▸ Allows you to reliably tell what kind of tradeoffs a technology makes, and whether you are okay with that sacrifice
BASIC DISTRIBUTED SYSTEMS PRINCIPLES
156. TAKEAWAYS
▸ If you can solve it in memory, don’t go distributed
▸ If you can afford a monolithic architecture, don’t go microservices
▸ If you insist on microservices, use them as a service abstraction, not a scaling method
▸ Scale vertically first, then horizontally
▸ When you need to scale horizontally, use these principles to evaluate solutions and design your system
BASIC DISTRIBUTED SYSTEMS PRINCIPLES