"In this session, we’ll detail Red Hat Storage Server data replication strategies for both near replication (LAN) and far replication (over WAN), and explain how replication has evolved over the last few years. You’ll learn about:
Past mechanisms.
Near replication (client-side replication).
Far replication using timestamps (xtime).
Present mechanisms.
Near replication (server side) built using quorum and journaling.
Faster far replication using journaling.
Unified replication.
Replication using snapshots.
Stripe replication using erasure coding."
13. Traditional replication using AFR
“Automatic File Replication”
Client-based replication
Entry, metadata, and data replication
Automated self-healing when bricks recover after failure
14. AFR Sequence Diagram
Participants: Client 1, Client 2, Server A, Server B
Each write runs as a five-phase transaction against both servers: Lock → Pre-op → Op → Post-op → Unlock
A concurrent Lock from Client 2 stays blocked until Client 1 unlocks; only then does Client 2 proceed to its Pre-op
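The five phases above can be illustrated with a toy model. This is a sketch, not GlusterFS internals: the class and field names are invented, and a single process-local lock stands in for AFR's cluster-wide inode lock.

```python
import threading

class Brick:
    """Toy stand-in for a replica brick: holds data plus 'pending' markers."""
    def __init__(self):
        self.data = {}
        self.pending = {}  # path -> count of operations still in flight

class AfrTransaction:
    """Sketch of AFR's lock / pre-op / op / post-op / unlock phases."""
    def __init__(self, bricks):
        self.bricks = bricks
        self.lock = threading.Lock()  # stands in for the cluster-wide lock

    def write(self, path, value):
        with self.lock:                # Lock: a second writer blocks here
            for b in self.bricks:      # Pre-op: mark the op as pending
                b.pending[path] = b.pending.get(path, 0) + 1
            for b in self.bricks:      # Op: perform the write on every replica
                b.data[path] = value
            for b in self.bricks:      # Post-op: clear the pending marker
                b.pending[path] -= 1
        # Unlock: happens as the 'with' block exits

bricks = [Brick(), Brick()]
txn = AfrTransaction(bricks)
txn.write("/a", b"hello")
print(all(b.data["/a"] == b"hello" for b in bricks))  # True
```

The pending markers are the interesting part: if a brick dies between pre-op and post-op, the surviving marker tells self-heal which replica missed the write.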
15. AFR improvements
In the 3.4 release:
Eager locking
Piggybacking
Server quorum
In the 3.5 release:
Granular self-heal
In the 3.6 release:
Rewrite of the code
Pending counters
Self-healing moved into the self-heal daemon
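The role of pending counters can be sketched as a toy "accusation" table. This is a simplification of how AFR records, on each brick, operations its peers may have missed (in reality stored as extended attributes); the dictionary layout and function below are illustrative only.

```python
# Each brick counts operations its peers may have missed
# (a simplification of AFR's per-peer changelog counters).
pending = {
    "brickA": {"brickB": 2},  # brickA accuses brickB of missing 2 ops
    "brickB": {"brickA": 0},  # brickB does not accuse brickA
}

def heal_sources(pending):
    """A brick that no peer accuses is a valid source for self-heal."""
    accused = {peer for counts in pending.values()
               for peer, n in counts.items() if n > 0}
    return [b for b in pending if b not in accused]

print(heal_sources(pending))  # ['brickA']
```

When every brick accuses every other, no clean source exists: that is the split-brain case the server-quorum and NSR work aims to avoid.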
16. NSR – new-style (aka server-side) replication
Replication at the back end (brick processes)
Controlled by a designated “leader”, also known as the sweeper
Advantages:
Client-network bandwidth usage optimized for direct (FUSE) mounts
Avoidance of split-brain
Sweeper elected using the majority principle
A per-term changelog on the sweeper preserves the ordering of operations
Variable consistency models for trading consistency against performance
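The majority-principle election can be sketched as follows. This is a minimal toy, not NSR's actual election protocol: the voting structure and function name are assumptions, but it shows why a strict majority rules out two sweepers coexisting.

```python
from collections import Counter

def elect_sweeper(votes):
    """Elect a sweeper only if some node holds a strict majority of votes.

    `votes` maps voter -> preferred candidate. Because two disjoint groups
    cannot both hold more than half the votes, at most one sweeper can be
    elected per term, which is what prevents split-brain.
    """
    tally = Counter(votes.values())
    candidate, count = tally.most_common(1)[0]
    if count > len(votes) // 2:   # strict majority required
        return candidate
    return None                   # no quorum: refuse writes rather than diverge

print(elect_sweeper({"n1": "n1", "n2": "n1", "n3": "n3"}))  # 'n1' (2 of 3)
print(elect_sweeper({"n1": "n1", "n2": "n2"}))              # None (tie)
```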
17. NSR high-level blocks
NSR client-side translator
Sends I/O to the sweeper
Sweeper (leader)
Forwards I/O to peers
Commits after all peers complete
Non-sweeper (follower)
Accepts I/O only from the sweeper or from reconciliation
Rejects I/O from clients (client retries)
Changelog
Reconciliation
Uses membership information to determine which terms are missing
Uses changelogs to sync the corresponding terms
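The blocks above fit together roughly as follows. This is a hedged sketch under simplified assumptions: the class names, the single-term counter, and the tuple-based log are all invented for illustration, not NSR's wire protocol.

```python
class Follower:
    """Non-sweeper brick: accepts I/O only from the sweeper."""
    def __init__(self):
        self.log = []  # list of (term, op) entries

    def apply(self, term, op, source):
        if source != "sweeper":            # reject direct client I/O
            raise PermissionError("retry via sweeper")
        self.log.append((term, op))
        return True

class Sweeper:
    """Leader: forwards I/O to peers, commits after all of them complete."""
    def __init__(self, followers):
        self.followers = followers
        self.log = []
        self.term = 1                      # bumped on each new election

    def write(self, op):
        acks = [f.apply(self.term, op, source="sweeper")
                for f in self.followers]
        if all(acks):                      # commit only after every peer completes
            self.log.append((self.term, op))
            return "committed"
        return "failed"

def reconcile(healthy_log, stale_log):
    """Replay the entries for terms the stale node missed entirely (sketch)."""
    have = {term for term, _ in stale_log}
    return [entry for entry in healthy_log if entry[0] not in have]

followers = [Follower(), Follower()]
sweeper = Sweeper(followers)
print(sweeper.write("create /x"))  # committed
```

Because the per-term changelog is ordered by the sweeper alone, a recovering follower only has to ask "which terms am I missing?" and replay them, rather than compare file contents.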
23. Crawling and xtime
xtime
Per-inode change time
Propagated up to the root (marker xlator)
Crawling/Scanning
Directory crawl and file synchronization
Synchronize when xtime(master) > xtime(slave)
Slave xtime is maintained by the master
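The pruned crawl the slide describes can be sketched like this. The dictionary layout is an assumption for illustration, not the marker xlator's real on-disk format; only the comparison rule, xtime(master) > xtime(slave), comes from the slide.

```python
def crawl(master, slave, path="/"):
    """Sync only subtrees where xtime(master) > xtime(slave); a sketch.

    master: path -> (xtime, child paths); files have an empty child list.
    slave:  path -> xtime, as last recorded by the master for the slave.
    """
    if master[path][0] <= slave.get(path, 0):
        return []                          # subtree unchanged: prune here
    to_sync = []
    for child in master[path][1]:
        if master[child][1]:               # directory: descend
            to_sync += crawl(master, slave, child)
        elif master[child][0] > slave.get(child, 0):
            to_sync.append(child)          # file is newer on the master
    return to_sync

master = {
    "/":           (100, ["/docs", "/a.txt"]),
    "/docs":       (90,  ["/docs/b.txt"]),
    "/docs/b.txt": (80,  []),
    "/a.txt":      (100, []),
}
slave = {"/": 50, "/docs": 90, "/docs/b.txt": 80, "/a.txt": 40}
print(crawl(master, slave))  # ['/a.txt'] -- /docs is pruned, xtime unchanged
```

Because every change bumps xtime all the way up to the root, an unchanged xtime on a directory lets the crawler skip its entire subtree.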
26. Overview
Multi-node
Distributed (parallel) synchronization
Replica failover
Change detection
Consumable journals
Data synchronization (configurable)
rsync, or tar+ssh (for large numbers of small files)
Efficient processing of renames, deletes, and hardlinks
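Replica failover in this multi-node setup can be pictured as choosing one active syncing worker per replica set, with the others standing by. This is a guess at the semantics expressed as a toy function; the data shapes and names are assumptions, not geo-replication's actual worker-management code.

```python
def assign_workers(replica_sets, alive):
    """Pick one active syncing node per replica set (sketch).

    Each replica set holds the same data, so only one member needs to sync
    it; if that member dies, the next live member takes over (failover).
    """
    active = {}
    for i, rset in enumerate(replica_sets):
        candidates = [n for n in rset if n in alive]
        active[i] = candidates[0] if candidates else None
    return active

sets = [["n1", "n2"], ["n3", "n4"]]
print(assign_workers(sets, alive={"n2", "n3", "n4"}))  # {0: 'n2', 1: 'n3'}
```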
27. Journaling
Journaling translator (changelog)
Records FOPs (efficiently), local to each brick
Data, Entry, Metadata
Change detection: O(1) relative to the number of changes
Consumer library (libgfchangelog)
Per brick
Publish/subscribe mechanism
Journals published periodically
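The record-then-publish cycle can be sketched as a toy per-brick journal. This is illustrative only: the class and method names are invented and the real changelog translator and libgfchangelog use an on-disk format, not in-memory lists.

```python
import time

class Changelog:
    """Toy per-brick journal: records FOPs, publishes them in batches."""
    def __init__(self):
        self.current = []     # journal currently being written
        self.published = []   # closed journals, ready for consumers

    def record(self, fop_class, path):
        # fop_class is one of DATA / ENTRY / METADATA, as on the slide
        self.current.append((time.time(), fop_class, path))

    def publish(self):
        # Rolled over periodically; consumers read whole published
        # journals, so change detection scales with the number of
        # changes rather than the size of the volume.
        batch, self.current = self.current, []
        self.published.append(batch)
        return batch

log = Changelog()
log.record("ENTRY", "/newfile")
log.record("DATA", "/newfile")
batch = log.publish()
print(len(batch))  # 2
```

This is what makes the journal "consumable": geo-replication subscribes to published batches instead of crawling the filesystem.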