OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe

NETWAYS
NETWAYSNETWAYS
Running backups with Ceph-to-Ceph
OSDC 2019 May 15th, 2019
Michel Raabe
Cloud Solution Architect
B1 Systems GmbH
raabe@b1-systems.de
Introducing B1 Systems
founded in 2004
operating both nationally and internationally
more than 100 employees
vendor-independent (hardware and software)
focus:
consulting
support
development
training
operations
solutions
branch offices in Rockolding, Berlin, Cologne & Dresden
B1 Systems GmbH Running backups with Ceph-to-Ceph 2 / 34
Areas of Expertise
B1 Systems GmbH Running backups with Ceph-to-Ceph 3 / 34
Running backups with Ceph-to-Ceph
B1 Systems GmbH Running backups with Ceph-to-Ceph 4 / 34
Requirements
“We want to backup our Ceph cluster”
independent from OpenStack
asynchronous
not always offsite
fuc**** file browser
B1 Systems GmbH Running backups with Ceph-to-Ceph 5 / 34
Methods
native
rbd-mirror
s3 multisite
3rd party
“scripts”
backy2
rbd2qcow
???
B1 Systems GmbH Running backups with Ceph-to-Ceph 6 / 34
Challenges
B1 Systems GmbH Running backups with Ceph-to-Ceph 7 / 34
Challenges
crash consistency
disaster recovery
bandwidth
costs
B1 Systems GmbH Running backups with Ceph-to-Ceph 8 / 34
Crash consistency
only access to the base layer
unknown workload
corrupt filesystem
lost transactions
B1 Systems GmbH Running backups with Ceph-to-Ceph 9 / 34
Disaster recovery
how to access the remote cluster?
bandwidth?
route?
switch the storage backend?
supported by the application?
B1 Systems GmbH Running backups with Ceph-to-Ceph 10 / 34
Bandwidth
bandwidth vs. backup volume
20TB in 24h over 800mbit - ?
network latency
B1 Systems GmbH Running backups with Ceph-to-Ceph 11 / 34
Costs
a second cluster
different type of disks (hdd/ssd/nvme)
similar amount of available disk space
uplink
1/10/100 Gbit
B1 Systems GmbH Running backups with Ceph-to-Ceph 12 / 34
What can we do?
B1 Systems GmbH Running backups with Ceph-to-Ceph 13 / 34
rbd-mirror – Overview
does mirror support:
single image
whole pool
available since jewel (10.2.x)
asynchronous
daemon: rbd-mirror
no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 14 / 34
rbd-mirror – What’s needed?
rbd feature: journaling (+ exclusive lock):
rbd feature enable ... journaling
30s default trigger:
rbd_mirror_sync_point_update_age
cluster name
default is “ceph”
it’s possible but ...
... hard to track
B1 Systems GmbH Running backups with Ceph-to-Ceph 15 / 34
rbd-mirror – Keyring sample
key layout:
ceph.client.admin.keyring
<clustername>.<type>.<id>.keyring
example config files:
remote.client.mirror.keyring
remote.conf
local.client.mirror.keyring
local.conf
B1 Systems GmbH Running backups with Ceph-to-Ceph 16 / 34
rbd-mirror – Problems
rbd-daemons must be able to connect to both clusters
no two public networks
same network or
routing kung fu
krbd module
B1 Systems GmbH Running backups with Ceph-to-Ceph 17 / 34
s3 multisite – Overview
S3 - simple storage service
compatible with Amazon S3
Swift compatible
Keystone integration
encryption
no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 18 / 34
s3 multisite – What’s needed?
read-write or read-only
cloudsync plugin (e.g. aws)
NFS export possible
S3 “client”
B1 Systems GmbH Running backups with Ceph-to-Ceph 19 / 34
s3 multisite – Problems
Masterzone “default”
Zonegroup(s) “default”
<zone>-<zonegroup1> <-> <zone>-<zonegroup2>
default-default <-> default-default
de-fra <-> de-muc
Zonegroups are synced
one connection ...
... radosgw to radosgw
B1 Systems GmbH Running backups with Ceph-to-Ceph 20 / 34
3rd party
custom scripts
backy2
...
all based on snapshots and diff exports
B1 Systems GmbH Running backups with Ceph-to-Ceph 21 / 34
3rd party – Scripts – Overview
should use snapshot and ‘diff’
rbd snap create ..
rbd export-diff .. | ssh rbd import-diff ..
rbd export-diff --from-snap .. | ssh rbd import-diff ..
someone has to track the snapshots
B1 Systems GmbH Running backups with Ceph-to-Ceph 22 / 34
3rd party – backy2 – Overview
internal db
snapshots
rbd and file
can do backups to s3
tricky with k8s
python (only deb)
nbd mount
B1 Systems GmbH Running backups with Ceph-to-Ceph 23 / 34
3rd party – Problems
active/old snapshots
k8s with ceph backend
pv cannot be deleted
tracking/database
still no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 24 / 34
3rd party – Example workflow
1 Create initial snapshot.
2 Copy first snapshot to remote cluster.
3 ... next hour/day/week/month ...
4 Create a snapshot.
5 Copy diff between snap1 vs snap2 to remote cluster.
6 Delete old snapshots.
B1 Systems GmbH Running backups with Ceph-to-Ceph 25 / 34
What can’t we do
B1 Systems GmbH Running backups with Ceph-to-Ceph 26 / 34
rbd export
plain rbd export
for one-time syncs
only disk images
Figure: rbd export | rbd import - 21min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 27 / 34
rbd export-diff
plain rbd export-diff
depends on the (total) diff size
only disk images
fast(er)?
scheduled
Figure: rbd export-diff - 8min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 28 / 34
rbd mirror
rbd-mirror
slow?
runs in the background
no schedule
Figure: rbd mirror - 30min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 29 / 34
s3 multisite
s3 capable client
“only” http(s)
scalable (really, really well)
B1 Systems GmbH Running backups with Ceph-to-Ceph 30 / 34
Wrap-up
B1 Systems GmbH Running backups with Ceph-to-Ceph 31 / 34
What now?
1 rbd snapshots
simple, easy, fast
controllable
can be a backup
2 s3 multisite
simple, fast
built-in
nfs export (ganesha)
no backup at all
3 rbd-mirror
disk-based
built-in
no backup at all
B1 Systems GmbH Running backups with Ceph-to-Ceph 32 / 34
What’s with ...
cephfs
no built-in replication
hourly/daily rsync?
snapshots?
B1 Systems GmbH Running backups with Ceph-to-Ceph 33 / 34
Thank You!
For more information, refer to info@b1-systems.de
or +49 (0)8457 - 931096
1 von 34

Recomendados

CloudOpen 2013: Developing cloud infrastructure: from scratch: the tale of an... von
CloudOpen 2013: Developing cloud infrastructure: from scratch: the tale of an...CloudOpen 2013: Developing cloud infrastructure: from scratch: the tale of an...
CloudOpen 2013: Developing cloud infrastructure: from scratch: the tale of an...Andrey Korolyov
617 views21 Folien
Programming policy diagrams von
Programming policy diagramsProgramming policy diagrams
Programming policy diagramsRoman Gonzalez Productions
192 views3 Folien
A Guide to Data Versioning with MapR Snapshots von
A Guide to Data Versioning with MapR SnapshotsA Guide to Data Versioning with MapR Snapshots
A Guide to Data Versioning with MapR SnapshotsIan Downard
539 views22 Folien
New Process/Thread Runtime von
New Process/Thread Runtime	New Process/Thread Runtime
New Process/Thread Runtime Linaro
2.6K views24 Folien
Simplifying and accelerating converged media with Open Visual Cloud von
Simplifying and accelerating converged media with Open Visual CloudSimplifying and accelerating converged media with Open Visual Cloud
Simplifying and accelerating converged media with Open Visual CloudLiz Warner
318 views25 Folien
Post-K: Building the Arm HPC Ecosystem von
Post-K: Building the Arm HPC Ecosystem	Post-K: Building the Arm HPC Ecosystem
Post-K: Building the Arm HPC Ecosystem Linaro
2.5K views9 Folien

Más contenido relacionado

Similar a OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe

Images for the Clouds with KIWI & OBS von
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSB1 Systems GmbH
286 views34 Folien
Images for the Clouds with KIWI & OBS von
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSAnke Börnig
495 views34 Folien
End of the Road - Facing Current Scaling Limits within OpenStack von
End of the Road - Facing Current Scaling Limits within OpenStackEnd of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStackB1 Systems GmbH
214 views61 Folien
C# Production Debugging Made Easy von
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made EasyAlon Fliess
4K views35 Folien
Jenkins to Gitlab - Intelligent Build-Pipelines von
Jenkins to Gitlab - Intelligent Build-PipelinesJenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-PipelinesChristian Münch
2.7K views33 Folien
Kubernetes - Shifting the mindset from servers to containers - microxchg 201... von
Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...Schlomo Schapiro
5.1K views32 Folien

Similar a OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe(20)

Images for the Clouds with KIWI & OBS von B1 Systems GmbH
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBS
B1 Systems GmbH286 views
Images for the Clouds with KIWI & OBS von Anke Börnig
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBS
Anke Börnig495 views
End of the Road - Facing Current Scaling Limits within OpenStack von B1 Systems GmbH
End of the Road - Facing Current Scaling Limits within OpenStackEnd of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStack
B1 Systems GmbH214 views
C# Production Debugging Made Easy von Alon Fliess
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made Easy
Alon Fliess4K views
Jenkins to Gitlab - Intelligent Build-Pipelines von Christian Münch
Jenkins to Gitlab - Intelligent Build-PipelinesJenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-Pipelines
Christian Münch2.7K views
Kubernetes - Shifting the mindset from servers to containers - microxchg 201... von Schlomo Schapiro
Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...
Schlomo Schapiro5.1K views
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup von Chris Shenton
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupScaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
Chris Shenton523 views
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes von Alluxio, Inc.
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Alluxio, Inc.2.1K views
Testing Persistent Storage Performance in Kubernetes with Sherlock von ScyllaDB
Testing Persistent Storage Performance in Kubernetes with SherlockTesting Persistent Storage Performance in Kubernetes with Sherlock
Testing Persistent Storage Performance in Kubernetes with Sherlock
ScyllaDB278 views
Creating a dynamic software deployment solution using free/libre software von B1 Systems GmbH
Creating a dynamic software deployment solution using free/libre softwareCreating a dynamic software deployment solution using free/libre software
Creating a dynamic software deployment solution using free/libre software
B1 Systems GmbH1K views
DCEU 18: Developing with Docker Containers von Docker, Inc.
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker Containers
Docker, Inc.595 views
Scaleable PHP Applications in Kubernetes von Robert Lemke
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in Kubernetes
Robert Lemke51 views
Salt - A Scalable Systems Management Solution for Datacenters von B1 Systems GmbH
Salt - A Scalable Systems Management Solution for DatacentersSalt - A Scalable Systems Management Solution for Datacenters
Salt - A Scalable Systems Management Solution for Datacenters
B1 Systems GmbH172 views
414: Build an agile CI/CD Pipeline for application integration von Trevor Dolby
414: Build an agile CI/CD Pipeline for application integration414: Build an agile CI/CD Pipeline for application integration
414: Build an agile CI/CD Pipeline for application integration
Trevor Dolby957 views
Some experiences for porting application to Intel Xeon Phi von Maho Nakata
Some experiences for porting application to Intel Xeon PhiSome experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon Phi
Maho Nakata6.5K views

Último

DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker von
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - ParkerDSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - ParkerDeltares
9 views16 Folien
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... von
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...Deltares
6 views15 Folien
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit... von
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...Deltares
13 views34 Folien
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports von
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsBushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsRa'Fat Al-Msie'deen
5 views49 Folien
ict act 1.pptx von
ict act 1.pptxict act 1.pptx
ict act 1.pptxsanjaniarun08
13 views17 Folien
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea... von
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Safe Software
412 views59 Folien

Último(20)

DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker von Deltares
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - ParkerDSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
Deltares9 views
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... von Deltares
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
Deltares6 views
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit... von Deltares
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...
Deltares13 views
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports von Ra'Fat Al-Msie'deen
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsBushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea... von Safe Software
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Safe Software412 views
Software evolution understanding: Automatic extraction of software identifier... von Ra'Fat Al-Msie'deen
Software evolution understanding: Automatic extraction of software identifier...Software evolution understanding: Automatic extraction of software identifier...
Software evolution understanding: Automatic extraction of software identifier...
DSD-INT 2023 HydroMT model building and river-coast coupling in Python - Bove... von Deltares
DSD-INT 2023 HydroMT model building and river-coast coupling in Python - Bove...DSD-INT 2023 HydroMT model building and river-coast coupling in Python - Bove...
DSD-INT 2023 HydroMT model building and river-coast coupling in Python - Bove...
Deltares17 views
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut... von HCLSoftware
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...
Elevate your SAP landscape's efficiency and performance with HCL Workload Aut...
HCLSoftware6 views
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra... von Marc Müller
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra....NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
Marc Müller38 views
Tridens DevOps von Tridens
Tridens DevOpsTridens DevOps
Tridens DevOps
Tridens9 views
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J... von Deltares
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...
Deltares9 views
Upgrading Incident Management with Icinga - Icinga Camp Milan 2023 von Icinga
Upgrading Incident Management with Icinga - Icinga Camp Milan 2023Upgrading Incident Management with Icinga - Icinga Camp Milan 2023
Upgrading Incident Management with Icinga - Icinga Camp Milan 2023
Icinga38 views
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema von Deltares
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - GeertsemaDSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema
Deltares17 views
Consulting for Data Monetization Maximizing the Profit Potential of Your Data... von Flexsin
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...Consulting for Data Monetization Maximizing the Profit Potential of Your Data...
Consulting for Data Monetization Maximizing the Profit Potential of Your Data...
Flexsin 15 views
Fleet Management Software in India von Fleetable
Fleet Management Software in India Fleet Management Software in India
Fleet Management Software in India
Fleetable11 views

OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe

  • 1. Running backups with Ceph-to-Ceph OSDC 2019 May 15th, 2019 Michel Raabe Cloud Solution Architect B1 Systems GmbH raabe@b1-systems.de
  • 2. Introducing B1 Systems founded in 2004 operating both nationally and internationally more than 100 employees vendor-independent (hardware and software) focus: consulting support development training operations solutions branch offices in Rockolding, Berlin, Cologne & Dresden B1 Systems GmbH Running backups with Ceph-to-Ceph 2 / 34
  • 3. Areas of Expertise B1 Systems GmbH Running backups with Ceph-to-Ceph 3 / 34
  • 4. Running backups with Ceph-to-Ceph B1 Systems GmbH Running backups with Ceph-to-Ceph 4 / 34
  • 5. Requirements “We want to backup our Ceph cluster” independent from OpenStack asynchronous not always offsite fuc**** file browser B1 Systems GmbH Running backups with Ceph-to-Ceph 5 / 34
  • 7. Challenges B1 Systems GmbH Running backups with Ceph-to-Ceph 7 / 34
  • 8. Challenges crash consistency disaster recovery bandwidth costs B1 Systems GmbH Running backups with Ceph-to-Ceph 8 / 34
  • 9. Crash consistency only access to the base layer unknown workload corrupt filesystem lost transactions B1 Systems GmbH Running backups with Ceph-to-Ceph 9 / 34
  • 10. Disaster recovery how to access the remote cluster? bandwidth? route? switch the storage backend? supported by the application? B1 Systems GmbH Running backups with Ceph-to-Ceph 10 / 34
  • 11. Bandwidth bandwidth vs. backup volume 20TB in 24h over 800mbit - ? network latency B1 Systems GmbH Running backups with Ceph-to-Ceph 11 / 34
  • 12. Costs a second cluster different type of disks (hdd/ssd/nvme) similar amount of available disk space uplink 1/10/100 Gbit B1 Systems GmbH Running backups with Ceph-to-Ceph 12 / 34
  • 13. What can we do? B1 Systems GmbH Running backups with Ceph-to-Ceph 13 / 34
  • 14. rbd-mirror – Overview does mirror support: single image whole pool available since jewel (10.2.x) asynchronous daemon: rbd-mirror no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 14 / 34
  • 15. rbd-mirror – What’s needed? rbd feature: journaling (+ exclusive lock): rbd feature enable ... journaling 30s default trigger: rbd_mirror_sync_point_update_age cluster name default is “ceph” it’s possible but ... ... hard to track B1 Systems GmbH Running backups with Ceph-to-Ceph 15 / 34
  • 16. rbd-mirror – Keyring sample key layout: ceph.client.admin.keyring <clustername>.<type>.<id>.keyring example config files: remote.client.mirror.keyring remote.conf local.client.mirror.keyring local.conf B1 Systems GmbH Running backups with Ceph-to-Ceph 16 / 34
  • 17. rbd-mirror – Problems rbd-daemons must be able to connect to both clusters no two public networks same network or routing kung fu krbd module B1 Systems GmbH Running backups with Ceph-to-Ceph 17 / 34
  • 18. s3 multisite – Overview S3 - simple storage service compatible with Amazon S3 Swift compatible Keystone integration encryption no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 18 / 34
  • 19. s3 multisite – What’s needed? read-write or read-only cloudsync plugin (e.g. aws) NFS export possible S3 “client” B1 Systems GmbH Running backups with Ceph-to-Ceph 19 / 34
  • 20. s3 multisite – Problems Masterzone “default” Zonegroup(s) “default” <zone>-<zonegroup1> <-> <zone>-<zonegroup2> default-default <-> default-default de-fra <-> de-muc Zonegroups are synced one connection ... ... radosgw to radosgw B1 Systems GmbH Running backups with Ceph-to-Ceph 20 / 34
  • 21. 3rd party custom scripts backy2 ... all based on snapshots and diff exports B1 Systems GmbH Running backups with Ceph-to-Ceph 21 / 34
  • 22. 3rd party – Scripts – Overview should use snapshot and ‘diff’ rbd snap create .. rbd export-diff .. | ssh rbd import-diff .. rbd export-diff --from-snap .. | ssh rbd import-diff .. someone has to track the snapshots B1 Systems GmbH Running backups with Ceph-to-Ceph 22 / 34
  • 23. 3rd party – backy2 – Overview internal db snapshots rbd and file can do backups to s3 tricky with k8s python (only deb) nbd mount B1 Systems GmbH Running backups with Ceph-to-Ceph 23 / 34
  • 24. 3rd party – Problems active/old snapshots k8s with ceph backend pv cannot be deleted tracking/database still no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 24 / 34
  • 25. 3rd party – Example workflow 1 Create initial snapshot. 2 Copy first snapshot to remote cluster. 3 ... next hour/day/week/month ... 4 Create a snapshot. 5 Copy diff between snap1 vs snap2 to remote cluster. 6 Delete old snapshots. B1 Systems GmbH Running backups with Ceph-to-Ceph 25 / 34
  • 26. What can’t we do B1 Systems GmbH Running backups with Ceph-to-Ceph 26 / 34
  • 27. rbd export plain rbd export for one-time syncs only disk images Figure: rbd export | rbd import - 21min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 27 / 34
  • 28. rbd export-diff plain rbd export-diff depends on the (total) diff size only disk images fast(er)? scheduled Figure: rbd export-diff - 8min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 28 / 34
  • 29. rbd mirror rbd-mirror slow? runs in the background no schedule Figure: rbd mirror - 30min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 29 / 34
  • 30. s3 multisite s3 capable client “only” http(s) scalable (really, really well) B1 Systems GmbH Running backups with Ceph-to-Ceph 30 / 34
  • 31. Wrap-up B1 Systems GmbH Running backups with Ceph-to-Ceph 31 / 34
  • 32. What now? 1 rbd snapshots simple, easy, fast controllable can be a backup 2 s3 multisite simple, fast built-in nfs export (ganesha) no backup at all 3 rbd-mirror disk-based built-in no backup at all B1 Systems GmbH Running backups with Ceph-to-Ceph 32 / 34
  • 33. What’s with ... cephfs no built-in replication hourly/daily rsync? snapshots? B1 Systems GmbH Running backups with Ceph-to-Ceph 33 / 34
  • 34. Thank You! For more information, refer to info@b1-systems.de or +49 (0)8457 - 931096