SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Case study: Building bi-directional DR

Joep Piscaer, VMware vExpert, VCDX #101
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Introduction
 Joep Piscaer
   ● Consulting Architect at OGD ict-diensten
   ● VMware VCDX5 #101, vExpert 2009, 2011, 2012
   ● Know Veeam since 2007 and in love with them ever since
     (best. VMworld. parties. ever.)
Agenda
   Introduction
   Project goals, requirements and constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Project Goals
 Replace current (cloud-based) DR solution
   ● RTO is weeks; all data needs to be replicated back to our site
   ● RPO is ∞; current solution doesn‟t have all data (no OS-disks)
   ● Backups are not application-level consistent
 Develop bi-directional DR for two separate infrastructures
   ● We have a internal IT infrastructure but also a cloud service provider
     infrastructure; both are highly virtualized
 „Eat your own dogfood‟
   ● Use hard- and software we known implement at customers regularly
Project Requirements
 RTO and RPO need to be improved
   ● RTO needs to be reduced from „weeks‟ to „hours‟
   ● RPO needs to be reduced from „∞‟ to „a day‟
 Transactionally consistent application state
  back-ups of mission critical and other VMs
 Be able to restore individual application and file items but
  also complete disks, virtual machines and clusters.
   ● Backup needs to be available locally for fast application item restores
   ● Backup needs to be available remotely for DR-purposes
Project Constraints
 WAN link too slow to do initial seed
   ● During first months of project, WAN capacity was only 20 Mbit/s
     After project completion, WAN capacity upped to 100 Mbit/s
   ● We used two 16x 1TB SuperMicro servers as temporary backup repository;
     offline initial seed by physically swapping them
     and mapping jobs to backup files
 Due to „eat your own dogfood‟, we could only use a very
  limited set of products, including Veeam B&R
Customer Background
 2 infrastructures on separate sites
 About 25 hosts in 3 clusters
 About 250 VMs total
   ● PXE-booted Citrix XenApp VMs (excluded from backup)
   ● Exchange and Zarafa groupware environments
      − Additional scripts required to create consistent snapshots for Zarafa
   ● About 12 TB of data
 About 1000 users overall
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Overview
 Two Dell PowerEdge R510 servers with 24 GB RAM
   ● 12x 3TB nearline SAS disks in RAID-6 for 27 TB backup repository
   ● 4 gigabit NICs dedicated to iSCSI for direct SAN access
 Two Veeam v6.0 consoles / installations
 Distributed backup architecture
   ● Multiple proxies
   ● Multiple repositories
 Enterprise Manager installed on one site
Overview
 We leveraged
  ●   Application-aware image processing
  ●   Instant VM Recovery and vPower NFS
  ●   Virtual Labs
  ●   SureBackup
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Overall architecture
  ● Used Veeam‟s distributed architecture for
    proxies and repositories, with a twist
  ● We installed two separate „stretched‟
    backup infrastructures on two hosts so
    we can restore and continue local backup
    jobs if the remote site fails
  ● Two consoles to separate administrative
    domains for each IT operations teams
  ● Enterprise Manager as a
    single point of management for licensing
    and file level restores
Design Choices
 Each host has one proxy;
 Each host has two repositories
   ● One owned by the local console
   ● One owned by the remote console
 Local SQL Express databases
   ● Size of environment didn‟t require move to „full‟ SQL Server
   ● Totally independent backup environment required
 Use Application-aware image processing for consistency
 Optimize all jobs for WAN replication
Job Type
 We had no space available on primary SAN storage
   ● „Replication‟ job type replicates from proxy to proxy, requires standby host
     and can only store in native (VMX/VMDK) format on VMFS datastores
 Therefore, we chose „regular‟ job type
   ● We cannot replicate the local backup.
     This means each VM would be touched twice every day:
     once by the job that stores the VM on the local repository
     once by the job that stores the VM on the remote repository
Deduplication and Compression
 Compression set to „best‟ for all jobs
 Deduplication happens both at source
   ● before data is sent to repository, significantly improving performance
 and at target
   ● to achieve additional reduction for jobs with multiple VMs
 Block level deduplication optimized for WAN
   ● Using 256KB block size instead of default 1024KB size
 If anything happens to target we can seed the locally stored
  backups again using the two 16x 1TB SuperMicro servers
  CBT is designed to handle such usage cases.
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Job Type
 Advantages of Replication job type:
   ● Files stored in native VMware format
   ● Restores are parallel
   ● No need to choose between reverse and forward incremental
   ● No vPower NFS or Instant VM recovery needed
       − VMs run at full I/O speed
       − Number of VMs that can be powered on depends on infrastructure
         no dependency on backup server
       − No additional migration like Storage vMotion needed after recovery
   ● Advanced features like re-IP, failback available
Job Type
 Pitfalls of using regular jobs:
   ● Files stored in Veeam file format; manual interaction required to restore
       − This increases RTO and makes restores sequential
       − Other solutions provide (semi-)parallel restores, keeping RTO down
   ● Need to make difficult choice between reversed and forward incremental
   ● Instant VM Recovery uses vPower NFS;
       − recovered VMs will not run at full I/O speed
       − affects number of VMs that can be powered on after total site failure
       − Storage vMotion required to complete recovery of each VM
   ● No Re-IP and failback and other specific functionality
     available in „Replication‟ job type
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Backup Method
 We chose „reverse incremental‟:
   ● Uses the least amount of disk space to store backups
      − We wanted to maximize retention for jobs stored on local repository.
         We set retention for jobs stored on remote repository to two restore
         point as these backups are only for DR-purposes
   ● Calculations to produce reverse incremental done on (remote) repository
      − Full backup file is rebuilt every day on remote repository
      − Our physical backup servers have enough oomph to handle 3x I/O load
      − Minimize stress on WAN link; only changed blocks are sent over WAN
   ● Last backup is always full
      − No periodic full needed
      − Imagine replicating a full backup of every VM over WAN every week 
Why not replicate the local backup?
 Veeam‟s distributed architecture supports this use case:
   ● Proxy and repository at local site for local backups
   ● Proxy at local site and repository at remote site for remote backups
 No suitable Windows-based tool was found that does block
  level replication
   ● None of the tools integrate with Veeam‟s proxy architecture
     and aren‟t intelligent enough to understand the reverse incremental files
     to do smart changed block replication
     These tools take forever to create changed block indexes and begin
     replication which has a very negative effect on RPO
   ● Didn‟t want to use custom (Powershell) scripts
     makes your solution harder to manage, upgrade and support
Evaluation of lessons learned
 Re-evaluate job type: „regular‟ or „replication‟
   ● Replication job type requires standby host and capacity on SAN
      − We didn‟t anticipate this and didn‟t have SAN capacity to spare
   ● Regular jobs require touching source VM‟s twice
      − Wasn‟t a problem at first, started to become cumbersome as we grew
 Re-evaluate forward or reverse incremental modes for
  replication
   ● Both have pros and cons for replication; choice is very hard to make
   ● We chose to maximize retention and use reverse incremental
 Re-evaluate Hot Add (“Virtual Appliance”) mode
   ● Much faster compared to Direct SAN Acces with thin provisioned disks
   ● Restores are much faster compared to Direct SAN Access mode
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Live Demo
Agenda
   Introduction
   Project description, goals, requirements, constraints
   High level overview: product and component overview
   Backup and DR Approach and architecture
   How to improve RTO (Recovery Time Objective)
   Find your bottlenecks
   Technical deep dive and live demo
   Q&A
Q&A

Weitere ähnliche Inhalte

Was ist angesagt?

Scott Schnoll - Exchange server 2013 virtualization best practices
Scott Schnoll - Exchange server 2013 virtualization best practicesScott Schnoll - Exchange server 2013 virtualization best practices
Scott Schnoll - Exchange server 2013 virtualization best practicesNordic Infrastructure Conference
 
Tuning DB2 in a Solaris Environment
Tuning DB2 in a Solaris EnvironmentTuning DB2 in a Solaris Environment
Tuning DB2 in a Solaris EnvironmentJignesh Shah
 
Vmware srm 6.1
Vmware srm 6.1Vmware srm 6.1
Vmware srm 6.1faz4eva_27
 
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Paula Koziol
 
Windows Server 2012 R2 Hyper-V Replica
Windows Server 2012 R2 Hyper-V ReplicaWindows Server 2012 R2 Hyper-V Replica
Windows Server 2012 R2 Hyper-V ReplicaRavikanth Chaganti
 
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...Nagios
 
Deploying Maximum HA Architecture With PostgreSQL
Deploying Maximum HA Architecture With PostgreSQLDeploying Maximum HA Architecture With PostgreSQL
Deploying Maximum HA Architecture With PostgreSQLDenish Patel
 
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best PracticesVMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best PracticesVMworld
 
UKGSE DB2 pureScale
UKGSE DB2 pureScaleUKGSE DB2 pureScale
UKGSE DB2 pureScaleLaura Hood
 
VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server VMworld
 
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...VMworld
 
SQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoverySQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoveryMichael Poremba
 
PostgreSQL and Benchmarks
PostgreSQL and BenchmarksPostgreSQL and Benchmarks
PostgreSQL and BenchmarksJignesh Shah
 
SRM versus Stretched Clusters: Choosing the Right Solution
SRM versus Stretched Clusters: Choosing the Right SolutionSRM versus Stretched Clusters: Choosing the Right Solution
SRM versus Stretched Clusters: Choosing the Right SolutionScott Lowe
 
Dell data protection & performance management solutions
Dell data protection & performance management solutionsDell data protection & performance management solutions
Dell data protection & performance management solutionsDell Data Protection Software
 
12 cool features in defrag 12
12 cool features in defrag 1212 cool features in defrag 12
12 cool features in defrag 12aosborne
 
VMworld 2014: Virtualizing Databases
VMworld 2014: Virtualizing DatabasesVMworld 2014: Virtualizing Databases
VMworld 2014: Virtualizing DatabasesVMworld
 
Educational seminar lessons learned from customer db2 for z os health check...
Educational seminar   lessons learned from customer db2 for z os health check...Educational seminar   lessons learned from customer db2 for z os health check...
Educational seminar lessons learned from customer db2 for z os health check...John Campbell
 
Inf bco2891 release candidate v11 copy
Inf bco2891 release candidate v11 copyInf bco2891 release candidate v11 copy
Inf bco2891 release candidate v11 copyAbdul Rasheed
 

Was ist angesagt? (20)

Scott Schnoll - Exchange server 2013 virtualization best practices
Scott Schnoll - Exchange server 2013 virtualization best practicesScott Schnoll - Exchange server 2013 virtualization best practices
Scott Schnoll - Exchange server 2013 virtualization best practices
 
Double-Take Software
Double-Take SoftwareDouble-Take Software
Double-Take Software
 
Tuning DB2 in a Solaris Environment
Tuning DB2 in a Solaris EnvironmentTuning DB2 in a Solaris Environment
Tuning DB2 in a Solaris Environment
 
Vmware srm 6.1
Vmware srm 6.1Vmware srm 6.1
Vmware srm 6.1
 
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
 
Windows Server 2012 R2 Hyper-V Replica
Windows Server 2012 R2 Hyper-V ReplicaWindows Server 2012 R2 Hyper-V Replica
Windows Server 2012 R2 Hyper-V Replica
 
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...
Nagios Conference 2014 - Jeremy Rust - Avoiding Downtime Using Linux High Ava...
 
Deploying Maximum HA Architecture With PostgreSQL
Deploying Maximum HA Architecture With PostgreSQLDeploying Maximum HA Architecture With PostgreSQL
Deploying Maximum HA Architecture With PostgreSQL
 
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best PracticesVMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
 
UKGSE DB2 pureScale
UKGSE DB2 pureScaleUKGSE DB2 pureScale
UKGSE DB2 pureScale
 
VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server
 
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...
VMworld 2013: A Technical Deep Dive on VMware Horizon View 5.2 Performance an...
 
SQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoverySQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster Recovery
 
PostgreSQL and Benchmarks
PostgreSQL and BenchmarksPostgreSQL and Benchmarks
PostgreSQL and Benchmarks
 
SRM versus Stretched Clusters: Choosing the Right Solution
SRM versus Stretched Clusters: Choosing the Right SolutionSRM versus Stretched Clusters: Choosing the Right Solution
SRM versus Stretched Clusters: Choosing the Right Solution
 
Dell data protection & performance management solutions
Dell data protection & performance management solutionsDell data protection & performance management solutions
Dell data protection & performance management solutions
 
12 cool features in defrag 12
12 cool features in defrag 1212 cool features in defrag 12
12 cool features in defrag 12
 
VMworld 2014: Virtualizing Databases
VMworld 2014: Virtualizing DatabasesVMworld 2014: Virtualizing Databases
VMworld 2014: Virtualizing Databases
 
Educational seminar lessons learned from customer db2 for z os health check...
Educational seminar   lessons learned from customer db2 for z os health check...Educational seminar   lessons learned from customer db2 for z os health check...
Educational seminar lessons learned from customer db2 for z os health check...
 
Inf bco2891 release candidate v11 copy
Inf bco2891 release candidate v11 copyInf bco2891 release candidate v11 copy
Inf bco2891 release candidate v11 copy
 

Ähnlich wie Veeam Webinar - Case study: building bi-directional DR

5 Ways Your Backup Design Can Impact Virtualized Data Protection
5 Ways Your Backup Design Can Impact Virtualized Data Protection5 Ways Your Backup Design Can Impact Virtualized Data Protection
5 Ways Your Backup Design Can Impact Virtualized Data ProtectionStorage Switzerland
 
Veeam backup and replication v5
Veeam backup and replication v5Veeam backup and replication v5
Veeam backup and replication v5henkarts
 
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part Two
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part TwoVMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part Two
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part TwoVMworld
 
The road to enterprise ready open stack storage as service
The road to enterprise ready open stack storage as serviceThe road to enterprise ready open stack storage as service
The road to enterprise ready open stack storage as serviceSean Cohen
 
Veeam Webinar - Backing up Zarafa with SureBackup
Veeam Webinar - Backing up Zarafa with SureBackupVeeam Webinar - Backing up Zarafa with SureBackup
Veeam Webinar - Backing up Zarafa with SureBackupJoep Piscaer
 
NetBackup Appliance Family presentation
NetBackup Appliance Family presentationNetBackup Appliance Family presentation
NetBackup Appliance Family presentationSymantec
 
HPE + Veeam Technical Hands ON Workshop #1
HPE + Veeam Technical Hands ON Workshop #1HPE + Veeam Technical Hands ON Workshop #1
HPE + Veeam Technical Hands ON Workshop #1Tanawit Chansuchai
 
PHDVirtual Backups for VMware
PHDVirtual Backups for VMwarePHDVirtual Backups for VMware
PHDVirtual Backups for VMwareDevansh Chowdhary
 
Backup Exec Blueprints▶ Deduplication
Backup Exec Blueprints▶ DeduplicationBackup Exec Blueprints▶ Deduplication
Backup Exec Blueprints▶ DeduplicationSymantec
 
VMworld 2014: Data Protection for vSphere 101
VMworld 2014: Data Protection for vSphere 101VMworld 2014: Data Protection for vSphere 101
VMworld 2014: Data Protection for vSphere 101VMworld
 
The Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup ExperiencesThe Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup Experiencesglbsolutions
 
IBM MQ Disaster Recovery
IBM MQ Disaster RecoveryIBM MQ Disaster Recovery
IBM MQ Disaster RecoveryMarkTaylorIBM
 
Containerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudSubbu Rama
 
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed Service
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed ServiceCloud-Native Patterns and the Benefits of MySQL as a Platform Managed Service
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed ServiceVMware Tanzu
 
Veeam Backup & Replication Tips and Tricks
Veeam Backup & Replication Tips and TricksVeeam Backup & Replication Tips and Tricks
Veeam Backup & Replication Tips and TricksVeeam Software
 
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...Andrew Miller
 
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStackBackroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStackShapeBlue
 
NICConf 2015 - azure disaster recovery in 60min
NICConf 2015 -  azure disaster recovery in 60minNICConf 2015 -  azure disaster recovery in 60min
NICConf 2015 - azure disaster recovery in 60minPeter De Tender
 
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...LarryZaman
 
Virtualizing Tier One Applications - Varrow
Virtualizing Tier One Applications - VarrowVirtualizing Tier One Applications - Varrow
Virtualizing Tier One Applications - VarrowAndrew Miller
 

Ähnlich wie Veeam Webinar - Case study: building bi-directional DR (20)

5 Ways Your Backup Design Can Impact Virtualized Data Protection
5 Ways Your Backup Design Can Impact Virtualized Data Protection5 Ways Your Backup Design Can Impact Virtualized Data Protection
5 Ways Your Backup Design Can Impact Virtualized Data Protection
 
Veeam backup and replication v5
Veeam backup and replication v5Veeam backup and replication v5
Veeam backup and replication v5
 
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part Two
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part TwoVMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part Two
VMworld 2013: Implementing a Holistic BC/DR Strategy with VMware - Part Two
 
The road to enterprise ready open stack storage as service
The road to enterprise ready open stack storage as serviceThe road to enterprise ready open stack storage as service
The road to enterprise ready open stack storage as service
 
Veeam Webinar - Backing up Zarafa with SureBackup
Veeam Webinar - Backing up Zarafa with SureBackupVeeam Webinar - Backing up Zarafa with SureBackup
Veeam Webinar - Backing up Zarafa with SureBackup
 
NetBackup Appliance Family presentation
NetBackup Appliance Family presentationNetBackup Appliance Family presentation
NetBackup Appliance Family presentation
 
HPE + Veeam Technical Hands ON Workshop #1
HPE + Veeam Technical Hands ON Workshop #1HPE + Veeam Technical Hands ON Workshop #1
HPE + Veeam Technical Hands ON Workshop #1
 
PHDVirtual Backups for VMware
PHDVirtual Backups for VMwarePHDVirtual Backups for VMware
PHDVirtual Backups for VMware
 
Backup Exec Blueprints▶ Deduplication
Backup Exec Blueprints▶ DeduplicationBackup Exec Blueprints▶ Deduplication
Backup Exec Blueprints▶ Deduplication
 
VMworld 2014: Data Protection for vSphere 101
VMworld 2014: Data Protection for vSphere 101VMworld 2014: Data Protection for vSphere 101
VMworld 2014: Data Protection for vSphere 101
 
The Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup ExperiencesThe Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup Experiences
 
IBM MQ Disaster Recovery
IBM MQ Disaster RecoveryIBM MQ Disaster Recovery
IBM MQ Disaster Recovery
 
Containerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the Cloud
 
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed Service
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed ServiceCloud-Native Patterns and the Benefits of MySQL as a Platform Managed Service
Cloud-Native Patterns and the Benefits of MySQL as a Platform Managed Service
 
Veeam Backup & Replication Tips and Tricks
Veeam Backup & Replication Tips and TricksVeeam Backup & Replication Tips and Tricks
Veeam Backup & Replication Tips and Tricks
 
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...
Varrow Q4 Lunch & Learn Presentation - Virtualizing Business Critical Applica...
 
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStackBackroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
 
NICConf 2015 - azure disaster recovery in 60min
NICConf 2015 -  azure disaster recovery in 60minNICConf 2015 -  azure disaster recovery in 60min
NICConf 2015 - azure disaster recovery in 60min
 
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...
Business_Continuity_Planning_with_SQL_Server_HADR_options_TechEd_Bangalore_20...
 
Virtualizing Tier One Applications - Varrow
Virtualizing Tier One Applications - VarrowVirtualizing Tier One Applications - Varrow
Virtualizing Tier One Applications - Varrow
 

Veeam Webinar - Case study: building bi-directional DR

  • 1. Case study: Building bi-directional DR Joep Piscaer, VMware vExpert, VCDX #101
  • 2. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 3. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 4. Introduction  Joep Piscaer ● Consulting Architect at OGD ict-diensten ● VMware VCDX5 #101, vExpert 2009, 2011, 2012 ● Know Veeam since 2007 and in love with them ever since (best. VMworld. parties. ever.)
  • 5. Agenda  Introduction  Project goals, requirements and constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 6. Project Goals  Replace current (cloud-based) DR solution ● RTO is weeks; all data needs to be replicated back to our site ● RPO is ∞; current solution doesn‟t have all data (no OS-disks) ● Backups are not application-level consistent  Develop bi-directional DR for two separate infrastructures ● We have a internal IT infrastructure but also a cloud service provider infrastructure; both are highly virtualized  „Eat your own dogfood‟ ● Use hard- and software we known implement at customers regularly
  • 7. Project Requirements  RTO and RPO need to be improved ● RTO needs to be reduced from „weeks‟ to „hours‟ ● RPO needs to be reduced from „∞‟ to „a day‟  Transactionally consistent application state back-ups of mission critical and other VMs  Be able to restore individual application and file items but also complete disks, virtual machines and clusters. ● Backup needs to be available locally for fast application item restores ● Backup needs to be available remotely for DR-purposes
  • 8. Project Constraints  WAN link too slow to do initial seed ● During first months of project, WAN capacity was only 20 Mbit/s After project completion, WAN capacity upped to 100 Mbit/s ● We used two 16x 1TB SuperMicro servers as temporary backup repository; offline initial seed by physically swapping them and mapping jobs to backup files  Due to „eat your own dogfood‟, we could only use a very limited set of products, including Veeam B&R
  • 9. Customer Background  2 infrastructures on separate sites  About 25 hosts in 3 clusters  About 250 VMs total ● PXE-booted Citrix XenApp VMs (excluded from backup) ● Exchange and Zarafa groupware environments − Additional scripts required to create consistent snapshots for Zarafa ● About 12 TB of data  About 1000 users overall
  • 10. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 11. Overview  Two Dell PowerEdge R510 servers with 24 GB RAM ● 12x 3TB nearline SAS disks in RAID-6 for 27 TB backup repository ● 4 gigabit NICs dedicated to iSCSI for direct SAN access  Two Veeam v6.0 consoles / installations  Distributed backup architecture ● Multiple proxies ● Multiple repositories  Enterprise Manager installed on one site
  • 12. Overview  We leveraged ● Application-aware image processing ● Instant VM Recovery and vPower NFS ● Virtual Labs ● SureBackup
  • 13. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 14. Overall architecture ● Used Veeam‟s distributed architecture for proxies and repositories, with a twist ● We installed two separate „stretched‟ backup infrastructures on two hosts so we can restore and continue local backup jobs if the remote site fails ● Two consoles to separate administrative domains for each IT operations teams ● Enterprise Manager as a single point of management for licensing and file level restores
  • 15. Design Choices  Each host has one proxy;  Each host has two repositories ● One owned by the local console ● One owned by the remote console  Local SQL Express databases ● Size of environment didn‟t require move to „full‟ SQL Server ● Totally independent backup environment required  Use Application-aware image processing for consistency  Optimize all jobs for WAN replication
  • 16. Job Type  We had no space available on primary SAN storage ● „Replication‟ job type replicates from proxy to proxy, requires standby host and can only store in native (VMX/VMDK) format on VMFS datastores  Therefore, we chose „regular‟ job type ● We cannot replicate the local backup. This means each VM would be touched twice every day: once by the job that stores the VM on the local repository once by the job that stores the VM on the remote repository
  • 17. Deduplication and Compression  Compression set to „best‟ for all jobs  Deduplication happens both at source ● before data is sent to repository, significantly improving performance  and at target ● to achieve additional reduction for jobs with multiple VMs  Block level deduplication optimized for WAN ● Using 256KB block size instead of default 1024KB size  If anything happens to target we can seed the locally stored backups again using the two 16x 1TB SuperMicro servers CBT is designed to handle such usage cases.
  • 18. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 19. Job Type  Advantages of Replication job type: ● Files stored in native VMware format ● Restores are parallel ● No need to choose between reverse and forward incremental ● No vPower NFS or Instant VM recovery needed − VMs run at full I/O speed − Number of VMs that can be powered on depends on infrastructure no dependency on backup server − No additional migration like Storage vMotion needed after recovery ● Advanced features like re-IP, failback available
  • 20. Job Type  Pitfalls of using regular jobs: ● Files stored in Veeam file format; manual interaction required to restore − This increases RTO and makes restores sequential − Other solutions provide (semi-)parallel restores, keeping RTO down ● Need to make difficult choice between reversed and forward incremental ● Instant VM Recovery uses vPower NFS; − recovered VMs will not run at full I/O speed − affects number of VMs that can be powered on after total site failure − Storage vMotion required to complete recovery of each VM ● No Re-IP and failback and other specific functionality available in „Replication‟ job type
  • 21. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 22. Backup Method  We chose „reverse incremental‟: ● Uses the least amount of disk space to store backups − We wanted to maximize retention for jobs stored on local repository. We set retention for jobs stored on remote repository to two restore point as these backups are only for DR-purposes ● Calculations to produce reverse incremental done on (remote) repository − Full backup file is rebuilt every day on remote repository − Our physical backup servers have enough oomph to handle 3x I/O load − Minimize stress on WAN link; only changed blocks are sent over WAN ● Last backup is always full − No periodic full needed − Imagine replicating a full backup of every VM over WAN every week 
  • 23. Why not replicate the local backup?  Veeam‟s distributed architecture supports this use case: ● Proxy and repository at local site for local backups ● Proxy at local site and repository at remote site for remote backups  No suitable Windows-based tool was found that does block level replication ● None of the tools integrate with Veeam‟s proxy architecture and aren‟t intelligent enough to understand the reverse incremental files to do smart changed block replication These tools take forever to create changed block indexes and begin replication which has a very negative effect on RPO ● Didn‟t want to use custom (Powershell) scripts makes your solution harder to manage, upgrade and support
  • 24. Evaluation of lessons learned  Re-evaluate job type: „regular‟ or „replication‟ ● Replication job type requires standby host and capacity on SAN − We didn‟t anticipate this and didn‟t have SAN capacity to spare ● Regular jobs require touching source VM‟s twice − Wasn‟t a problem at first, started to become cumbersome as we grew  Re-evaluate forward or reverse incremental modes for replication ● Both have pros and cons for replication; choice is very hard to make ● We chose to maximize retention and use reverse incremental  Re-evaluate Hot Add (“Virtual Appliance”) mode ● Much faster compared to Direct SAN Acces with thin provisioned disks ● Restores are much faster compared to Direct SAN Access mode
  • 25. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 27. Agenda  Introduction  Project description, goals, requirements, constraints  High level overview: product and component overview  Backup and DR Approach and architecture  How to improve RTO (Recovery Time Objective)  Find your bottlenecks  Technical deep dive and live demo  Q&A
  • 28. Q&A

Hinweis der Redaktion

  1. Explain we have one service desk thatneedstorestore files frombothinfrastructuresExplain we have Veeam Service Provider licenses; installed the samelicense on bothbackup servers. Socket countmanagedby Enterprise Manager
  2. Explainrepositoriesfromtwo points of view:Host: each host has tworepositories: oneownedbylocalinstall, oneownedby remote installConsole: Each console has tworepositories: one on the local server, one on the remote server.
  3. Stored in Veeam file format on repositories: deduped and compressed!Local Jobs are ‘LAN-free’ (Direct SAN access)Data flow for remote jobs:SAN > Local source proxy > WAN > Remote target repository
  4. Explain ‘anything happens’: data corruption on target, proxy failing and leaving backup job/file in an inconsistent state, etc. Anything that requires re-sync of backups files.
  5. Explain these were real drawbacks we neededtocopewith in our project.Explainthat we mightreconsider; refertolessonslearned / evaluation
  6. Explain the ‘problem’ first: you’dliketo make a backuplocally (proxy andrepositorylocal) andreplicate the producedbackup files remotelyfor DR.2 problemshere:Block-level replication on Windows platform: not a lot of tools can do thatNone of those are abletoreadthrough a file todeterminechangedblocksquickenough; itliterallywould take dayswithrsync
  7. Idealsituation: have all the pros of regular jobs (deduped, compressed, stored on repositoryinstead of vmfs datastore) but without needingtotoucheach source VM twice! In otherwords: replicate the files of analreadycompleted ‘regular’ job to a second repositoryusing the proxy architecture, comparabletohow the replication job type uses source and target proxiesEachbackup servers stores localbackups (which are reverse incrementalby default) and remote backups (which are forward incrementalby default). We chose reverse incrementalsfor remote backupsduetospaceconstraints (maximizingretention). Also, we wantedtobeableto do repeataninitialseed of anythinghappenedto the remote backups; thisrequiresidenticalsettingsforall (bothlocaland remote) jobs: reverse.Re-evaluatenumber of I/O dueto reverse incremental. Bigger impact thanexpected. Source: http://www.vuemuer.it/en/?p=3027