SlideShare ist ein Scribd-Unternehmen logo
1 von 9
High Availability Microsoft SQL
     Server Database Architecture
VM HA and Symantec Application Availability vs. Microsoft Clustering


                                                   February 2012
The Problem
Difference of opinion building a “High Availability” database environment.



    Infrastructure Team prefers: VMware High Availability
                                 & Symantec ApplicationHA



    Architecture Team prefers:   Microsoft Failover Clustering
Factors driving the difference of opinion
            Infrastructure Team                                                Architecture Team
•   Prefer VM HA / ApplicationHA because, out of the box, it    •   Prefer MS Clustering because it is well integrated at
    provides high availability without the cost or complexity       application level and industry best practice
    of traditional clustering solutions
                                                                •   Unfamiliar with VMware HA and Symantec
•   Unfamiliar with MS Clustering Services                          ApplicationHA

•   Restricts use of VMotion dynamic scaling. Moving            •   Concerned that ApplicationHA will not recognize all
    Clustered Applications between Blades will require Guest        circumstances that cause application unavailability
    OS Downtime
                                                                •   Undefined scripting effort required for application
•   Clustering adds complexity to backup procedures                 monitoring with VM HA and continuing M&O will be
                                                                    required to support scripts

                                                                •   Concerned that VM HA and present M&O support will
                                                                    not deliver required solution availability during hours of
                                                                    operation
HA Drivers (subset)
                                                                         Availability
Clinical Application                     Business Days   Hours of Use   Requirements
Appeals Tracking                            7 Days       0700 - 1900       99.999
Document Management System                  7 Days       0600 - 1800       99.999
SharePoint                                  7 Days       0700- 1900        99.99
Clinical Operations Review System           5 Days       0800 - 1700       99.999
Dental Imaging                                  Clustering Mandated by Vendor
Dictation and Transcription                 7 Days        24 Hours         99.99
Digital Signature                           7 Days        24 Hours         99.99
Information Portal                          7 Days        24 Hours         99.99
Radiology Information System                    Clustering Mandated by Vendor


  Business days, hours of use, and availability requirements were
  obtained from available business requirements documents and
  verbally from user leadership.
Microsoft Clustering
 Pros
 •   Supports application level awareness
 •   Will survive a single node OS system crash
 •   Redundant Node in the event of a SQL Node failure
 •   Minimizes downtime
 •   Permits an automatic response to a failed server or software (no human intervention)
 •   Supports upgrades without forcing users off the system for extended periods of time
 •   Applications connected to SQL remain available while maintenance/patching is
     performed on the redundant Node
 •   Doesn’t require any servers to be renamed - when failover occurs, it is transparent to
     end-users
 •   Faster recovery during HA events i.e.. Node BSOD, SQL connection or authentication
     failures
 •   Failing back is quick, and can be done once the primary server if fixed and put back on-
     line
 •   Is a Microsoft supported solution
 •   Works without snapshots
Microsoft Clustering
 Cons
 •   Additional Cost to deploy and maintain the redundant Nodes
 •   Potential added environment cost for active/passive implementations
 •   Decreased use of VM functionality (no VMotion…)
 •   Added implementation and management complexity
 •   Requires more experienced DBAs and network administrators
 •   Complexity added to SQL and VMware environment
 •   Any HA event requires server admin and or DBA interaction, anywhere from Node reboot to
     rebuild [not self healing]
 •   In a situation where both Nodes have failed recovery time may be greatly increased due to
     the added complexity
 •   No Snapshot or Full Virtual Machine backup option available – a Node or Cluster loss could
     require a rebuild (RTO=days not hours) – This is a wash, backup / recovery options exactly
     the same for HA vs. Clustering due to SQL not supporting snapshots
 •   VMware Host patching/maintenance would have to be done after hours and would require
     DBA participation - Would potentially require a DBA, would NOT require after hours
     (failover can be forced)
 •   VMware Functionality is reduced for all Clustered SQL Nodes i.e.. Snapshot, vMotion, DRS,
     Storage DRS, Storage vMotion – Snapshotting not supported
VM HA + ApplicationHA
     Pros
     •   Eliminates the need for dedicated standby hardware and the installation of additional
         software
     •   Less infrastructure implementation effort
     •   Supports full range of VM functionality (leads to maximized resource utilization)
     •   Reduced implementation and management complexity
     •   Application agnostic
     •   Reduced Cost due to the fact that no redundant Node is necessary for HA
     •   Reduced Complexity for SQL and VMWare environments – This is not accurate, if you
         add in the Symantec ApplicationHA, at best this is a wash, at worst you’ve create a
         new development M&O project which is infinitely more complex than additional
         hardware.
     •   In a situation where the SQL Server has failed entirely recovery time is much shorter
         since we will leverage a complete Virtual Machine recovery option through Symantec
         NetBackup (RTO=minutes or hours) – See note before, this is a wash, bare metal
         recovery will be required in either situation since snapshots aren’t supported.
     •   VMWare Host patching/maintenance could be accomplished without after hour
         maintenance windows or DBA participation
     •   In many HA events i.e.. SQL connection or authentication failures, Application HA can
         take action against individual Windows and SQL components eliminated reboot as the
         only option for resolution [self healing] – This concept of self healing vs. not self healing
         is a red herring, if the server dies and anything except a reboot is required, neither
         setup is “self healing”
     •   Full VMWare Functionality can be realized for the SQL Servers i.e.. Snapshot, vMotion,
         DRS, Storage DRS, Storage vMotion - Again, snapshots not supported
VM HA + ApplicationHA
     Cons
     •   Added application dev implementation effort to support application awareness,
         and continuing M&O (additional coverage required)
     •   Added complexity – multiple components of HA solution
     •   OS crash will result in down time and requires human intervention
     •   If VMHA fails to recognize system crash, human intervention is required
     •   Added application dev implementation effort to support application awareness,
         and continuing M&O (additional coverage required)
     •   Requires snapshotting
            – (Snapshotting of SQL and SharePoint is not supported by Microsoft due to
                data corruption issues)
     •   Some HA events may require the Server to be restarted which could take
         approximately 30-60 seconds i.e.. BSOD, SQL connection or authentication
         failures that Application HA was not able to resolve
     •   Applications connected to SQL are not available while maintenance/patching is
         performed on the SQL Server during scheduled maintenance windows. If
         something happens to the server during patching, full recovery must be
         executed before service availability returns.
     •   To adhere to VMware recommended best practices to achieve true HA, a hot
         standby database server with SQL Server running and replication established
         between the two databases, must be running. In the event of a failure, an
         application developer must manually redirect the application. This is added
         DBA complexity and added reliance on AppDev
Business Sets Availability Requirements!
      Availability            Downtime                  Downtime
     90% (1-nine)            36.5 days/year
     99% (2-nines)           3.65 days/year
     99.9% (3-nines)         8.76 hours/year           10 minutes/week
     99.99% (4-nines)        52 minutes/year           1 minute/week
     99.999% (5-nines)       5 minutes/year            6 seconds/week
     99.9999% (6-nines)      31 seconds/year !
Need to determine if availability is measured:

1)   During operational time (i.e. expected use) which does not included schedule
     maintenance windows

2)   On a 24 hr basis which includes non-operational time

Weitere ähnliche Inhalte

Andere mochten auch

Always on in SQL Server 2012
Always on in SQL Server 2012Always on in SQL Server 2012
Always on in SQL Server 2012Fadi Abdulwahab
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerAlexDepo
 
Application HA in Virtual Environments
Application HA in Virtual EnvironmentsApplication HA in Virtual Environments
Application HA in Virtual EnvironmentsArrow ECS UK
 
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005Dudy Ali
 
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...In-Memory Computing Summit
 
Migrate Microsoft Access to SQL Server
Migrate Microsoft Access to SQL ServerMigrate Microsoft Access to SQL Server
Migrate Microsoft Access to SQL ServerADNUG
 
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...IAEME Publication
 
Building Energy 2014: PV and SHW Design basics by Fortunat Mueller
Building Energy 2014: PV and SHW Design basics by Fortunat MuellerBuilding Energy 2014: PV and SHW Design basics by Fortunat Mueller
Building Energy 2014: PV and SHW Design basics by Fortunat Muellerfortunatmueller
 
Jornadas Internacionales sobre Género y Discurso Político
Jornadas Internacionales sobre Género y Discurso PolíticoJornadas Internacionales sobre Género y Discurso Político
Jornadas Internacionales sobre Género y Discurso PolíticoUniversidad de Sevilla
 
10223-60712_CLS Campaign Report 2015-proof
10223-60712_CLS Campaign Report 2015-proof10223-60712_CLS Campaign Report 2015-proof
10223-60712_CLS Campaign Report 2015-proofSarah Palermo
 
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard Jones
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard JonesA Simple Trick to Tackle Permissions with Ease - Brandon Bernard Jones
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard JonesSalesforce Admins
 

Andere mochten auch (14)

Always on in SQL Server 2012
Always on in SQL Server 2012Always on in SQL Server 2012
Always on in SQL Server 2012
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL Server
 
Application HA in Virtual Environments
Application HA in Virtual EnvironmentsApplication HA in Virtual Environments
Application HA in Virtual Environments
 
J2 ee คืออะไร
J2 ee คืออะไรJ2 ee คืออะไร
J2 ee คืออะไร
 
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005
Administering Database - Pengenalan DBA dan Konfigurasi SQL Server 2005
 
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
 
Migrate Microsoft Access to SQL Server
Migrate Microsoft Access to SQL ServerMigrate Microsoft Access to SQL Server
Migrate Microsoft Access to SQL Server
 
Facebook
FacebookFacebook
Facebook
 
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...
STUDY OF EFFECT OF CONDENSING COVER MATERIALS ON THE PERFORMANCE OF A SOLAR S...
 
Building Energy 2014: PV and SHW Design basics by Fortunat Mueller
Building Energy 2014: PV and SHW Design basics by Fortunat MuellerBuilding Energy 2014: PV and SHW Design basics by Fortunat Mueller
Building Energy 2014: PV and SHW Design basics by Fortunat Mueller
 
Resume v2.1.3.2
Resume v2.1.3.2Resume v2.1.3.2
Resume v2.1.3.2
 
Jornadas Internacionales sobre Género y Discurso Político
Jornadas Internacionales sobre Género y Discurso PolíticoJornadas Internacionales sobre Género y Discurso Político
Jornadas Internacionales sobre Género y Discurso Político
 
10223-60712_CLS Campaign Report 2015-proof
10223-60712_CLS Campaign Report 2015-proof10223-60712_CLS Campaign Report 2015-proof
10223-60712_CLS Campaign Report 2015-proof
 
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard Jones
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard JonesA Simple Trick to Tackle Permissions with Ease - Brandon Bernard Jones
A Simple Trick to Tackle Permissions with Ease - Brandon Bernard Jones
 

Kürzlich hochgeladen

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 

Kürzlich hochgeladen (20)

Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 

Microsoft SQL Server Clustering vs. VMware HA

  • 1. High Availability Microsoft SQL Server Database Architecture VM HA and Symantec Application Availability vs. Microsoft Clustering February 2012
  • 2. The Problem Difference of opinion building a “High Availability” database environment. Infrastructure Team prefers: VMware High Availability & Symantec ApplicationHA Architecture Team prefers: Microsoft Failover Clustering
  • 3. Factors driving the difference of opinion Infrastructure Team Architecture Team • Prefer VM HA / ApplicationHA because, out of the box, it • Prefer MS Clustering because it is well integrated at provides high availability without the cost or complexity application level and industry best practice of traditional clustering solutions • Unfamiliar with VMware HA and Symantec • Unfamiliar with MS Clustering Services ApplicationHA • Restricts use of VMotion dynamic scaling. Moving • Concerned that ApplicationHA will not recognize all Clustered Applications between Blades will require Guest circumstances that cause application unavailability OS Downtime • Undefined scripting effort required for application • Clustering adds complexity to backup procedures monitoring with VM HA and continuing M&O will be required to support scripts • Concerned that VM HA and present M&O support will not deliver required solution availability during hours of operation
  • 4. HA Drivers (subset) Availability Clinical Application Business Days Hours of Use Requirements Appeals Tracking 7 Days 0700 - 1900 99.999 Document Management System 7 Days 0600 - 1800 99.999 SharePoint 7 Days 0700- 1900 99.99 Clinical Operations Review System 5 Days 0800 - 1700 99.999 Dental Imaging Clustering Mandated by Vendor Dictation and Transcription 7 Days 24 Hours 99.99 Digital Signature 7 Days 24 Hours 99.99 Information Portal 7 Days 24 Hours 99.99 Radiology Information System Clustering Mandated by Vendor Business days, hours of use, and availability requirements were obtained from available business requirements documents and verbally from user leadership.
  • 5. Microsoft Clustering Pros • Supports application level awareness • Will survive a single node OS system crash • Redundant Node in the event of a SQL Node failure • Minimizes downtime • Permits an automatic response to a failed server or software (no human intervention) • Supports upgrades without forcing users off the system for extended periods of time • Applications connected to SQL remain available while maintenance/patching is performed on the redundant Node • Doesn’t require any servers to be renamed - when failover occurs, it is transparent to end-users • Faster recovery during HA events i.e.. Node BSOD, SQL connection or authentication failures • Failing back is quick, and can be done once the primary server if fixed and put back on- line • Is a Microsoft supported solution • Works without snapshots
  • 6. Microsoft Clustering Cons • Additional Cost to deploy and maintain the redundant Nodes • Potential added environment cost for active/passive implementations • Decreased use of VM functionality (no VMotion…) • Added implementation and management complexity • Requires more experienced DBAs and network administrators • Complexity added to SQL and VMware environment • Any HA event requires server admin and or DBA interaction, anywhere from Node reboot to rebuild [not self healing] • In a situation where both Nodes have failed recovery time may be greatly increased due to the added complexity • No Snapshot or Full Virtual Machine backup option available – a Node or Cluster loss could require a rebuild (RTO=days not hours) – This is a wash, backup / recovery options exactly the same for HA vs. Clustering due to SQL not supporting snapshots • VMware Host patching/maintenance would have to be done after hours and would require DBA participation - Would potentially require a DBA, would NOT require after hours (failover can be forced) • VMware Functionality is reduced for all Clustered SQL Nodes i.e.. Snapshot, vMotion, DRS, Storage DRS, Storage vMotion – Snapshotting not supported
  • 7. VM HA + ApplicationHA Pros • Eliminates the need for dedicated standby hardware and the installation of additional software • Less infrastructure implementation effort • Supports full range of VM functionality (leads to maximized resource utilization) • Reduced implementation and management complexity • Application agnostic • Reduced Cost due to the fact that no redundant Node is necessary for HA • Reduced Complexity for SQL and VMWare environments – This is not accurate, if you add in the Symantec ApplicationHA, at best this is a wash, at worst you’ve create a new development M&O project which is infinitely more complex than additional hardware. • In a situation where the SQL Server has failed entirely recovery time is much shorter since we will leverage a complete Virtual Machine recovery option through Symantec NetBackup (RTO=minutes or hours) – See note before, this is a wash, bare metal recovery will be required in either situation since snapshots aren’t supported. • VMWare Host patching/maintenance could be accomplished without after hour maintenance windows or DBA participation • In many HA events i.e.. SQL connection or authentication failures, Application HA can take action against individual Windows and SQL components eliminated reboot as the only option for resolution [self healing] – This concept of self healing vs. not self healing is a red herring, if the server dies and anything except a reboot is required, neither setup is “self healing” • Full VMWare Functionality can be realized for the SQL Servers i.e.. Snapshot, vMotion, DRS, Storage DRS, Storage vMotion - Again, snapshots not supported
  • 8. VM HA + ApplicationHA Cons • Added application dev implementation effort to support application awareness, and continuing M&O (additional coverage required) • Added complexity – multiple components of HA solution • OS crash will result in down time and requires human intervention • If VMHA fails to recognize system crash, human intervention is required • Added application dev implementation effort to support application awareness, and continuing M&O (additional coverage required) • Requires snapshotting – (Snapshotting of SQL and SharePoint is not supported by Microsoft due to data corruption issues) • Some HA events may require the Server to be restarted which could take approximately 30-60 seconds i.e.. BSOD, SQL connection or authentication failures that Application HA was not able to resolve • Applications connected to SQL are not available while maintenance/patching is performed on the SQL Server during scheduled maintenance windows. If something happens to the server during patching, full recovery must be executed before service availability returns. • To adhere to VMware recommended best practices to achieve true HA, a hot standby database server with SQL Server running and replication established between the two databases, must be running. In the event of a failure, an application developer must manually redirect the application. This is added DBA complexity and added reliance on AppDev
  • 9. Business Sets Availability Requirements! Availability Downtime Downtime 90% (1-nine) 36.5 days/year 99% (2-nines) 3.65 days/year 99.9% (3-nines) 8.76 hours/year 10 minutes/week 99.99% (4-nines) 52 minutes/year 1 minute/week 99.999% (5-nines) 5 minutes/year 6 seconds/week 99.9999% (6-nines) 31 seconds/year ! Need to determine if availability is measured: 1) During operational time (i.e. expected use) which does not included schedule maintenance windows 2) On a 24 hr basis which includes non-operational time