SlideShare ist ein Scribd-Unternehmen logo
1 von 27
High Availability in the Cloud –
     Architecting Best Practices

        Watch the video of this webinar

                   May 6, 2011




1
Your Panel Today
Presenting:
• Michael Crandell, CEO Twitter: @michaelcrandell
• Josep Blanquer, Sr. Systems Architect
• Brian Adler, Professional Services Architect

Q&A:
• Jason Altobelli, Account Manager



 Please use the questions window to ask questions anytime!



 2
Agenda
• Design for Failure
• What happened in the AWS Outage
• RightScale’s experience
• Infrastructure abstraction and automation as building blocks
  for highly available applications
• Architectural options to protect against cloud failures
• Conclusions / Q&A




    Please use the questions window to ask questions anytime!

    3
Terminology
• Fault Tolerance
     • Fault tolerance leverages redundancy and replication to enable systems to
       continue operating properly if one or more components fails
• High Availability
     • Fault Tolerant systems are measured by their Availability in terms of
       planned and unplanned service outages for end users
         •   99% Availability = 3.65 days of downtime per year
         •   99.5% Availability = 1.83 days of downtime per year
         •   99.9% Availability = 8.76 hours of downtime per year
         •   99.99% Availability = 53 minutes of downtime per year
         •   99.999% Availability = 5.26 minutes of downtime per year

• Disaster Recovery
     • The process, policies and procedures related to restoring critical systems
       after a catastrophic event


 4
Design for Failure
• Large scale failures in the cloud are rare but happen
• Application owners are ultimately responsible for
  availability and recoverability
• Balance cost and complexity of HA efforts against
  risk you’re willing to bear
• Fortunately, cloud infrastructure has made DR and HA
  remarkably affordable versus past options
     •   Multi-server
     •   Multi-AZ
     •   Multi-region
     •   Multi-cloud




 5
Assessing Risk vs Cost in Aviation
 Fatalities per
1M Hours Flown


      20 Fatalities                          General
                                             Aviation

   10 Fatalities                  Commuter
                                   Airline

                      Large Air
      4 Fatalities
                       Carrier



                        $           $$         $$$      Cost of
                                                        Service


  6
What Happened in the April 21 AWS Outage?
•       Triggered by operator error during a router upgrade which funneled high-
        volume network traffic into a low-bandwidth control network used by EBS
•       Flooding of the control network caused a large number of EBS servers to
        be effectively isolated from one another, which broke volume replication,
        and caused these servers to start re-replicating the data to fresh servers
•       This large-scale re-replication storm in turn had two effects:
         • It failed in many cases causing the volumes to go offline for manual intervention
         • It flooded the EBS control plane with re-replication events that affected its operation
           across the entire us-east region
•       Steps taken by AWS:
         • Stopping the re-replication attempts to quiesce the system and prevent new
           volumes from being drawn into the outage
         • Isolated the affected availability zone from the EBS control plane to restore normal
           operation in other zones
         • AWS started to recover volumes


    7
RightScale’s Experience
•       ~2am – Monitoring server disks started to die like flies
         •   Tried replacing them with fresh EBS drives – Failed
         •   Tried re-launching them with fresh EBS drives in different Zones - Failed
•       ~3am – Realized EBS wasn’t going to be fixed anytime soon: Plan B time!
         •   Revamped our monitoring ServerTemplate to use ephemeral volumes instead of EBS
         •   Our architecture for monitoring data continually backs up and pulls data on demand from S3
•       ~3:20am – Started re-launching affected monitoring servers
         •   Over the next several hours, ~15% of monitoring servers failed and were easily re-launched
•       ~7am – Amazon announces EBS volume creation is fixed in other zones
•       ~11am – EBS disk failures hit our master database
         •   Chose one of our slaves (from our zone 1d) to be promoted
         •   In a few minutes we had the new master up and running
         •   It took a while to achieve needed performance due to its cold working set




    8
Agenda
• Design for Failure
• What happened in the AWS Outage
• RightScale’s experience
• Infrastructure abstraction and automation as building
  blocks for highly available applications
• Architectural options to protect against cloud failures
• Conclusions / Q&A




    Please use the questions window to ask questions anytime!

    9
What do we mean by Cloud?
• A cloud is a physical data center entity behind an API endpoint

• What do you mean by that?
      •   Amazon Web Services is not a cloud
      •   EC2 is not a cloud
      •   Eucalyptus, Cloud.com are not clouds
      •   EC2 East, EC2 AsiaPacific, my private cloud… are clouds
      •   An availability zone is not a cloud, it’s part of one
      •   Think of a cloud as a “resource pool” accessed via API




 10
Overcoming Multi-Cloud Pain Points
• APIs differ
      • Different sets of resources
      • Different formats, encodings and versions
• Abstractions and features differ
      • Network architectures differ: VLANs, security groups, NAT, IPs, ACLs, …
      • Storage architectures differ: local/attachable disks, backup, snapshots, …
      • Hypervisors, machine images…cost models, billing, reporting…etc.
• They are truly different beasts, with different semantics

• So make sure you:
      • Design using generic concepts yet deploy using cloud specifics
      • Have tools that translate your concepts to cloud-specific ones
      • Think of how to share resources across clouds (i.e. data sharing)

 11
How does RightScale help with Abstraction?
• Unified Multi-Cloud UI and new API (in progress)
      •   Multi-Cloud Servers/Arrays
      •   Multi-Cloud ServerTemplates
      •   Multi-Cloud Images
      •   Others in the pipeline


                                                                             I
                                                 1:N                             I
                        1:1   ServerTemplate                                         I
          Server                                          Image
                                                                             I
                                                                                 I

 runnable abstraction          software config         runtime config   cloud resources




 12
Infrastructure Abstraction & Automation
as Building Blocks of Highly Available Applications

•    Multi-Cloud Dashboard, Architecture & Application Portability
         •   Single pane of glass through UI and API
         •   Allows simplified deployments across multiple regions/clouds
•    Automated Deployments (Provisioning and Configuration Mgmt)
         •   Reproducible Configurations with Change Control - Avoids manual configuration errors
         •   Cost effective – Pay as you go for backup. It’s easy and inexpensive to test fault tolerance.
•    Advanced Server and Deployment Monitoring
         •   Custom monitoring dashboard, custom graphing, cluster graphing
•    Automated Scaling and Operations
         •   Easy to scale up/down, replace failed/unhealthy instances,
             backup data, replicate data, etc
•    Library of Cloud Optimized Solution Stacks
         •   RightImages, RightScripts, and ServerTemplates



    13
Agenda
• Design for Failure
• What happened in the AWS Outage
• RightScale’s experience
• Infrastructure abstraction and automation as building blocks
  for highly available applications
• Architectural options to protect against cloud failures
• Conclusions / Q&A




    Please use the questions window to ask questions anytime!

    14
HA/DR Checklist for Risk Mitigation

 Determine who owns the architecture, DR process and testing.
 Develop expertise in house and / or get outside help.
 Conduct a risk assessment for each application.
 Specify your target Recovery Time Objective
  and Recovery Point Objective.
 Design for failure starting with application architecture.
 Implement HA best practices balancing cost, complexity and risk.
          Automate infrastructure for consistency and reliability.
          Abstract applications for flexibility and portability.
 Document operational processes and automations.
 Test the failover... then test it again.
 Release the Chaos Monkey.


    15
Application Architecture Deployment Options
• Storage Options
      • Local storage, EBS, S3, CloudFiles, Gluster, etc.
• DNS Configuration Options
      • DNS APIs for dynamic configuration (Route53, DynDNS, DNS Made Easy)
• Load Balancing Options
      • HA Proxy or AWS ELB to distribute traffic across multiple instances / AZs
• Server Array Options
      • Create scalable tiers for web and application servers
• Database Options
      • MySQL (EBS with snapshots to S3 or Local Disk with LVM snapshots)
          • Database Manager Features (Automated snapshots, replication, slave promotion, striping)
      • AWS RDS
      • Database Sharding
      • NoSQL Databases
 16
General HA Best Practices
 Avoid single points of failure
 Always place one of each component (load balancers,
  app servers, databases) in at least two AZs
 Maintain sufficient capacity to absorb AZ / cloud failures
 Replicate data across AZs and backup or replicate across
  clouds/regions for failover
 Setup monitoring, alerts and operations to identify and
  automate problem resolution or failover process
 Design stateless applications for resilience to reboot / relaunch




 17
Multi-AZ Example




                                                                                Consider local storage for
                                                                               additional slave database to
                                                                              remove dependency on EBS
                                                                            (Use LVM to snapshot backups)




                                                                    Consider distributed NoSQL databases
      Snapshot EBS volume for                                      with the same distribution considerations.
     backups so the database can    Place Slave databases in one   Spread primary and replica nodes across
      be readily recovered within      or more AZs for failover.   multiple AZs. Place as many as you need
              the region.                                                    for required resiliency.


18
Multi-Cloud Cold / Warm / Hot DR Options

No Downtime                                          Hot HA
                                               (Live/Live Config)
 > 5 Minutes                            Hot DR
                                    (Least Common)
 > 1 Hour                  Warm DR
                        (Recommended)

 > Few Hours      Cold DR
               (Most Common)


                    $          $$        $$$         $$$$



19
Multi-Cloud Cold DR Example
Staged Server Configuration and generally no staged data




  20
Multi-Cloud Warm DR Example
Staged Server Configuration, pre-staged data and running Slave Database Server




 21
Multi-Cloud Hot DR Example
Parallel Deployment with all servers running but all traffic going to primary




  22
Multi-Cloud Hot HA Example
Live/Live configuration. May use Geo-target IP services to direct traffic to regional
load balancers.




  23
Multi-Cloud Hot HA Example
Multi-Cloud looks similar to Multi-AZ… but there are additional problems to solve as
                                                 some resources are not shared
              You need DNS management                                 across clouds
                       or a global load balancer




Images for cloud servers are
 specific to the cloud/region.


                                                      You need to copy or replicate data
                                                   yourself as EBS snapshots are specific
                                                    to the source AZ. Getting the data out
                                                     requires a slave sync or taking LVM
  24                                                 snapshots and transferring the data.
So What’s Best?
• Design for failure
• No one size fits all solution
      • Every application/components has its own architecture
      • Tradeoffs between levels of resiliency and cost
• The options available in the cloud today are unprecedented
      • Capabilities for global redundancy
      • Time to access
      • Investment required
• Follow our High Availability Checklist!




 25
Special Offer –
 High Availability Assessment &
 Design Recommendations
 Become a RightScale customer in May and receive:
          Free High Availability Assessment
          Free Design/Architecture Recommendations
          Half Off Onboarding Fee
 Ready to get started?
 Contact us at sales@rightscale.com or (866) 720-0208


 Learn More –
 RightScale Free Edition       White Paper Library            Webinar Library
  RightScale.com/free       Rightscale.com/whitepapers     RightScale.com/webinars

 RightScale User Conference – June 8 in NYC! Register here: www.rightscale.com/conference
26
Thank You!




27

Weitere ähnliche Inhalte

Was ist angesagt?

(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014Amazon Web Services
 
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)Amazon Web Services
 
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...Amazon Web Services
 
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...Amazon Web Services
 
AWS VPC best practices 2016 by Bogdan Naydenov
AWS VPC best practices 2016 by Bogdan NaydenovAWS VPC best practices 2016 by Bogdan Naydenov
AWS VPC best practices 2016 by Bogdan NaydenovBogdan Naydenov
 
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity OptionsCreating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity OptionsAmazon Web Services
 
Cloud Architecture: Patterns and Best Practices
Cloud Architecture: Patterns and Best PracticesCloud Architecture: Patterns and Best Practices
Cloud Architecture: Patterns and Best PracticesSascha Möllering
 
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...Amazon Web Services
 
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...Harish Ganesan
 
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and Blox
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and BloxNEW LAUNCH! Advanced Task Scheduling with Amazon ECS and Blox
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and BloxAmazon Web Services
 
Amazon Virtual Private Cloud VPC Architecture AWS Web Services
Amazon Virtual Private Cloud VPC Architecture AWS Web ServicesAmazon Virtual Private Cloud VPC Architecture AWS Web Services
Amazon Virtual Private Cloud VPC Architecture AWS Web ServicesRobert Wilson
 
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...Amazon Web Services
 
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep Dive
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep DiveNEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep Dive
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep DiveAmazon Web Services
 
Criando o seu datacenter virtual vpc e conectividade
Criando o seu datacenter virtual  vpc e conectividadeCriando o seu datacenter virtual  vpc e conectividade
Criando o seu datacenter virtual vpc e conectividadeAmazon Web Services LATAM
 
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...Amazon Web Services
 
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...Amazon Web Services
 
Cloud Architecture Tutorial - Platform Component Architecture (2of3)
Cloud Architecture Tutorial - Platform Component Architecture (2of3)Cloud Architecture Tutorial - Platform Component Architecture (2of3)
Cloud Architecture Tutorial - Platform Component Architecture (2of3)Adrian Cockcroft
 
Managing Multi-Cloud and On-Premises with Microsoft Azure
Managing Multi-Cloud and On-Premises with Microsoft AzureManaging Multi-Cloud and On-Premises with Microsoft Azure
Managing Multi-Cloud and On-Premises with Microsoft Azure2nd Watch
 
Continuous Delivery in the AWS Cloud
Continuous Delivery in the AWS CloudContinuous Delivery in the AWS Cloud
Continuous Delivery in the AWS CloudNigel Fernandes
 

Was ist angesagt? (20)

(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
 
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
 
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...
AWS re:Invent 2016: How to Migrate Microsoft Windows Applications to AWS Quic...
 
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...
Netflix Development Patterns for Scale, Performance & Availability (DMG206) |...
 
Deep Dive: Hybrid Architectures
Deep Dive: Hybrid ArchitecturesDeep Dive: Hybrid Architectures
Deep Dive: Hybrid Architectures
 
AWS VPC best practices 2016 by Bogdan Naydenov
AWS VPC best practices 2016 by Bogdan NaydenovAWS VPC best practices 2016 by Bogdan Naydenov
AWS VPC best practices 2016 by Bogdan Naydenov
 
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity OptionsCreating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
 
Cloud Architecture: Patterns and Best Practices
Cloud Architecture: Patterns and Best PracticesCloud Architecture: Patterns and Best Practices
Cloud Architecture: Patterns and Best Practices
 
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
AWS re:Invent 2016: Optimizing Network Performance for Amazon EC2 Instances (...
 
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...
Cloud Developer Conference May 2011 SiliconIndia : Design for Failure - High ...
 
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and Blox
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and BloxNEW LAUNCH! Advanced Task Scheduling with Amazon ECS and Blox
NEW LAUNCH! Advanced Task Scheduling with Amazon ECS and Blox
 
Amazon Virtual Private Cloud VPC Architecture AWS Web Services
Amazon Virtual Private Cloud VPC Architecture AWS Web ServicesAmazon Virtual Private Cloud VPC Architecture AWS Web Services
Amazon Virtual Private Cloud VPC Architecture AWS Web Services
 
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...
AWS re:Invent 2016: Mitigating DDoS Attacks on AWS: Five Vectors and Four Use...
 
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep Dive
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep DiveNEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep Dive
NEW LAUNCH IPv6 in the Cloud: Virtual Private Cloud Deep Dive
 
Criando o seu datacenter virtual vpc e conectividade
Criando o seu datacenter virtual  vpc e conectividadeCriando o seu datacenter virtual  vpc e conectividade
Criando o seu datacenter virtual vpc e conectividade
 
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...
(ENT205) AWS and VMware: How to Architect and Manage Hybrid Environments | AW...
 
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...
AWS for Start-ups - Architectural Best Practices & Automating Your Infrastruc...
 
Cloud Architecture Tutorial - Platform Component Architecture (2of3)
Cloud Architecture Tutorial - Platform Component Architecture (2of3)Cloud Architecture Tutorial - Platform Component Architecture (2of3)
Cloud Architecture Tutorial - Platform Component Architecture (2of3)
 
Managing Multi-Cloud and On-Premises with Microsoft Azure
Managing Multi-Cloud and On-Premises with Microsoft AzureManaging Multi-Cloud and On-Premises with Microsoft Azure
Managing Multi-Cloud and On-Premises with Microsoft Azure
 
Continuous Delivery in the AWS Cloud
Continuous Delivery in the AWS CloudContinuous Delivery in the AWS Cloud
Continuous Delivery in the AWS Cloud
 

Andere mochten auch

Aws multi-region High Availability
Aws multi-region High Availability Aws multi-region High Availability
Aws multi-region High Availability Adam Book
 
High Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud ComputingHigh Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud ComputingBob Rhubart
 
Multi-master, multi-region MySQL deployment in Amazon AWS
Multi-master, multi-region MySQL deployment in Amazon AWSMulti-master, multi-region MySQL deployment in Amazon AWS
Multi-master, multi-region MySQL deployment in Amazon AWSContinuent
 
Using MySQL Fabric for High Availability and Scaling Out
Using MySQL Fabric for High Availability and Scaling OutUsing MySQL Fabric for High Availability and Scaling Out
Using MySQL Fabric for High Availability and Scaling OutOSSCube
 
Architecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionArchitecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionNguyen Tung
 
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...Amazon Web Services
 
Webinar: snom UC edition Phones and Native Lync Server Provisioning
Webinar: snom UC edition Phones and Native Lync Server ProvisioningWebinar: snom UC edition Phones and Native Lync Server Provisioning
Webinar: snom UC edition Phones and Native Lync Server Provisioningsnom technology
 
Choosing a Polycom - Microsoft Lync Phone
Choosing a Polycom - Microsoft Lync PhoneChoosing a Polycom - Microsoft Lync Phone
Choosing a Polycom - Microsoft Lync PhoneGraham Walsh
 
20120426 high availability MySQL
20120426 high availability MySQL20120426 high availability MySQL
20120426 high availability MySQLJui-Nan Lin
 
MySQL Security Best Practises
MySQL Security Best PractisesMySQL Security Best Practises
MySQL Security Best PractisesMark Swarbrick
 
AWS Summit Nordics - Architecting for High Availability
AWS Summit Nordics - Architecting for High AvailabilityAWS Summit Nordics - Architecting for High Availability
AWS Summit Nordics - Architecting for High AvailabilityAmazon Web Services
 
AWS Summit 2013 | Singapore - Architecting for High Availability
AWS Summit 2013 | Singapore - Architecting for High AvailabilityAWS Summit 2013 | Singapore - Architecting for High Availability
AWS Summit 2013 | Singapore - Architecting for High AvailabilityAmazon Web Services
 
MySQL Group Replication
MySQL Group ReplicationMySQL Group Replication
MySQL Group ReplicationBogdan Kecman
 
Designing and Building Multi-Region Swift Deployment
Designing and Building Multi-Region Swift DeploymentDesigning and Building Multi-Region Swift Deployment
Designing and Building Multi-Region Swift DeploymentSiheon Kim
 
ARC202 Architecting for High Availability - AWS re: Invent 2012
ARC202 Architecting for High Availability - AWS re: Invent 2012ARC202 Architecting for High Availability - AWS re: Invent 2012
ARC202 Architecting for High Availability - AWS re: Invent 2012Amazon Web Services
 
MySQL 5.6 Replication Webinar
MySQL 5.6 Replication WebinarMySQL 5.6 Replication Webinar
MySQL 5.6 Replication WebinarMark Swarbrick
 
Hhm 3479 mq clustering and shared queues for high availability
Hhm 3479 mq clustering and shared queues for high availabilityHhm 3479 mq clustering and shared queues for high availability
Hhm 3479 mq clustering and shared queues for high availabilityPete Siddall
 

Andere mochten auch (20)

Aws multi-region High Availability
Aws multi-region High Availability Aws multi-region High Availability
Aws multi-region High Availability
 
High Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud ComputingHigh Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud Computing
 
Multi-master, multi-region MySQL deployment in Amazon AWS
Multi-master, multi-region MySQL deployment in Amazon AWSMulti-master, multi-region MySQL deployment in Amazon AWS
Multi-master, multi-region MySQL deployment in Amazon AWS
 
How to Design for High Availability & Scale with AWS
How to Design for High Availability & Scale with AWSHow to Design for High Availability & Scale with AWS
How to Design for High Availability & Scale with AWS
 
Using MySQL Fabric for High Availability and Scaling Out
Using MySQL Fabric for High Availability and Scaling OutUsing MySQL Fabric for High Availability and Scaling Out
Using MySQL Fabric for High Availability and Scaling Out
 
Architecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionArchitecture Patterns - Open Discussion
Architecture Patterns - Open Discussion
 
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...
(SDD408) Amazon Route 53 Deep Dive: Delivering Resiliency, Minimizing Latency...
 
Webinar: snom UC edition Phones and Native Lync Server Provisioning
Webinar: snom UC edition Phones and Native Lync Server ProvisioningWebinar: snom UC edition Phones and Native Lync Server Provisioning
Webinar: snom UC edition Phones and Native Lync Server Provisioning
 
Choosing a Polycom - Microsoft Lync Phone
Choosing a Polycom - Microsoft Lync PhoneChoosing a Polycom - Microsoft Lync Phone
Choosing a Polycom - Microsoft Lync Phone
 
20120426 high availability MySQL
20120426 high availability MySQL20120426 high availability MySQL
20120426 high availability MySQL
 
MySQL Fabric
MySQL FabricMySQL Fabric
MySQL Fabric
 
MySQL HA
MySQL HAMySQL HA
MySQL HA
 
MySQL Security Best Practises
MySQL Security Best PractisesMySQL Security Best Practises
MySQL Security Best Practises
 
AWS Summit Nordics - Architecting for High Availability
AWS Summit Nordics - Architecting for High AvailabilityAWS Summit Nordics - Architecting for High Availability
AWS Summit Nordics - Architecting for High Availability
 
AWS Summit 2013 | Singapore - Architecting for High Availability
AWS Summit 2013 | Singapore - Architecting for High AvailabilityAWS Summit 2013 | Singapore - Architecting for High Availability
AWS Summit 2013 | Singapore - Architecting for High Availability
 
MySQL Group Replication
MySQL Group ReplicationMySQL Group Replication
MySQL Group Replication
 
Designing and Building Multi-Region Swift Deployment
Designing and Building Multi-Region Swift DeploymentDesigning and Building Multi-Region Swift Deployment
Designing and Building Multi-Region Swift Deployment
 
ARC202 Architecting for High Availability - AWS re: Invent 2012
ARC202 Architecting for High Availability - AWS re: Invent 2012ARC202 Architecting for High Availability - AWS re: Invent 2012
ARC202 Architecting for High Availability - AWS re: Invent 2012
 
MySQL 5.6 Replication Webinar
MySQL 5.6 Replication WebinarMySQL 5.6 Replication Webinar
MySQL 5.6 Replication Webinar
 
Hhm 3479 mq clustering and shared queues for high availability
Hhm 3479 mq clustering and shared queues for high availabilityHhm 3479 mq clustering and shared queues for high availability
Hhm 3479 mq clustering and shared queues for high availability
 

Ähnlich wie High Availability in the Cloud - Architectural Best Practices

Cloud Immortality - Architecting for High Availability & Disaster Recovery
Cloud Immortality - Architecting for High Availability & Disaster RecoveryCloud Immortality - Architecting for High Availability & Disaster Recovery
Cloud Immortality - Architecting for High Availability & Disaster RecoveryRightScale
 
Running Siebel on AWS - Oracle Open World 13
Running Siebel on AWS - Oracle Open World 13Running Siebel on AWS - Oracle Open World 13
Running Siebel on AWS - Oracle Open World 13Milind Waikul
 
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITAsk The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITRightScale
 
enlight cloud
enlight cloudenlight cloud
enlight cloudIsha687
 
Web App Security -Pradeep K.pptx
Web App Security -Pradeep K.pptxWeb App Security -Pradeep K.pptx
Web App Security -Pradeep K.pptxPradeepK344324
 
Breaking the Monolith Road to Containers
Breaking the Monolith Road to ContainersBreaking the Monolith Road to Containers
Breaking the Monolith Road to ContainersAmazon Web Services
 
NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013aspyker
 
Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Tim Mackey
 
Building a highly scalable and available cloud application
Building a highly scalable and available cloud applicationBuilding a highly scalable and available cloud application
Building a highly scalable and available cloud applicationNoam Sheffer
 
AWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAmazon Web Services
 
Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Amazon Web Services
 
Cloudstack for beginners
Cloudstack for beginnersCloudstack for beginners
Cloudstack for beginnersJoseph Amirani
 
Oracle COTS Applications on AWS
Oracle COTS Applications on AWSOracle COTS Applications on AWS
Oracle COTS Applications on AWSTom Laszewski
 
Managed Cloud Services for Siebel CRM on Amazon AWS
Managed Cloud Services for Siebel CRM on Amazon AWSManaged Cloud Services for Siebel CRM on Amazon AWS
Managed Cloud Services for Siebel CRM on Amazon AWSMilind Waikul
 
Configuration Management in the Cloud | AWS Public Sector Summit 2017
Configuration Management in the Cloud | AWS Public Sector Summit 2017Configuration Management in the Cloud | AWS Public Sector Summit 2017
Configuration Management in the Cloud | AWS Public Sector Summit 2017Amazon Web Services
 
Patterns for building resilient and scalable microservices platform on AWS
Patterns for building resilient and scalable microservices platform on AWSPatterns for building resilient and scalable microservices platform on AWS
Patterns for building resilient and scalable microservices platform on AWSBoyan Dimitrov
 
Satrtup Bootcamp - Scale on AWS
Satrtup Bootcamp - Scale on AWSSatrtup Bootcamp - Scale on AWS
Satrtup Bootcamp - Scale on AWSIdan Tohami
 
DPC 2016 - 53 Minutes or Less - Architecting For Failure
DPC 2016 - 53 Minutes or Less - Architecting For FailureDPC 2016 - 53 Minutes or Less - Architecting For Failure
DPC 2016 - 53 Minutes or Less - Architecting For Failurebenwaine
 

Ähnlich wie High Availability in the Cloud - Architectural Best Practices (20)

Cloud Immortality - Architecting for High Availability & Disaster Recovery
Cloud Immortality - Architecting for High Availability & Disaster RecoveryCloud Immortality - Architecting for High Availability & Disaster Recovery
Cloud Immortality - Architecting for High Availability & Disaster Recovery
 
Running Siebel on AWS - Oracle Open World 13
Running Siebel on AWS - Oracle Open World 13Running Siebel on AWS - Oracle Open World 13
Running Siebel on AWS - Oracle Open World 13
 
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITAsk The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
 
Azure basics
Azure basicsAzure basics
Azure basics
 
Cloud stack for_beginners
Cloud stack for_beginnersCloud stack for_beginners
Cloud stack for_beginners
 
enlight cloud
enlight cloudenlight cloud
enlight cloud
 
Web App Security -Pradeep K.pptx
Web App Security -Pradeep K.pptxWeb App Security -Pradeep K.pptx
Web App Security -Pradeep K.pptx
 
Breaking the Monolith Road to Containers
Breaking the Monolith Road to ContainersBreaking the Monolith Road to Containers
Breaking the Monolith Road to Containers
 
NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013
 
Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014
 
Building a highly scalable and available cloud application
Building a highly scalable and available cloud applicationBuilding a highly scalable and available cloud application
Building a highly scalable and available cloud application
 
AWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWS
 
Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?
 
Cloudstack for beginners
Cloudstack for beginnersCloudstack for beginners
Cloudstack for beginners
 
Oracle COTS Applications on AWS
Oracle COTS Applications on AWSOracle COTS Applications on AWS
Oracle COTS Applications on AWS
 
Managed Cloud Services for Siebel CRM on Amazon AWS
Managed Cloud Services for Siebel CRM on Amazon AWSManaged Cloud Services for Siebel CRM on Amazon AWS
Managed Cloud Services for Siebel CRM on Amazon AWS
 
Configuration Management in the Cloud | AWS Public Sector Summit 2017
Configuration Management in the Cloud | AWS Public Sector Summit 2017Configuration Management in the Cloud | AWS Public Sector Summit 2017
Configuration Management in the Cloud | AWS Public Sector Summit 2017
 
Patterns for building resilient and scalable microservices platform on AWS
Patterns for building resilient and scalable microservices platform on AWSPatterns for building resilient and scalable microservices platform on AWS
Patterns for building resilient and scalable microservices platform on AWS
 
Satrtup Bootcamp - Scale on AWS
Satrtup Bootcamp - Scale on AWSSatrtup Bootcamp - Scale on AWS
Satrtup Bootcamp - Scale on AWS
 
DPC 2016 - 53 Minutes or Less - Architecting For Failure
DPC 2016 - 53 Minutes or Less - Architecting For FailureDPC 2016 - 53 Minutes or Less - Architecting For Failure
DPC 2016 - 53 Minutes or Less - Architecting For Failure
 

Mehr von RightScale

10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT GovernanceRightScale
 
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsKubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsRightScale
 
Optimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleOptimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleRightScale
 
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowPrepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowRightScale
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseHow to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseRightScale
 
Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)RightScale
 
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMComparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMRightScale
 
How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaRightScale
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...RightScale
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsRightScale
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceRightScale
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreRightScale
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesRightScale
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage CostsRightScale
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMRightScale
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessRightScale
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMRightScale
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud ReportRightScale
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsRightScale
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaRightScale
 

Mehr von RightScale (20)

10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance
 
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsKubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
 
Optimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleOptimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScale
 
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowPrepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseHow to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your Enterprise
 
Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)
 
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMComparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
 
How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale Optima
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider Tools
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and Compliance
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for Enterprises
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBM
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP Helps
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale Optima
 

Kürzlich hochgeladen

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Kürzlich hochgeladen (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

High Availability in the Cloud - Architectural Best Practices

  • 1. High Availability in the Cloud – Architecting Best Practices Watch the video of this webinar May 6, 2011 1
  • 2. Your Panel Today Presenting: • Michael Crandell, CEO Twitter: @michaelcrandell • Josep Blanquer, Sr. Systems Architect • Brian Adler, Professional Services Architect Q&A: • Jason Altobelli, Account Manager Please use the questions window to ask questions anytime! 2
  • 3. Agenda • Design for Failure • What happened in the AWS Outage • RightScale’s experience • Infrastructure abstraction and automation as building blocks for highly available applications • Architectural options to protect against cloud failures • Conclusions / Q&A Please use the questions window to ask questions anytime! 3
  • 4. Terminology • Fault Tolerance • Fault tolerance leverages redundancy and replication to enable systems to continue operating properly if one or more components fails • High Availability • Fault Tolerant systems are measured by their Availability in terms of planned and unplanned service outages for end users • 99% Availability = 3.65 days of downtime per year • 99.5% Availability = 1.83 days of downtime per year • 99.9% Availability = 8.76 hours of downtime per year • 99.99% Availability = 53 minutes of downtime per year • 99.999% Availability = 5.26 minutes of downtime per year • Disaster Recovery • The process, policies and procedures related to restoring critical systems after a catastrophic event 4
  • 5. Design for Failure • Large scale failures in the cloud are rare but happen • Application owners are ultimately responsible for availability and recoverability • Balance cost and complexity of HA efforts against risk you’re willing to bear • Fortunately, cloud infrastructure has made DR and HA remarkably affordable versus past options • Multi-server • Multi-AZ • Multi-region • Multi-cloud 5
  • 6. Assessing Risk vs Cost in Aviation Fatalities per 1M Hours Flown 20 Fatalities General Aviation 10 Fatalities Commuter Airline Large Air 4 Fatalities Carrier $ $$ $$$ Cost of Service 6
  • 7. What Happened in the April 21 AWS Outage? • Triggered by operator error during a router upgrade which funneled high- volume network traffic into a low-bandwidth control network used by EBS • Flooding of the control network caused a large number of EBS servers to be effectively isolated from one another, which broke volume replication, and caused these servers to start re-replicating the data to fresh servers • This large-scale re-replication storm in turn had two effects: • It failed in many cases causing the volumes to go offline for manual intervention • It flooded the EBS control plane with re-replication events that affected its operation across the entire us-east region • Steps taken by AWS: • Stopping the re-replication attempts to quiesce the system and prevent new volumes from being drawn into the outage • Isolated the affected availability zone from the EBS control plane to restore normal operation in other zones • AWS started to recover volumes 7
  • 8. RightScale’s Experience • ~2am – Monitoring server disks started to die like flies • Tried replacing them with fresh EBS drives – Failed • Tried re-launching them with fresh EBS drives in different Zones - Failed • ~3am – Realized EBS wasn’t going to be fixed anytime soon: Plan B time! • Revamped our monitoring ServerTemplate to use ephemeral volumes instead of EBS • Our architecture for monitoring data continually backs up and pulls data on demand from S3 • ~3:20am – Started re-launching affected monitoring servers • Over the next several hours, ~15% of monitoring servers failed and were easily re-launched • ~7am – Amazon announces EBS volume creation is fixed in other zones • ~11am – EBS disk failures hit our master database • Chose one of our slaves (from our zone 1d) to be promoted • In a few minutes we had the new master up and running • It took a while to achieve needed performance due to its cold working set 8
  • 9. Agenda • Design for Failure • What happened in the AWS Outage • RightScale’s experience • Infrastructure abstraction and automation as building blocks for highly available applications • Architectural options to protect against cloud failures • Conclusions / Q&A Please use the questions window to ask questions anytime! 9
  • 10. What do we mean by Cloud? • A cloud is a physical data center entity behind an API endpoint • What do you mean by that? • Amazon Web Services is not a cloud • EC2 is not a cloud • Eucalyptus, Cloud.com are not clouds • EC2 East, EC2 AsiaPacific, my private cloud… are clouds • An availability zone is not a cloud, it’s part of one • Think of a cloud as a “resource pool” accessed via API 10
  • 11. Overcoming Multi-Cloud Pain Points • APIs differ • Different sets of resources • Different formats, encodings and versions • Abstractions and features differ • Network architectures differ: VLANs, security groups, NAT, IPs, ACLs, … • Storage architectures differ: local/attachable disks, backup, snapshots, … • Hypervisors, machine images…cost models, billing, reporting…etc. • They are truly different beasts, with different semantics • So make sure you: • Design using generic concepts yet deploy using cloud specifics • Have tools that translate your concepts to cloud-specific ones • Think of how to share resources across clouds (i.e. data sharing) 11
  • 12. How does RightScale help with Abstraction? • Unified Multi-Cloud UI and new API (in progress) • Multi-Cloud Servers/Arrays • Multi-Cloud ServerTemplates • Multi-Cloud Images • Others in the pipeline I 1:N I 1:1 ServerTemplate I Server Image I I runnable abstraction software config runtime config cloud resources 12
  • 13. Infrastructure Abstraction & Automation as Building Blocks of Highly Available Applications • Multi-Cloud Dashboard, Architecture & Application Portability • Single pane of glass through UI and API • Allows simplified deployments across multiple regions/clouds • Automated Deployments (Provisioning and Configuration Mgmt) • Reproducible Configurations with Change Control - Avoids manual configuration errors • Cost effective – Pay as you go for backup. It’s easy and inexpensive to test fault tolerance. • Advanced Server and Deployment Monitoring • Custom monitoring dashboard, custom graphing, cluster graphing • Automated Scaling and Operations • Easy to scale up/down, replace failed/unhealthy instances, backup data, replicate data, etc • Library of Cloud Optimized Solution Stacks • RightImages, RightScripts, and ServerTemplates 13
  • 14. Agenda • Design for Failure • What happened in the AWS Outage • RightScale’s experience • Infrastructure abstraction and automation as building blocks for highly available applications • Architectural options to protect against cloud failures • Conclusions / Q&A Please use the questions window to ask questions anytime! 14
  • 15. HA/DR Checklist for Risk Mitigation  Determine who owns the architecture, DR process and testing.  Develop expertise in house and / or get outside help.  Conduct a risk assessment for each application.  Specify your target Recovery Time Objective and Recovery Point Objective.  Design for failure starting with application architecture.  Implement HA best practices balancing cost, complexity and risk.  Automate infrastructure for consistency and reliability.  Abstract applications for flexibility and portability.  Document operational processes and automations.  Test the failover... then test it again.  Release the Chaos Monkey. 15
  • 16. Application Architecture Deployment Options • Storage Options • Local storage, EBS, S3, CloudFiles, Gluster, etc. • DNS Configuration Options • DNS APIs for dynamic configuration (Route53, DynDNS, DNS Made Easy) • Load Balancing Options • HA Proxy or AWS ELB to distribute traffic across multiple instances / AZs • Server Array Options • Create scalable tiers for web and application servers • Database Options • MySQL (EBS with snapshots to S3 or Local Disk with LVM snapshots) • Database Manager Features (Automated snapshots, replication, slave promotion, striping) • AWS RDS • Database Sharding • NoSQL Databases 16
  • 17. General HA Best Practices  Avoid single points of failure  Always place one of each component (load balancers, app servers, databases) in at least two AZs  Maintain sufficient capacity to absorb AZ / cloud failures  Replicate data across AZs and backup or replicate across clouds/regions for failover  Setup monitoring, alerts and operations to identify and automate problem resolution or failover process  Design stateless applications for resilience to reboot / relaunch 17
  • 18. Multi-AZ Example Consider local storage for additional slave database to remove dependency on EBS (Use LVM to snapshot backups) Consider distributed NoSQL databases Snapshot EBS volume for with the same distribution considerations. backups so the database can Place Slave databases in one Spread primary and replica nodes across be readily recovered within or more AZs for failover. multiple AZs. Place as many as you need the region. for required resiliency. 18
  • 19. Multi-Cloud Cold / Warm / Hot DR Options No Downtime Hot HA (Live/Live Config) > 5 Minutes Hot DR (Least Common) > 1 Hour Warm DR (Recommended) > Few Hours Cold DR (Most Common) $ $$ $$$ $$$$ 19
  • 20. Multi-Cloud Cold DR Example Staged Server Configuration and generally no staged data 20
  • 21. Multi-Cloud Warm DR Example Staged Server Configuration, pre-staged data and running Slave Database Server 21
  • 22. Multi-Cloud Hot DR Example Parallel Deployment with all servers running but all traffic going to primary 22
  • 23. Multi-Cloud Hot HA Example Live/Live configuration. May use Geo-target IP services to direct traffic to regional load balancers. 23
  • 24. Multi-Cloud Hot HA Example Multi-Cloud looks similar to Multi-AZ… but there are additional problems to solve as some resources are not shared You need DNS management across clouds or a global load balancer Images for cloud servers are specific to the cloud/region. You need to copy or replicate data yourself as EBS snapshots are specific to the source AZ. Getting the data out requires a slave sync or taking LVM 24 snapshots and transferring the data.
  • 25. So What’s Best? • Design for failure • No one size fits all solution • Every application/components has its own architecture • Tradeoffs between levels of resiliency and cost • The options available in the cloud today are unprecedented • Capabilities for global redundancy • Time to access • Investment required • Follow our High Availability Checklist! 25
  • 26. Special Offer – High Availability Assessment & Design Recommendations Become a RightScale customer in May and receive: Free High Availability Assessment Free Design/Architecture Recommendations Half Off Onboarding Fee Ready to get started? Contact us at sales@rightscale.com or (866) 720-0208 Learn More – RightScale Free Edition White Paper Library Webinar Library RightScale.com/free Rightscale.com/whitepapers RightScale.com/webinars RightScale User Conference – June 8 in NYC! Register here: www.rightscale.com/conference 26

Hinweis der Redaktion

  1. RightScale already provides several abstractions that are cloud-agnostic. In fact you’re already using probably all of them (despite you might only be deployed in 1 cloud)..We have the concept of a server (something that can be launched/running on any cloud)The concept of a ServerTemplate, which specifies the configuration we want on a serverAnd the concept of an MCI which specifies which image configuration we want (lower-level stuff)And all these things are RS concepts…the cloud is not really involved in all this…
  2. Cold DR(Most common... hours) Staged Server Configuration and generally no staged data. Bring up the servers and load the data to failover. Cold DR failover is typically manual.Warm DR(Recommended... >hour) Staged Server Configuration, pre-staged data and running Database Slave Server. Warm DR failover is typically manual but can be automated.Hot DR(Least common... but needed if <5 min) Parallel Deployment with all servers running but all traffic going to primary. Hot DR failover is normally automated.Hot HALive/Live configuration. May use Geo-target IP services to direct traffic to regional load balancers. Failover to other region if one has problems. Hot HA is normally seamlessly automated.
  3. Differences to clarify from Multi-AZ…Don’t be fooled by similarity of appearance…Some resources are not shared across clouds (images, Elastic IPs and Snapshots)…These are problems you have to solve… Images we have made easyElastic IPsyou need a global load balancer somewhere… or use DNSHave to copy the data yourself for snapshots. You can have a replicating slave in the cloud…Or backups to a common place like S3 (constraint is that EBS snapshots are only good in the same AZ)To get a snapshot multi- cloud portable you 1) slave sync or 2) take LVM snapshots and upload the data somewhere