Resiliency Vs. High
Availability
High Availability is usually achieved through a redundant set-up of each
component so that any single point of failure is avoided. Special
attention needs to be paid to the X-shaped interconnection of components
A and B, so that any single failure of component A or B can be
bypassed without loss of functionality. High Availability roughly doubles
TCO compared to non-HA systems. A Resiliency set-up avoids doubling
TCO but needs more investment in built-in error recovery
mechanisms, see text.
The second image is for “Resiliency”, which is the ability to recover from
temporary failures, for example through explicit error handling and error
correction. As before, in the 99% availability case only a small
number of steps will fail on average when performing a business
scenario. On average you’d pass 495 “Things” successfully and only 5
will go wrong.
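The arithmetic and the recovery idea above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the function names, the retry count, and the figure of 500 steps (495 + 5) are assumptions made for the example.

```python
import time


def expected_failures(steps: int, availability: float) -> float:
    """Average number of failing steps when each step succeeds
    with probability `availability`."""
    return steps * (1.0 - availability)


def call_with_retry(operation, attempts: int = 3, delay_s: float = 0.0):
    """Run `operation`; on an exception, try again, up to `attempts` tries in total.

    Hypothetical helper: a real system would retry only on errors it
    knows to be temporary and would usually back off between tries.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(attempts):
        try:
            return operation()
        except Exception as error:  # broad on purpose, for the sketch only
            last_error = error
            if attempt < attempts - 1:
                time.sleep(delay_s)
    raise last_error
```

With 500 steps at 99% availability, `expected_failures(500, 0.99)` comes out at about 5. A resilient set-up accepts those failures and relies on something like `call_with_retry` to recover from the temporary ones.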
Resilience: It is the ability to recover quickly. That is, if Site 1 goes down, Site 2
immediately becomes operational. Or if a disk drive fails, a spare disk drive
is quickly added to the storage pool. System resilience includes eliminating single
points of failure in the design of critical systems.
Quality of Service (QoS): It is a technology that enables specified services to
receive a higher quality of service than other specified services. Therefore, service
providers need to determine which service has the highest priority among the
services they provide to their customers. For example, Voice over Internet Protocol
(VoIP) systems typically are prioritized to ensure sufficient network bandwidth is
always available to avoid any traffic delay or degradation of voice quality. Other
services (such as web browsing) will be prioritized at a lower level. Why? Because
they are less sensitive to delays. Whether ISPs may provide a higher quality of
service to a specified set of customers or for a specified service on the
internet depends on the net neutrality rules in force.
High Availability: It is about having multiple redundant systems that
avoid downtime or degradation when a single failure occurs. High
availability is usually implemented in cluster systems, and it has
two modes: 1- Active-active mode: both systems are running and
immediately available. 2- Active-passive mode: one system is active, while
the other is on standby but can become active, usually within a matter
of seconds.
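The two modes can be contrasted with a small Python sketch. It assumes a hypothetical `Node` record with a `healthy` flag that some external health check keeps up to date. The names are illustrative and do not come from any particular clustering product.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    name: str
    healthy: bool = True


def pick_active_passive(primary: Node, standby: Node) -> Node:
    """Active-passive: serve from the primary; use the standby only
    when the primary is down."""
    if primary.healthy:
        return primary
    if standby.healthy:
        return standby
    raise RuntimeError("no healthy node available")


def pick_active_active(nodes: List[Node], request_id: int) -> Node:
    """Active-active: spread requests over every node that is healthy."""
    healthy = [node for node in nodes if node.healthy]
    if not healthy:
        raise RuntimeError("no healthy node available")
    return healthy[request_id % len(healthy)]
```

In the active-passive case the standby does no work until failover. In the active-active case a failed node simply drops out of the rotation and the remaining nodes absorb its share of requests.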
Fault Tolerance: It is the ability of a system to suffer a fault but
continue to operate. How can the system have this capability? By
adding redundant components such as additional disks within a
redundant array of inexpensive disks (RAID), multiple power
supplies, multiple network interface cards (NICs), or additional servers within
a failover cluster configuration.
Resiliency is not the same thing as high availability. Resiliency is the network’s ability
to handle failures. This includes HA but also factors like rate-limiting,
security, management, and monitoring.
Network-Level Resiliency
Network-level resiliency includes redundancy in the topology (including the physical topology)
and control plane resiliency. This means using the hardware for failure detection,
prevention, and recovery, for example through stacking, multiple links, and so on.
This is where to use a Defence in Depth approach, which means using several layers of
resilience. As an example, you may have many ECMP routed links. You may also
enable UDLD (Unidirectional Link Detection) on the links to detect layer-1 failures such as unidirectional links.
Use a modular design in the control plane. One example of this is to use route
summarization. Throttling can prevent overwhelming the control plane. The goal is to
isolate failures to a single area.
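Throttling of this kind is often explained with a token bucket. The Python sketch below shows that generic technique only; it is an assumption for illustration and not how any specific device implements control plane protection. Time is passed in explicitly so the behaviour is easy to follow.

```python
class TokenBucket:
    """Allow at most `burst` events at once and `rate_per_s` events
    per second on average; anything beyond that is dropped."""

    def __init__(self, rate_per_s: float, burst: int):
        self.rate_per_s = rate_per_s
        self.capacity = float(burst)
        self.tokens = float(burst)  # start with a full bucket
        self.last_time = 0.0

    def allow(self, now: float) -> bool:
        # Refill according to the time elapsed since the last call.
        elapsed = max(0.0, now - self.last_time)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate_per_s)
        self.last_time = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```

With `TokenBucket(rate_per_s=1.0, burst=2)`, two events at time 0 are allowed, a third at time 0 is dropped, and one more is allowed a second later. The excess is discarded before it can overwhelm whatever sits behind the bucket.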
System-Level Resiliency
This is about providing resiliency at the device level. This includes dual power
supplies, dual supervisors, SSO/NSF, and so on.
It also includes software resilience, including security features and
control plane hardening. Overlooking this can result in high CPU load,
TCAM starvation, and similar problems.
Consider using Control Plane Policing (CoPP), limiting flooding, and
hardening spanning-tree. Also consider using QoS and Storm Control to
prevent overwhelming the data plane.
Operational Resiliency
This is about how you manage the network. In particular, think about
change management and change windows.
Software updates also fall into this category. Some platforms support
ISSU (In-Service Software Upgrade) or similar for non-disruptive updates.
Availability can simply be understood as system
uptime, i.e., the percentage of time the storage
system is available and operational, allowing
data to be accessed. Highly available systems are
designed to minimize downtime and avoid loss of
service. All organizations expect to achieve high
availability for their applications and business
services. This is not achieved by a single IT
component alone. High availability depends on
many IT infrastructure components, including the
storage hardware and software, working in concert
as expected and minimizing downtime by quickly
restoring essential services in the event of a
failure.
Availability is typically expressed as a number of
9s.
1 nine = 90% availability, 2 nines = 99%
availability, 3 nines = 99.9% availability, 4
nines = 99.99% availability, and so on. The
converse of availability is downtime. So, if a
storage system has an annual SLA of 7 nines
availability (99.99999%), it would suffer just 3.15
seconds of downtime in a year. You need to fully
understand your business requirements and the costs
involved to be able to determine and set your
availability goals. Service providers, too, offer
availability SLAs as part of their contracts.
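The relationship between the number of 9s and annual downtime can be checked with a short Python calculation. This is a sketch under one stated assumption: a year is taken as 365 days (31,536,000 seconds). The function names are made up for the example.

```python
SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # 31,536,000; leap years ignored


def availability_from_nines(nines: int) -> float:
    """Availability as a fraction, e.g. 3 nines is about 0.999."""
    return 1.0 - 10.0 ** (-nines)


def annual_downtime_seconds(nines: int) -> float:
    """Downtime per year allowed by an SLA with the given number of 9s."""
    return SECONDS_PER_YEAR * 10.0 ** (-nines)
```

For 7 nines this gives roughly 3.15 seconds per year, matching the figure above. For 2 nines it gives about 315,360 seconds, or roughly 3.65 days.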
To improve availability, organizations generally
use replication techniques that create redundant
Resiliency describes the ability of a storage
system to self-heal, recover, and continue
operating after encountering failure, outage,
security incidents, etc. High resiliency doesn’t
mean there is high data availability. It just means
that the storage infrastructure is well enough equipped
to overcome disruptions. Resiliency is not a
standalone metric; it spans business continuity,
incident response, and recovery techniques to
reduce the magnitude and duration of disruptive
events.
Resiliency of a storage system can be improved
through redundancy and failover and by building in
software-defined intelligence to automatically
detect issues and self-heal in a short span of time.
Fault tolerance is similar to the concept of
availability, but it goes one step further
to guarantee zero downtime. While a highly
available storage system may have minimal
interruption, a fault-tolerant system will have no
service interruption. Because of its more complex design,
a fault-tolerant system is typically quite
expensive to maintain: it involves running
active-active copies of data all the time, with the
necessary automation to fail over whenever any
component of the storage system fails and would
otherwise cause downtime. This failover is
non-disruptive, so that applications and data
access are not impacted at all and the business
continues to function as expected.
Durability refers to the continued
persistence of data. Businesses will have
long-term data retention goals. This is
achieved by improving durability of the data
and the storage infrastructure preserving it.
Especially in the context of object
storage where data is archived and preserved
for longer terms, it is important to achieve
higher durability. A high level of durability
ensures that the data does not suffer from
bit rot, degradation, or any form of
corruption or data loss.
Reliability is typically associated with the
infrastructure storing the data. It refers to the
probability that the storage system will work as
expected. A storage system may be available for a
certain period of time, but it may not work as
expected. In that case, the reliability will be
low. Various factors contribute to increasing
reliability of a system. It’s not easy to measure
reliability. One common metric that is used to
indicate reliability is mean time between failures
(MTBF). MTBF is the predicted elapsed time between
inherent failures of a storage system during normal
operations. If MTBF is high, it is an indicator
that reliability is high.
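As a rough illustration, an observed MTBF can be estimated by dividing total operating time by the number of failures in that period. The Python sketch below uses this common estimate; the function names and the sample figures are hypothetical.

```python
HOURS_PER_YEAR = 365 * 24  # 8,760; leap years ignored


def mtbf_hours(operating_hours: float, failures: int) -> float:
    """Observed mean time between failures over a period of operation."""
    if failures <= 0:
        raise ValueError("need at least one failure to estimate MTBF")
    return operating_hours / failures


def expected_failures_per_year(mtbf: float) -> float:
    """Average number of failures per year implied by a given MTBF in hours."""
    return HOURS_PER_YEAR / mtbf
```

A system that failed 4 times in one year of operation has an MTBF of 2,190 hours. Doubling the MTBF to 4,380 hours halves the expected failures to 2 per year, which is why a higher MTBF points to a more reliable system.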

Resiliency vs High Availability vs Fault Tolerance vs Reliability

  • 2.
  • 3. High Availability is usually achieved through redundant set-up of each component such that any single point of failure is avoided. Special attention needs to be paid to x-shape inter-connectivity of components A and B so that every single failure of component A or B can be bypassed without loss of functionality. High Availability about doubles TCO compared to none HA systems. Resiliency set-up avoids TCO doubling but need more investment into inbuilt error recovery mechanisms, see text. Second image is for “Resiliency”, which is the ability to recover from temporary failures or through some explicit error handling and error correction. Like before, in the 99% availability case only a small amount of steps will fail in average when performing a business scenario. You’d pass in average 495 “Things” successfully and only 5 will go wrong in average.
  • 4. Resilience: It is the ability to recover quickly. That is, if Site 1 goes down, Site 2 immediately comes operational. Or if a disk drive fails, another spare disk drive quickly is added to the storage pool. System Resilience includes eliminating single points of failure in system designs into critical systems. Quality of Service (QoS): It is a technology that enables specified services to receive a higher quality of service than other specified services. Therefore, service providers need to determine which service has the highest priority among the services they provide to their customers. For example, Voice over Internet Protocol (VoIP) systems typically are prioritized to ensure sufficient network bandwidth is always available to avoid any traffic delay or degradation of voice quality. Other services (such as web browsing) will be prioritized at a lower level. Why? Because they are not sensitive to delays. The new net neutrality law gives ISPs a right to provide a higher quality of services to a specified set of customers or for a specified service on the internet.
  • 5. High Availability: It is about having multiple redundant systems that enable zero downtime or degradation for a single failure. High availability can usually be implemented in cluster systems, and it has two modes: 1- Active-active mode: both systems are running and quickly available. 2- Active-passive mode: One system is active, while the other is in standby but can become active, usually within a matter of seconds. Fault Tolerance: It is the ability of a system to suffer a fault but continue to operate. How can the system have this capability? Via adding redundant components such as additional disks within a redundant array of inexpensive disks (RAID) array, multiple power supplies, NIC (multiple network interfaces), or additional servers within a failover clustered configuration.
  • 6. Resiliency is not the same thing as high availability. Resiliency is the network’s ability to handle failures. This includes HA but also includes factors like rate-limiting, security, management, and monitoring. Network-Level Resiliency: Network-level resiliency includes redundancy in the topology (including the physical topology) and control plane resiliency. This means using the hardware for failure detection, prevention, and recovery, for example, using stacking, multiple links, and so on. This is where to use a defence-in-depth approach, which means using several layers of resilience. As an example, you may have many ECMP routed links, and you may also enable UDLD on the links to detect layer-1 failures. Use a modular design in the control plane. One example of this is to use route summarization. Throttling can prevent the control plane from being overwhelmed. The goal is to isolate failures to a single area.
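One common way to implement the throttling mentioned above is a token bucket. The sketch below is a generic illustration, not the mechanism of any particular network platform; the class name and the `rate` and `capacity` parameters are assumptions, and time is passed in explicitly so the behaviour is deterministic.

```python
class TokenBucket:
    """Allow at most `rate` events per second on average, with bursts up to `capacity`."""

    def __init__(self, rate: float, capacity: float) -> None:
        self.rate = rate          # tokens added per second
        self.capacity = capacity  # maximum number of stored tokens
        self.tokens = capacity    # start with a full bucket
        self.last = 0.0           # timestamp of the previous call, in seconds

    def allow(self, now: float) -> bool:
        # Refill according to the time elapsed since the previous call.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = max(self.last, now)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False  # over the limit: drop or delay the event


if __name__ == "__main__":
    bucket = TokenBucket(rate=1.0, capacity=2.0)
    for t in (0.0, 0.0, 0.0, 1.0):
        print(t, bucket.allow(t))
```

Events that exceed the configured rate are rejected, so a burst of control traffic cannot consume all of the processing capacity.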
  • 7. System-Level Resiliency: This is providing resiliency at the device level. This includes dual power supplies, dual supervisors, SSO/NSF, and so on. It also includes software resilience, including security features and control plane hardening. Overlooking this can result in high CPU load, TCAM starvation, and similar errors. Consider using Control Plane Policing (CoPP), limiting flooding, and hardening spanning tree. Also consider using QoS and Storm Control to prevent the data plane from being overwhelmed.
  • 8. Operational Resiliency: This is about how you manage the network. In particular, think about change management and change windows. Software updates also fall into this category. Some platforms support ISSU (In-Service Software Upgrade) or similar for non-disruptive updates.
  • 9. Availability can simply be understood as system uptime, i.e., the percentage of time the storage system is available and operational, allowing data to be accessed. Highly available systems are designed to minimize downtime and avoid loss of service. All organizations expect to achieve high availability for their applications and business services. This is not achieved by a single IT component alone. High availability depends on many IT infrastructure components, including the storage hardware and software, working in concert as expected and minimizing downtime by quickly restoring essential services in the event of a failure.
  • 10. Availability is typically expressed as a number of nines: 1 nine = 90% availability, 2 nines = 99% availability, 3 nines = 99.9% availability, 4 nines = 99.99% availability, and so on. The complement of availability is downtime. So, if a storage system has an annual SLA of 7 nines availability (99.99999%), it would suffer just 3.15 seconds of downtime in a year. You need to fully understand your business requirements and the costs involved to be able to determine and set your availability goals. Service providers, too, offer availability SLAs as part of their contracts. To improve availability, organizations generally use replication techniques that create redundant
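The relationship between nines and annual downtime on this slide can be checked with a short calculation. This sketch assumes a 365-day year; the function names are illustrative.

```python
SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # 31,536,000 seconds, assuming a 365-day year


def availability_from_nines(nines: int) -> float:
    """Convert a number of nines to an availability fraction, e.g. 3 -> 0.999."""
    return 1.0 - 10.0 ** (-nines)


def annual_downtime_seconds(nines: int) -> float:
    """Allowed downtime per year, in seconds, for the given number of nines."""
    return SECONDS_PER_YEAR * 10.0 ** (-nines)


if __name__ == "__main__":
    for n in range(1, 8):
        print(
            f"{n} nines: {availability_from_nines(n) * 100:.5f}% available, "
            f"{annual_downtime_seconds(n):,.2f} s downtime per year"
        )
```

For 7 nines this gives about 3.15 seconds per year, matching the figure above, while 2 nines already allows more than three and a half days.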
  • 12. Resiliency describes the ability of a storage system to self-heal, recover, and continue operating after encountering failures, outages, security incidents, and similar events. High resiliency doesn’t mean there is high data availability. It just means that the storage infrastructure is well enough equipped to overcome disruptions. Resiliency is not a standalone metric; it spans business continuity, incident response, and recovery techniques to reduce the magnitude and duration of disruptive events. Resiliency of a storage system can be improved through redundancy and failover and by building in software-defined intelligence to automatically detect issues and self-heal in a short span of
  • 13. Fault tolerance is similar to the concept of availability, but it goes one step further to guarantee zero downtime. While a highly available storage system may have minimal interruption, a fault-tolerant system will have no service interruption. Because of its more complex design, a fault-tolerant system is typically quite expensive to maintain: it involves running active-active copies of data all the time, with the automation needed to fail over when any component of the storage system fails and would otherwise cause downtime. This failover is non-disruptive, so that applications and data access are not impacted at all and the business continues to function as expected.
  • 14. Durability refers to the continued persistence of data. Businesses will have long-term data retention goals. This is achieved by improving durability of the data and the storage infrastructure preserving it. Especially in the context of object storage where data is archived and preserved for longer terms, it is important to achieve higher durability. A high level of durability ensures that the data does not suffer from bit rot, degradation, or any form of corruption or data loss.
  • 15. Reliability is typically associated with the infrastructure storing the data. It refers to the probability that the storage system will work as expected. A storage system may be available for a certain period of time, but it may not work as expected. In that case, the reliability will be low. Various factors contribute to increasing the reliability of a system. It’s not easy to measure reliability. One common metric used to indicate reliability is mean time between failures (MTBF). MTBF is the predicted elapsed time between inherent failures of a storage system during normal operations. If MTBF is high, it is an indicator that reliability is high.
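MTBF is often combined with mean time to repair (MTTR) to estimate steady-state availability as MTBF / (MTBF + MTTR). MTTR is not mentioned on the slide and is introduced here as an assumption; the sketch below only illustrates that standard formula.

```python
def steady_state_availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Estimate availability as MTBF / (MTBF + MTTR).

    mtbf_hours: mean time between failures, in hours
    mttr_hours: mean time to repair, in hours (an assumption, not from the slide)
    """
    if mtbf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTBF must be positive and MTTR must be non-negative")
    return mtbf_hours / (mtbf_hours + mttr_hours)


if __name__ == "__main__":
    for mtbf in (100.0, 1_000.0, 10_000.0):
        availability = steady_state_availability(mtbf, mttr_hours=1.0)
        print(f"MTBF {mtbf:>8.0f} h, MTTR 1 h -> availability {availability:.5f}")
```

With the repair time held constant, a longer MTBF yields a higher estimated availability.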