How to Build High-Volume, Scalable, and Resilient APIs (EXP18038)

Management and exposure of application programming interfaces (APIs) are hot topics in the world of the Internet of Things. But how do you make sure that your APIs are always reachable, scalable, and capable of processing high volumes of requests with zero downtime? Share common patterns and best practices, and gain insights into building the most powerful, robust APIs that can serve millions of things concurrently. -- SAP TechEd && d-code Berlin (November 13 2014)

“If you add up all the smartphones and the tablets
and the digital televisions and the PCs... we see a
large opportunity of perhaps 3 billion to 4 billion
units per annum, but we see an embedded market
that’s maybe 30 billion to 40 billion units per
annum”
- ARM CEO Warren East

Problem definition
For example, an application that depends on 30 services,
each with 99.99% uptime, gets:
0.9999^30 ≈ 0.997, i.e. 99.7% uptime
0.3% of 1 million requests = 3,000 failures
2+ hours of downtime per month, even if all dependencies have excellent
uptime.
Reality is generally worse.
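
The arithmetic on this slide can be reproduced with a few lines of Python. This is a minimal sketch that assumes the 30 dependencies fail independently and that a month has 30 days; the function name is illustrative.

```python
def compound_availability(per_service: float, n_services: int) -> float:
    """Availability of a request that needs all n independent services to be up."""
    return per_service ** n_services


availability = compound_availability(0.9999, 30)           # roughly 0.997
failures_per_million = (1 - availability) * 1_000_000      # roughly 3,000
downtime_hours_per_month = (1 - availability) * 30 * 24    # a bit over 2 hours

print(f"uptime: {availability:.2%}")
print(f"failures per 1M requests: {failures_per_million:.0f}")
print(f"downtime per month: {downtime_hours_per_month:.2f} h")
```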

Design principles
• Restrict any single dependency from using up all user threads.
• Shed load and fail fast instead of queueing.
• Provide fallbacks wherever feasible to protect users from failure.
• Use isolation techniques (such as bulkhead, swimlane and circuit breaker
patterns) to limit the impact of any one dependency.
• Optimize for time-to-discovery through near real-time metrics, monitoring
and alerting.
• Optimize for time-to-recovery with low-latency propagation of configuration
changes and support for dynamic property changes in virtually all aspects of
Hystrix (Netflix's latency and fault-tolerance library), to allow real-time
operational modifications with low-latency feedback loops.
• Protect against the entire dependency client execution, not just network traffic.
Use timeouts
Time out calls that take longer than a defined threshold. A
default exists, but for most dependencies the threshold is
custom-set via properties to be just slightly higher than the
measured 99.5th percentile latency of that dependency.
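
A minimal sketch of both halves of this slide: deriving a threshold from measured latencies and enforcing it on a call. It uses only the Python standard library, not Hystrix. The 10% headroom, the pool size and all names are assumptions made for illustration.

```python
import math
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeout


class DependencyTimeout(Exception):
    """Raised when a dependency call exceeds its time budget."""


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty collection of latency samples."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def timeout_for(latencies_ms, headroom=1.1):
    """Threshold slightly above the 99.5th percentile (headroom is an assumption)."""
    return percentile(latencies_ms, 99.5) * headroom


_pool = ThreadPoolExecutor(max_workers=4)  # assumed size, shared by all calls


def call_with_timeout(fn, timeout_s, *args):
    """Run fn on a worker thread and stop waiting after timeout_s seconds."""
    future = _pool.submit(fn, *args)
    try:
        return future.result(timeout=timeout_s)
    except FutureTimeout:
        future.cancel()  # only prevents a start; a running call keeps its thread
        raise DependencyTimeout(f"no answer within {timeout_s}s") from None


def slow_dependency():
    """Stands in for a backend that responds too late."""
    time.sleep(0.5)
    return "late"
```

Calling `call_with_timeout(slow_dependency, 0.05)` raises `DependencyTimeout` after about 50 ms. Note that the worker thread stays busy until the slow call returns, which is one reason timeouts are usually combined with bulkheads.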

Bulkheads
Maintain a small thread pool (or semaphore) for
each dependency. If it becomes full, commands are
rejected immediately instead of being queued up.
A dependency with a clogged thread pool shouldn’t
hinder access to other dependencies.
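
The semaphore variant can be sketched as follows. This is an illustration with assumed names, not the Hystrix API: each dependency gets its own `Bulkhead`, and a full bulkhead rejects the call immediately rather than blocking.

```python
import threading


class BulkheadFullError(Exception):
    """Raised when a dependency already runs its maximum number of calls."""


class Bulkhead:
    """Caps the number of concurrent calls to one dependency."""

    def __init__(self, name, max_concurrent):
        self.name = name
        self._slots = threading.BoundedSemaphore(max_concurrent)

    def run(self, fn, *args):
        # Non-blocking acquire: reject right away instead of queueing.
        if not self._slots.acquire(blocking=False):
            raise BulkheadFullError(f"{self.name}: no free slot")
        try:
            return fn(*args)
        finally:
            self._slots.release()


# One bulkhead per dependency, so a clogged one cannot starve the others.
inventory = Bulkhead("inventory", max_concurrent=1)
pricing = Bulkhead("pricing", max_concurrent=1)
```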

Circuit breakers
Trip a circuit breaker to stop all requests to a
service for a period of time, either automatically when
the error percentage passes a threshold, or manually.
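
A deliberately simplified sketch of such a breaker. The parameter names and defaults are assumptions, the counters are cumulative rather than a rolling window, and the circuit closes fully once the open period has elapsed instead of first probing with a single trial request.

```python
import time


class CircuitOpenError(Exception):
    """Raised while the circuit is open and calls are short-circuited."""


class CircuitBreaker:
    def __init__(self, error_threshold=0.5, min_calls=10,
                 open_seconds=5.0, clock=time.monotonic):
        self.error_threshold = error_threshold  # error fraction that trips the circuit
        self.min_calls = min_calls              # do not trip on too little data
        self.open_seconds = open_seconds        # how long requests stay blocked
        self._clock = clock
        self._successes = 0
        self._failures = 0
        self._opened_at = None

    def force_open(self):
        """Manual trip, e.g. triggered by an operator."""
        self._opened_at = self._clock()

    def call(self, fn, *args):
        now = self._clock()
        if self._opened_at is not None:
            if now - self._opened_at < self.open_seconds:
                raise CircuitOpenError("short-circuited")
            # Open period is over: close the circuit and start counting afresh.
            self._opened_at = None
            self._successes = self._failures = 0
        try:
            result = fn(*args)
        except Exception:
            self._failures += 1
            total = self._successes + self._failures
            if total >= self.min_calls and self._failures / total >= self.error_threshold:
                self._opened_at = now
            raise
        self._successes += 1
        return result
```

The injectable `clock` is there so the open period can be exercised in tests without sleeping.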

Fallback logic
Perform fallback logic when a request
fails, is rejected, times out or is short-circuited.
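
One way to picture this is a wrapper that returns a degraded but useful answer whenever the primary call raises. The sketch below assumes that failure, rejection, timeout and short-circuit all surface as exceptions; the function names and the static list are illustrative.

```python
def with_fallback(primary, fallback):
    """Wrap primary so that any exception is answered by fallback instead."""
    def guarded(*args, **kwargs):
        try:
            return primary(*args, **kwargs)
        except Exception:
            # Failure, rejection, timeout and short-circuit all end up here.
            return fallback(*args, **kwargs)
    return guarded


DEFAULT_TITLES = ["Popular title 1", "Popular title 2"]  # illustrative static data


def personalised_recommendations(user_id):
    """Stands in for a dependency that is currently down."""
    raise ConnectionError("recommendation service unavailable")


def default_recommendations(user_id):
    """Non-personalised answer that needs no backend call."""
    return DEFAULT_TITLES


get_recommendations = with_fallback(personalised_recommendations,
                                    default_recommendations)
```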

Measure
Measure success, failures
(exceptions thrown by client),
timeouts, and thread
rejections.
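
A minimal sketch of per-dependency counters for these four outcomes. The class and outcome names are assumptions; a production setup would typically use a rolling window and export the numbers to a monitoring system.

```python
import threading
from collections import Counter

OUTCOMES = ("success", "failure", "timeout", "rejected")


class CommandMetrics:
    """Thread-safe outcome counters for one dependency."""

    def __init__(self):
        self._lock = threading.Lock()
        self._counts = Counter()

    def record(self, outcome):
        if outcome not in OUTCOMES:
            raise ValueError(f"unknown outcome: {outcome}")
        with self._lock:
            self._counts[outcome] += 1

    def snapshot(self):
        """Copy of the current counts, safe to hand to a dashboard."""
        with self._lock:
            return dict(self._counts)

    def error_percentage(self):
        """Share of calls that did not succeed, in percent."""
        with self._lock:
            total = sum(self._counts.values())
            errors = total - self._counts["success"]
        return 100.0 * errors / total if total else 0.0
```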

Request collapsing
Collapse multiple concurrent user requests
into a single backend dependency call
(within a short time window of e.g. 10 ms).
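
A sketch of the idea, assuming the backend offers a batch call that takes a list of keys and returns a dict of results. The first request in a window starts a timer; everything submitted before it fires shares one backend call. Class and parameter names are illustrative.

```python
import threading
from concurrent.futures import Future


class RequestCollapser:
    """Batches requests that arrive within one time window into a single call."""

    def __init__(self, batch_fn, window_s=0.010):
        self._batch_fn = batch_fn    # assumed signature: list of keys -> dict key -> value
        self._window_s = window_s
        self._lock = threading.Lock()
        self._pending = {}           # key -> Future shared by all callers of that key

    def submit(self, key):
        """Queue a request and return a Future for its result."""
        with self._lock:
            first_in_window = not self._pending
            future = self._pending.get(key)
            if future is None:
                future = Future()
                self._pending[key] = future
            if first_in_window:
                timer = threading.Timer(self._window_s, self._flush)
                timer.daemon = True
                timer.start()
        return future

    def _flush(self):
        # Take the whole batch so later requests open a new window.
        with self._lock:
            batch, self._pending = self._pending, {}
        try:
            results = self._batch_fn(list(batch))
            for key, future in batch.items():
                future.set_result(results[key])
        except Exception as exc:
            for future in batch.values():
                if not future.done():
                    future.set_exception(exc)
```

Callers block on `future.result()` for at most the window plus the duration of the batch call, so the window trades a little latency for fewer backend requests.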

Request caching
Reduce the number of requests sent to the
backend dependencies by caching and de-duping
requests.
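
A minimal sketch of a cache scoped to one incoming user request, so that repeated lookups of the same key reach the backend only once. The class name and the `loader` callback are assumptions for illustration.

```python
class RequestCache:
    """Create one instance per incoming user request and discard it afterwards."""

    def __init__(self):
        self._values = {}

    def get_or_load(self, key, loader):
        """Return the cached value for key, calling loader(key) only on a miss."""
        if key not in self._values:
            self._values[key] = loader(key)
        return self._values[key]
```

Because the cache lives only as long as the request, it de-dupes calls without the invalidation problems of a long-lived cache.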

Define a pipeline and context
Many services share base functionality such as
authentication. Defining a clear request pipeline and
context optimizes shared logic and prevents
repeated calls (e.g. getCustomer).
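
A sketch of what such a pipeline could look like: each step receives a shared context, and the customer is looked up once during authentication and reused by later steps. The step names, the token and the in-memory customer table are assumptions standing in for real services.

```python
from dataclasses import dataclass, field
from typing import Optional

CUSTOMERS = {"token-123": {"id": 7, "name": "Ada"}}  # stand-in for a backend
lookups = []  # records every backend lookup so repeats become visible


def get_customer(token):
    """Hypothetical backend call that the pipeline should make only once."""
    lookups.append(token)
    return CUSTOMERS[token]


@dataclass
class RequestContext:
    token: str
    customer: Optional[dict] = None
    response: dict = field(default_factory=dict)


def authenticate(ctx):
    ctx.customer = get_customer(ctx.token)  # the single lookup per request


def load_ratings(ctx):
    ctx.response["ratings_for"] = ctx.customer["id"]  # reuses the context


def load_recommendations(ctx):
    ctx.response["recommendations_for"] = ctx.customer["id"]


def run_pipeline(ctx, steps):
    """Run the steps in order against the shared context."""
    for step in steps:
        step(ctx)
    return ctx.response
```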

Don’t lock the bonnet
Make it possible to switch on logging and direct certain
traffic to a specific node
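
Both switches can be sketched in a few lines. The `X-Debug-Node` header, the node names and the function names are hypothetical; the point is only that log verbosity and routing can be changed per request or at runtime without a redeploy.

```python
import logging
import zlib

NODES = ["node-a", "node-b", "node-c"]  # hypothetical node names


def choose_node(request_id, headers, nodes=NODES):
    """Honor an explicit pin if it names a known node, else use a stable hash."""
    pinned = headers.get("X-Debug-Node")  # hypothetical header
    if pinned in nodes:
        return pinned
    return nodes[zlib.crc32(request_id.encode("utf-8")) % len(nodes)]


def set_log_level(logger_name, level_name):
    """Change a logger's verbosity at runtime, e.g. from an admin endpoint."""
    logging.getLogger(logger_name).setLevel(level_name.upper())
```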

REST vs Experience API
/users/<id>/ratings/title
/users/<id>/queues
/users/<id>/queues/instant
/users/<id>/recommendations
/catalog/titles/movie
/catalog/titles/series
/catalog/people
vs.

Example: /phone/homescreen
[Diagram labels: User Interface Rendering; Data gathering, formatting and delivery]
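
To illustrate the contrast, an experience endpoint such as /phone/homescreen might gather several of the granular resources on the server and deliver one payload shaped for that screen. In this sketch `fetch` is a hypothetical function that resolves a resource path to a list, and the field names and the limit of 10 items are assumptions.

```python
def phone_homescreen(user_id, fetch):
    """Gather and format everything the phone home screen needs in one response."""
    queue = fetch(f"/users/{user_id}/queues/instant")
    recommendations = fetch(f"/users/{user_id}/recommendations")
    return {
        "instantQueue": queue[:10],
        "recommendations": recommendations[:10],
    }
```

The device then makes one call instead of several, and the data gathering and formatting stay on the server.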

Thanks for listening!
We are hiring!
Contact me:
jan@penninkhof.com
