SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
CASCON 2020 Nov 12, 2020
Performance Modeling of Serverless Computing Platforms
N. Mahmoudi and H. Khazaei, "Performance Modeling of
Serverless Computing Platforms," in IEEE Transactions on
Cloud Computing, doi: 10.1109/TCC.2020.3033373.
Hamzeh Khazaei
hkh@yorku.ca
Performant and Available
Computing Systems (PACS) Lab
Nima Mahmoudi
nmahmoud@ualberta.ca
CASCON 2020 Nov 12, 2020
TOC
Introduction
System Description
Analytical Modelling
Experimental Validation
Conclusion
2
Introduction
3
Serverless Computing
● Runtime operation and management done by the provider
○ Reduces the overhead for the software owner
○ Provisioning
○ Scaling resources
● Software is developed by writing functions
○ Well-defined interface
○ Functions deployed separately
4
Serverless Computing
Image source: https://aws.amazon.com/lambda/
5
The Need for a Performance Model
● No previous work has been done for performance modelling of Serverless Computing
platforms
● Accurate performance modelling can beneficial in many ways:
○ Ensure the Quality of Service (QoS)
○ Improve performance metrics
○ Predict/optimize infrastructure cost
○ Move from best-effort to performance guarantees
● It can benefit both serverless provider and application developer
6
System Description
7
Function States, Cold Starts, and Warm Starts
● Function States:
○ Initializing: Performing initialization tasks to prepare the function for incoming requests. Includes
infrastructure initialization and application initialization.
○ Running: Running the tasks required to process a request.
○ Idle: Provisioned instance that is not running any workloads. The instances in this state are not billed.
● Cold Start Requests:
○ A request that needs to go through initialization steps due to lack of provisioned capacity.
● Warm Start Requests:
○ Only includes request processing time since idle instance was available
8
Autoscaling
Expiration Threshold
Scaling Out:
Scaling In:
9
Other Important Characteristics
● Initialization Time: The amount of time instance spends in the initializing state.
● Response Time: No queuing, so it is equal to service time. It remains stable
throughout time for cold and warm start requests.
● Maximum Concurrency Level: Maximum number of instances that can be in the state
running in parallel.
● Request Routing: To facilitate scaling in, requests are routed to recently created
instances first.
10
Analytical Modelling
11
Overview
Warm Pool Model:
12
● Cold Start Rate
○ The system behaves like an Erlang Loss System (M/G/m/m).
○ The blocked requests are either rejected (reached maximum concurrency level) or cause a
cold start (and thus the creation of a new instance)
● Arrival Rate for Each Instance
○ Requests blocked by instance n are either processed by instance n+1 or blocked by it.
○ The difference between blocked rates gives us individual arrival rates.
● Server Expiration Rate
○ Can be calculated knowing individual arrival rates and expiration threshold.
● Warm Pool Model
○ Each state represents the number of instances in the warm pool.
13
● For each state, we can also calculate:
○ Probability of Rejection: Probability of being blocked when reaching
maximum concurrency.
○ Probability of Cold Start: Probability of being blocked in other states.
○ Average Response Time:
○ Mean Number of Instances in Warm Pool:
■ Running
■ Idle
○ Utilization: Ratio of instances in running state over all instances.
14
15
Experimental Validation
16
Experimental Setup
● Experiments done on AWS Lambda
○ Python 3.6 runtime with 128MB of RAM on us-east-1 region
○ A mixture of CPU and IO intensive tasks
● Client was a virtual machine on Compute Canada Arbutus
○ 8 vCPUs, 16GB of RAM, 1000Mbps connectivity, single-digit milliseconds latency to AWS
servers
○ Python with in-house workload generation tool pacswg
○ Official boto3 library for API communication
○ Communicated directly with Lambda API, no intermediary interfaces like API Gateway
17
Experimental Results
18
Sample Workloads
Name Application Warm (ms) Cold (ms)
W1 CPU and disk intensive benchmark 2000 2200
W2 A range of benchmarks with different configurations 300 10000
W3 Startup test with echo on Apache OpenWhisk 20 1000
W4 Fibonacci Calculation on AWS Lambda 4211 5961
W5 Fibonacci calculation on Azure Functions 1809 26681
19
Probability of Rejection
20
What-If Analysis Results
21
Conclusion
22
Conclusion
● Accurate and tractable analytical performance model
● Ability to predict important performance/cost related metrics
● Can predict QoS
● Can benefit serverless providers
○ Ability to predict QoS under different loads on deployment time
○ Optimize infrastructure cost, knowing how it would affect performance
● Could be useful to application developers
○ Predict how their system will react under different loads
○ Help them optimize their memory configuration to occur minimal cost that satisfies
performance requirements
● Making the expiration threshold adaptive would allow better cost-performance tradeoff
23
Summary
Website: research.nima-dev.com
Twitter: @nima_mahmoudi
24

Weitere ähnliche Inhalte

Was ist angesagt?

Resource scheduling algorithm
Resource scheduling algorithmResource scheduling algorithm
Resource scheduling algorithm
Shilpa Damor
 
Virtualization Techniques & Cloud Compting
Virtualization Techniques & Cloud ComptingVirtualization Techniques & Cloud Compting
Virtualization Techniques & Cloud Compting
Ahmed Mekkawy
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bank
pkaviya
 
Cloud computing virtualization
Cloud computing virtualizationCloud computing virtualization
Cloud computing virtualization
Ayaz Shahid
 

Was ist angesagt? (20)

Introduction of Cloud computing
Introduction of Cloud computingIntroduction of Cloud computing
Introduction of Cloud computing
 
Approaches to Software Development
Approaches to Software DevelopmentApproaches to Software Development
Approaches to Software Development
 
Software Deployment Principles & Practices
Software Deployment Principles & PracticesSoftware Deployment Principles & Practices
Software Deployment Principles & Practices
 
Software design principles for evolving architectures
Software design principles for evolving architecturesSoftware design principles for evolving architectures
Software design principles for evolving architectures
 
Load Balancing In Distributed Computing
Load Balancing In Distributed ComputingLoad Balancing In Distributed Computing
Load Balancing In Distributed Computing
 
Resource scheduling algorithm
Resource scheduling algorithmResource scheduling algorithm
Resource scheduling algorithm
 
Virtualization Techniques & Cloud Compting
Virtualization Techniques & Cloud ComptingVirtualization Techniques & Cloud Compting
Virtualization Techniques & Cloud Compting
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bank
 
Oracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management Platforma
 
Task scheduling Survey in Cloud Computing
Task scheduling Survey in Cloud ComputingTask scheduling Survey in Cloud Computing
Task scheduling Survey in Cloud Computing
 
Virtual Machine provisioning and migration services
Virtual Machine provisioning and migration servicesVirtual Machine provisioning and migration services
Virtual Machine provisioning and migration services
 
Case study of amazon EC2 by Akash Badone
Case study of amazon EC2 by Akash BadoneCase study of amazon EC2 by Akash Badone
Case study of amazon EC2 by Akash Badone
 
Platform as a Service (PaaS)
Platform as a Service (PaaS)Platform as a Service (PaaS)
Platform as a Service (PaaS)
 
Cloud computing virtualization
Cloud computing virtualizationCloud computing virtualization
Cloud computing virtualization
 
Introduction to Google App Engine
Introduction to Google App EngineIntroduction to Google App Engine
Introduction to Google App Engine
 
Virtual Machine
Virtual MachineVirtual Machine
Virtual Machine
 
Cloud computing and Docker
Cloud computing and DockerCloud computing and Docker
Cloud computing and Docker
 
Virtualization
Virtualization Virtualization
Virtualization
 
REVIEW PAPER on Scheduling in Cloud Computing
REVIEW PAPER on Scheduling in Cloud ComputingREVIEW PAPER on Scheduling in Cloud Computing
REVIEW PAPER on Scheduling in Cloud Computing
 
Edge computing
Edge computingEdge computing
Edge computing
 

Ähnlich wie Performance Modeling of Serverless Computing Platforms - CASCON2020 Workshop on Cloud Computing

Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ UberKafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
confluent
 
TechTalk_Cloud Performance Testing_0.6
TechTalk_Cloud Performance Testing_0.6TechTalk_Cloud Performance Testing_0.6
TechTalk_Cloud Performance Testing_0.6
Sravanthi N
 

Ähnlich wie Performance Modeling of Serverless Computing Platforms - CASCON2020 Workshop on Cloud Computing (20)

Temporal Performance Modelling of Serverless Computing Platforms - WoSC6
Temporal Performance Modelling of Serverless Computing Platforms - WoSC6Temporal Performance Modelling of Serverless Computing Platforms - WoSC6
Temporal Performance Modelling of Serverless Computing Platforms - WoSC6
 
Nexmark with beam
Nexmark with beamNexmark with beam
Nexmark with beam
 
Serving deep learning models in a serverless platform (IC2E 2018)
Serving deep learning models in a serverless platform (IC2E 2018)Serving deep learning models in a serverless platform (IC2E 2018)
Serving deep learning models in a serverless platform (IC2E 2018)
 
Zero Downtime JEE Architectures
Zero Downtime JEE ArchitecturesZero Downtime JEE Architectures
Zero Downtime JEE Architectures
 
Container world 2019 Canary Release
Container world 2019 Canary ReleaseContainer world 2019 Canary Release
Container world 2019 Canary Release
 
How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...
How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...
How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...
 
Serverless GraphQL. AppSync 101
Serverless GraphQL. AppSync 101Serverless GraphQL. AppSync 101
Serverless GraphQL. AppSync 101
 
Machine learning at scale with aws sage maker
Machine learning at scale with aws sage makerMachine learning at scale with aws sage maker
Machine learning at scale with aws sage maker
 
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ UberKafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
 
Stream and Batch Processing in the Cloud with Data Microservices
Stream and Batch Processing in the Cloud with Data MicroservicesStream and Batch Processing in the Cloud with Data Microservices
Stream and Batch Processing in the Cloud with Data Microservices
 
Scaling Monitoring At Databricks From Prometheus to M3
Scaling Monitoring At Databricks From Prometheus to M3Scaling Monitoring At Databricks From Prometheus to M3
Scaling Monitoring At Databricks From Prometheus to M3
 
SimScale: Unparalleled CFD Speeds with Parallel Computing
SimScale: Unparalleled CFD Speeds with Parallel ComputingSimScale: Unparalleled CFD Speeds with Parallel Computing
SimScale: Unparalleled CFD Speeds with Parallel Computing
 
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
 
Phil Basford - machine learning at scale with aws sage maker
Phil Basford - machine learning at scale with aws sage makerPhil Basford - machine learning at scale with aws sage maker
Phil Basford - machine learning at scale with aws sage maker
 
Test strategies for data processing pipelines
Test strategies for data processing pipelinesTest strategies for data processing pipelines
Test strategies for data processing pipelines
 
TechTalk_Cloud Performance Testing_0.6
TechTalk_Cloud Performance Testing_0.6TechTalk_Cloud Performance Testing_0.6
TechTalk_Cloud Performance Testing_0.6
 
How Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per dayHow Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per day
 
Hadoop summit - Scaling Uber’s Real-Time Infra for Trillion Events per Day
Hadoop summit - Scaling Uber’s Real-Time Infra for  Trillion Events per DayHadoop summit - Scaling Uber’s Real-Time Infra for  Trillion Events per Day
Hadoop summit - Scaling Uber’s Real-Time Infra for Trillion Events per Day
 
HKG15-204: OpenStack: 3rd party testing and performance benchmarking
HKG15-204: OpenStack: 3rd party testing and performance benchmarkingHKG15-204: OpenStack: 3rd party testing and performance benchmarking
HKG15-204: OpenStack: 3rd party testing and performance benchmarking
 
AppRunner DeepDive
AppRunner DeepDiveAppRunner DeepDive
AppRunner DeepDive
 

Kürzlich hochgeladen

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Dr.Costas Sachpazis
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Kürzlich hochgeladen (20)

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 

Performance Modeling of Serverless Computing Platforms - CASCON2020 Workshop on Cloud Computing

  • 1. CASCON 2020 Nov 12, 2020 Performance Modeling of Serverless Computing Platforms N. Mahmoudi and H. Khazaei, "Performance Modeling of Serverless Computing Platforms," in IEEE Transactions on Cloud Computing, doi: 10.1109/TCC.2020.3033373. Hamzeh Khazaei hkh@yorku.ca Performant and Available Computing Systems (PACS) Lab Nima Mahmoudi nmahmoud@ualberta.ca
  • 2. CASCON 2020 Nov 12, 2020 TOC Introduction System Description Analytical Modelling Experimental Validation Conclusion 2
  • 4. Serverless Computing ● Runtime operation and management done by the provider ○ Reduces the overhead for the software owner ○ Provisioning ○ Scaling resources ● Software is developed by writing functions ○ Well-defined interface ○ Functions deployed separately 4
  • 5. Serverless Computing Image source: https://aws.amazon.com/lambda/ 5
  • 6. The Need for a Performance Model ● No previous work has been done for performance modelling of Serverless Computing platforms ● Accurate performance modelling can beneficial in many ways: ○ Ensure the Quality of Service (QoS) ○ Improve performance metrics ○ Predict/optimize infrastructure cost ○ Move from best-effort to performance guarantees ● It can benefit both serverless provider and application developer 6
  • 8. Function States, Cold Starts, and Warm Starts ● Function States: ○ Initializing: Performing initialization tasks to prepare the function for incoming requests. Includes infrastructure initialization and application initialization. ○ Running: Running the tasks required to process a request. ○ Idle: Provisioned instance that is not running any workloads. The instances in this state are not billed. ● Cold Start Requests: ○ A request that needs to go through initialization steps due to lack of provisioned capacity. ● Warm Start Requests: ○ Only includes request processing time since idle instance was available 8
  • 10. Other Important Characteristics ● Initialization Time: The amount of time instance spends in the initializing state. ● Response Time: No queuing, so it is equal to service time. It remains stable throughout time for cold and warm start requests. ● Maximum Concurrency Level: Maximum number of instances that can be in the state running in parallel. ● Request Routing: To facilitate scaling in, requests are routed to recently created instances first. 10
  • 13. ● Cold Start Rate ○ The system behaves like an Erlang Loss System (M/G/m/m). ○ The blocked requests are either rejected (reached maximum concurrency level) or cause a cold start (and thus the creation of a new instance) ● Arrival Rate for Each Instance ○ Requests blocked by instance n are either processed by instance n+1 or blocked by it. ○ The difference between blocked rates gives us individual arrival rates. ● Server Expiration Rate ○ Can be calculated knowing individual arrival rates and expiration threshold. ● Warm Pool Model ○ Each state represents the number of instances in the warm pool. 13
  • 14. ● For each state, we can also calculate: ○ Probability of Rejection: Probability of being blocked when reaching maximum concurrency. ○ Probability of Cold Start: Probability of being blocked in other states. ○ Average Response Time: ○ Mean Number of Instances in Warm Pool: ■ Running ■ Idle ○ Utilization: Ratio of instances in running state over all instances. 14
  • 15. 15
  • 17. Experimental Setup ● Experiments done on AWS Lambda ○ Python 3.6 runtime with 128MB of RAM on us-east-1 region ○ A mixture of CPU and IO intensive tasks ● Client was a virtual machine on Compute Canada Arbutus ○ 8 vCPUs, 16GB of RAM, 1000Mbps connectivity, single-digit milliseconds latency to AWS servers ○ Python with in-house workload generation tool pacswg ○ Official boto3 library for API communication ○ Communicated directly with Lambda API, no intermediary interfaces like API Gateway 17
  • 19. Sample Workloads Name Application Warm (ms) Cold (ms) W1 CPU and disk intensive benchmark 2000 2200 W2 A range of benchmarks with different configurations 300 10000 W3 Startup test with echo on Apache OpenWhisk 20 1000 W4 Fibonacci Calculation on AWS Lambda 4211 5961 W5 Fibonacci calculation on Azure Functions 1809 26681 19
  • 23. Conclusion ● Accurate and tractable analytical performance model ● Ability to predict important performance/cost related metrics ● Can predict QoS ● Can benefit serverless providers ○ Ability to predict QoS under different loads on deployment time ○ Optimize infrastructure cost, knowing how it would affect performance ● Could be useful to application developers ○ Predict how their system will react under different loads ○ Help them optimize their memory configuration to occur minimal cost that satisfies performance requirements ● Making the expiration threshold adaptive would allow better cost-performance tradeoff 23