SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
Ming Mao, Jie Li, Marty Humphrey

             eScience Group

   CS Department, University of Virginia

        Grid 2010 – Oct 27, 2010
   A fast growing computing platform
     IDC - Cloud spending increases 27.4% a year to $56
      billion (compared 5% a year of traditional IT)
            $16.5 billion (2009) -> $55.5 billion (2014)
     src: Worldwide and Regional Public IT Cloud Service 2010-2014 Forecast


   Two most quoted benefits
     Scalable computing and storage
     Reduced cost

   Concerns
     Security, availability, cost management, integration
      interoperability, etc.
        Q1. Cost – the most important factor in
                       practice?
                      Rate the benefits commonly ascribed to the                                                                  How important is it that Cloud service providers...
                                cloud on-demand model                                                                                      Offer competitive pricing                                              91.60%
            Pay only for what you use                                                           77.90%                              Offer Service Level Agreements                                              88.60%
       Easy/fast to deply to end-users                                                                             Option to move cloud offerings back on premise                                               87.80%
                                                                                                77.70%
                                                                                                                                       Provide a complete solution                                             86.00%
                    Monthly payments                                                           75.30%
                                                                                                                             Understand my business and industry                                              84.50%
        Encourages standard systems                                                     68.50%
                                                                                                                     Allow managing on-premise & cloud together                                              82.10%
  Requires less in-house IT staff, costs                                               67.00%                                        Support many of my IT needes                                           81.00%
    Alwasys offers latest functionality                                               64.60%                       Offer both on-premise and public cloud services                                         79.20%
Sharing systems with partners simpler                                                 63.90%                       Are a technology and business model innovator                                           78.30%
         Seems like the way of future                                        54.00%                                    Have local presence, can come to my offices                                      72.90%

                                      0.00%        20.00%      40.00%        60.00%       80.00%         100.00%                                                 0.00% 20.00% 40.00% 60.00%             80.00%   100.00%
                              Source: IDC Enterprise Panel, 3Q09, n = 263, Sep 2009                                                             Source: IDC Enterprise Panel, 3Q09, n = 263, Sep 2009




                      Q2. Moving into Cloud == Reduced Cost ?
   Resource utilization information based triggers (e.g.
    AWS auto-scaling, RightScale, enStratus, Scalr, etc)
   Multiple instance types


   Current billing models
     Full hour billing

   Non-ignorable instance acquisition time
     7-15 min in Windows Azure

   More specific performance goals
   Budget awareness (e.g. dollars/month, dollars/job)
Cloud
   Deadline               Users
                                       Application



    (Job finish time)
                                                     Cloud Server



    Cost
                                         Job



    Problem Statement – how to enable cloud
    applications to finish all the submitted jobs
    before user specified deadline with as little
    money as possible using auto-scaling.
   Workload are non-dependent jobs submitted
    in the job queue

   FCFS manner and fairly distributed

   Different classes of jobs

   Same performance goal (e.g.1 hour deadline)

   VM instances take time to startup
ni
tj
I
Vii,i j
d




          Key variables used in the model
   Workload
     W  (J j , nj )

   Computing Power of Instance I i
                       D  nj
    P  (J j ,                                    )       Running Instance
                 
     i
                       j
                           t j ,type ( Ii ) n j


             ( D  (dtype ( Ii )  si ))  n j
P  (J j ,                                            )
                    j t j ,type ( Ii ) n j                Pending Instance
 i
   Scale up
     Sufficient budget
     Min(i ctype ( Ii ') )         P ' W  P
                                          i       i

     Insufficient budget
      Max( Pi ')             c
                               i   type ( Ii ')        C  i ctype ( Ii )

   Scale down

       P  P W
          i   i   s
Workload                           Required Computing Power
j1 :  x  60 10  10   40          j1 : 10              10        10  x   45
j2 :  y   60   5    20  35
                                 j2 : n1 '  5   n2 ' 20  n3 ' 10   y   35
                                                                               
j3 :  z  60  20  5  35
                                 j3 :  20             5         10  z  35
                                                                               
      P' W            I1      I2                   V1          V2           V3      P'

                               Min(c1n1 ' c2 n2 ' c3n3 ')

              where          c1n1 ' c2 n2 ' c3n3 ' ctype( I1 )  ctype( I2 )  C
Cloud Cruise Control

            notify                           Decider

 admin                    Min( i ctype ( Ii ') ) &            Pj '  W  P
           dynamic                                          j

         configuration
                                                                        vm plan

                                                                     VM
                          Monitor          Repository
                                                                   Manager
                                                                                     +, –
                                              Config


               workload             update             update                  vm info




enqueue
                                               Historical
                                                                                         VM instances
                                                 Data
           users
                                                 dequeue
Workload & VM simulation parameters

                    Mix             Computing          IO Intensive
               Avg 30 jobs/hour     Intensive        Avg 30 jobs/hour
               STD 5 jobs/hour    Avg 30 jobs/hour   STD 5 jobs/hour
                                  STD 5 jobs/hour
  General       Average 300s       Average 300s       Average 300s
0.085$/hour       STD 50s            STD 50s            STD 50s
Delay 600s
 High-CPU       Average 210s        Average 75s       Average 300s
0.17$/hour        STD 25s            STD 15s            STD 50s
Delay 720s
  High-IO       Average 210s       Average 300s        Average 75s
0.17$/hour        STD 25s            STD 50s            STD 15s
Delay 720s
Stable Worload & Changing Deadline
Response (sec)                                            Utilization (%)
                                                                 100.00%
7000
                                                                 90.00%
6000
                                                                 80.00%
5000                                                             70.00%
                                                                 60.00%
4000
                                                                 50.00%
3000                                                             40.00%

2000                                                             30.00%
                                                                 20.00%
1000
                                                                 10.00%
   0                                                             0.00%

       0    10       20   30     40     50     60   70     80
                               Time (hour)
           utilization     deadline        avg      max         min
Changing Workload & Fixed Deadline
Response (sec)                                                Worload (job/h)
 4000                                                                   350

 3500                                                                   300
 3000
                                                                        250
 2500
                                                                        200
 2000
                                                                        150
 1500
                                                                        100
 1000

  500                                                                   50

    0                                                                   0
        0        10     20     30    40     50      60   70      80
                                    Time (hour)
             deadline        avg      max         min     workload
VM Types               Total Cost ($)
                                         % more than optimal
Choice #1             General               98.52$ (43%)
Choice #2            High-CPU              128.86$ (87%)
Choice #3             High-IO              129.71$ (88%)
Choice #4   General, High-CPU, High-IO      78.62$ (14%)
 Optimal    General, High-CPU, High-IO         68.85$
   MODIS
200X – Year                                   Terra & Aqua – Satellite
(X - Y) – Day X to day Y                      15 images / day
                      Moderate scale test (up to 20 instances)
                              1hour deadline          2hour deadline        3hour deadline
      Terra 2004(10-12)         18 min late             8 min early          20 min early
         Total 45 jobs       9 C.H.or 1.08$           6 C.H or 0.72$         5 C.H.or 0.6$
       4 C.H.* or 0.48$
      Aqua 2008(30-32)              15min late         20 min early         29 min early
         Total 45 jobs            10 C.H or 1.2$      7 C.H.or 0.84$        5 C.H.or 0.6$
       4 C.H. or 0.48$

                          Large Scale test (up to 90 instances)
                                             2 hour deadline            4 hour deadline
       Terra & Aqua 2006(1-75)                  20min late                6 min early
             Total 1125 jobs                170 C.H. or 20.4$          132 C.H. or 15.84$
           93 C.H. or 11.16$
       Terra & Aqua 2006(1-150)             Admission Denied              22 min early
             Total 2250 jobs                                           243 C.H. or 29.16$
           185 C.H. or 22.2$
               * C.H. – computing hour             1C.H. = 0.12$ in Windows Azure
   Test: Terra & Aqua 2006(1-75) - total 1125 jobs
                  6min early
                  theoretical cost - 93 C.H. or 11.16$
                  actual cost - 132 C.H. or 15.84$
                                       Instance Acquisition and Release
                          40
                          38
                          36
                          34
                          32
                          30
                          28
                          26
        Instance Number




                          24
                          22
                          20
                          18
                          16
                          14
                          12
                          10
                           8
                           6
                           4
                           2
                           0

                               0   1         2                 3               4                  5
                                                 Time (hour)        Released       Acquiring   Ready
   Conclusions
     More cost-efficient than fixed-size instance choice
     VM startup delay can affect hugely in practice


   Future works
     More general cloud application model
     Multiple job classes
     Consider other instance types (e.g. spot instances &
      reserved instances)
     Data transfer performance and storage cost
Cloud auto-scaling with deadline and budget constraints

Weitere ähnliche Inhalte

Ähnlich wie Cloud auto-scaling with deadline and budget constraints

Capacity Planning for Linux Systems
Capacity Planning for Linux SystemsCapacity Planning for Linux Systems
Capacity Planning for Linux SystemsRodrigo Campos
 
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...Kontagent
 
Cost Analysis In IT - HES08
Cost Analysis In IT - HES08Cost Analysis In IT - HES08
Cost Analysis In IT - HES08Thomas Danford
 
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12OnTechnicalDebt
 
External should that be a microservice
External should that be a microserviceExternal should that be a microservice
External should that be a microserviceRohit Kelapure
 
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...Amazon Web Services
 
The Cloud - An introduction
The Cloud - An introductionThe Cloud - An introduction
The Cloud - An introductionLenny Rachitsky
 
ENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkAmazon Web Services
 
Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Amazon Web Services
 
Cloud computing - co daje firmie?
Cloud computing - co daje firmie? Cloud computing - co daje firmie?
Cloud computing - co daje firmie? Biznes to Rozmowy
 
Gitex2010 ICT strategies moving to the cloud v11
Gitex2010 ICT strategies moving to the cloud v11Gitex2010 ICT strategies moving to the cloud v11
Gitex2010 ICT strategies moving to the cloud v11Jorge Sebastiao
 
MVisio: A Computer Graphics Platform for Virtual Reality, Science and Education
MVisio: A Computer Graphics Platform for Virtual Reality, Science and EducationMVisio: A Computer Graphics Platform for Virtual Reality, Science and Education
MVisio: A Computer Graphics Platform for Virtual Reality, Science and EducationAchille Peternier
 
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdf
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdfAWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdf
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdfAmazon Web Services
 
Microservices with .Net - NDC Sydney, 2016
Microservices with .Net - NDC Sydney, 2016Microservices with .Net - NDC Sydney, 2016
Microservices with .Net - NDC Sydney, 2016Richard Banks
 
CCCC Neustar Lenny Rachitsky
CCCC Neustar Lenny RachitskyCCCC Neustar Lenny Rachitsky
CCCC Neustar Lenny RachitskyCloud Congress
 
What does performance mean in the cloud
What does performance mean in the cloudWhat does performance mean in the cloud
What does performance mean in the cloudMichael Kopp
 
Business Case Calculator for DevOps Initiatives - Leading credit card service...
Business Case Calculator for DevOps Initiatives - Leading credit card service...Business Case Calculator for DevOps Initiatives - Leading credit card service...
Business Case Calculator for DevOps Initiatives - Leading credit card service...Capgemini
 

Ähnlich wie Cloud auto-scaling with deadline and budget constraints (20)

Capacity Planning for Linux Systems
Capacity Planning for Linux SystemsCapacity Planning for Linux Systems
Capacity Planning for Linux Systems
 
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...
Feature Extraction for Predictive LTV Modeling using Hadoop, Hive, and Cascad...
 
Cost Analysis In IT - HES08
Cost Analysis In IT - HES08Cost Analysis In IT - HES08
Cost Analysis In IT - HES08
 
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12
Estimating the principal of Technical Debt - Dr. Bill Curtis - WTD '12
 
Inventory
InventoryInventory
Inventory
 
External should that be a microservice
External should that be a microserviceExternal should that be a microservice
External should that be a microservice
 
ParticleVM
ParticleVMParticleVM
ParticleVM
 
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
 
The Cloud - An introduction
The Cloud - An introductionThe Cloud - An introduction
The Cloud - An introduction
 
ENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value Framework
 
Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018
 
Cloud computing - co daje firmie?
Cloud computing - co daje firmie? Cloud computing - co daje firmie?
Cloud computing - co daje firmie?
 
Gitex2010 ICT strategies moving to the cloud v11
Gitex2010 ICT strategies moving to the cloud v11Gitex2010 ICT strategies moving to the cloud v11
Gitex2010 ICT strategies moving to the cloud v11
 
MVisio: A Computer Graphics Platform for Virtual Reality, Science and Education
MVisio: A Computer Graphics Platform for Virtual Reality, Science and EducationMVisio: A Computer Graphics Platform for Virtual Reality, Science and Education
MVisio: A Computer Graphics Platform for Virtual Reality, Science and Education
 
P5 cloud economics_v1
P5 cloud economics_v1P5 cloud economics_v1
P5 cloud economics_v1
 
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdf
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdfAWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdf
AWS Cloud Value Framework - AWS Transformation Days Raleigh 2018.pdf
 
Microservices with .Net - NDC Sydney, 2016
Microservices with .Net - NDC Sydney, 2016Microservices with .Net - NDC Sydney, 2016
Microservices with .Net - NDC Sydney, 2016
 
CCCC Neustar Lenny Rachitsky
CCCC Neustar Lenny RachitskyCCCC Neustar Lenny Rachitsky
CCCC Neustar Lenny Rachitsky
 
What does performance mean in the cloud
What does performance mean in the cloudWhat does performance mean in the cloud
What does performance mean in the cloud
 
Business Case Calculator for DevOps Initiatives - Leading credit card service...
Business Case Calculator for DevOps Initiatives - Leading credit card service...Business Case Calculator for DevOps Initiatives - Leading credit card service...
Business Case Calculator for DevOps Initiatives - Leading credit card service...
 

Kürzlich hochgeladen

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 

Kürzlich hochgeladen (20)

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 

Cloud auto-scaling with deadline and budget constraints

  • 1. Ming Mao, Jie Li, Marty Humphrey eScience Group CS Department, University of Virginia Grid 2010 – Oct 27, 2010
  • 2. A fast growing computing platform  IDC - Cloud spending increases 27.4% a year to $56 billion (compared 5% a year of traditional IT)  $16.5 billion (2009) -> $55.5 billion (2014) src: Worldwide and Regional Public IT Cloud Service 2010-2014 Forecast  Two most quoted benefits  Scalable computing and storage  Reduced cost  Concerns  Security, availability, cost management, integration interoperability, etc.
  • 3. Q1. Cost – the most important factor in practice? Rate the benefits commonly ascribed to the How important is it that Cloud service providers... cloud on-demand model Offer competitive pricing 91.60% Pay only for what you use 77.90% Offer Service Level Agreements 88.60% Easy/fast to deply to end-users Option to move cloud offerings back on premise 87.80% 77.70% Provide a complete solution 86.00% Monthly payments 75.30% Understand my business and industry 84.50% Encourages standard systems 68.50% Allow managing on-premise & cloud together 82.10% Requires less in-house IT staff, costs 67.00% Support many of my IT needes 81.00% Alwasys offers latest functionality 64.60% Offer both on-premise and public cloud services 79.20% Sharing systems with partners simpler 63.90% Are a technology and business model innovator 78.30% Seems like the way of future 54.00% Have local presence, can come to my offices 72.90% 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% Source: IDC Enterprise Panel, 3Q09, n = 263, Sep 2009 Source: IDC Enterprise Panel, 3Q09, n = 263, Sep 2009  Q2. Moving into Cloud == Reduced Cost ?
  • 4. Resource utilization information based triggers (e.g. AWS auto-scaling, RightScale, enStratus, Scalr, etc)
  • 5. Multiple instance types  Current billing models  Full hour billing  Non-ignorable instance acquisition time  7-15 min in Windows Azure  More specific performance goals  Budget awareness (e.g. dollars/month, dollars/job)
  • 6. Cloud  Deadline Users Application (Job finish time) Cloud Server Cost Job  Problem Statement – how to enable cloud applications to finish all the submitted jobs before user specified deadline with as little money as possible using auto-scaling.
  • 7. Workload are non-dependent jobs submitted in the job queue  FCFS manner and fairly distributed  Different classes of jobs  Same performance goal (e.g.1 hour deadline)  VM instances take time to startup
  • 8. ni tj I Vii,i j d Key variables used in the model
  • 9. Workload W  (J j , nj )  Computing Power of Instance I i D  nj P  (J j , ) Running Instance  i j t j ,type ( Ii ) n j ( D  (dtype ( Ii )  si ))  n j P  (J j , )  j t j ,type ( Ii ) n j Pending Instance i
  • 10. Scale up  Sufficient budget Min(i ctype ( Ii ') )  P ' W  P i i  Insufficient budget Max( Pi ') c i type ( Ii ')  C  i ctype ( Ii )  Scale down  P  P W i i s
  • 11. Workload Required Computing Power j1 :  x  60 10  10   40 j1 : 10  10  10  x   45 j2 :  y   60   5    20  35           j2 : n1 '  5   n2 ' 20  n3 ' 10   y   35           j3 :  z  60  20  5  35           j3 :  20 5 10  z  35           P' W I1 I2 V1 V2 V3 P' Min(c1n1 ' c2 n2 ' c3n3 ') where c1n1 ' c2 n2 ' c3n3 ' ctype( I1 )  ctype( I2 )  C
  • 12. Cloud Cruise Control notify Decider admin Min( i ctype ( Ii ') ) &  Pj '  W  P dynamic j configuration vm plan VM Monitor Repository Manager +, – Config workload update update vm info enqueue Historical VM instances Data users dequeue
  • 13. Workload & VM simulation parameters Mix Computing IO Intensive Avg 30 jobs/hour Intensive Avg 30 jobs/hour STD 5 jobs/hour Avg 30 jobs/hour STD 5 jobs/hour STD 5 jobs/hour General Average 300s Average 300s Average 300s 0.085$/hour STD 50s STD 50s STD 50s Delay 600s High-CPU Average 210s Average 75s Average 300s 0.17$/hour STD 25s STD 15s STD 50s Delay 720s High-IO Average 210s Average 300s Average 75s 0.17$/hour STD 25s STD 50s STD 15s Delay 720s
  • 14. Stable Worload & Changing Deadline Response (sec) Utilization (%) 100.00% 7000 90.00% 6000 80.00% 5000 70.00% 60.00% 4000 50.00% 3000 40.00% 2000 30.00% 20.00% 1000 10.00% 0 0.00% 0 10 20 30 40 50 60 70 80 Time (hour) utilization deadline avg max min
  • 15. Changing Workload & Fixed Deadline Response (sec) Worload (job/h) 4000 350 3500 300 3000 250 2500 200 2000 150 1500 100 1000 500 50 0 0 0 10 20 30 40 50 60 70 80 Time (hour) deadline avg max min workload
  • 16. VM Types Total Cost ($) % more than optimal Choice #1 General 98.52$ (43%) Choice #2 High-CPU 128.86$ (87%) Choice #3 High-IO 129.71$ (88%) Choice #4 General, High-CPU, High-IO 78.62$ (14%) Optimal General, High-CPU, High-IO 68.85$
  • 17. MODIS 200X – Year Terra & Aqua – Satellite (X - Y) – Day X to day Y 15 images / day Moderate scale test (up to 20 instances) 1hour deadline 2hour deadline 3hour deadline Terra 2004(10-12) 18 min late 8 min early 20 min early Total 45 jobs 9 C.H.or 1.08$ 6 C.H or 0.72$ 5 C.H.or 0.6$ 4 C.H.* or 0.48$ Aqua 2008(30-32) 15min late 20 min early 29 min early Total 45 jobs 10 C.H or 1.2$ 7 C.H.or 0.84$ 5 C.H.or 0.6$ 4 C.H. or 0.48$ Large Scale test (up to 90 instances) 2 hour deadline 4 hour deadline Terra & Aqua 2006(1-75) 20min late 6 min early Total 1125 jobs 170 C.H. or 20.4$ 132 C.H. or 15.84$ 93 C.H. or 11.16$ Terra & Aqua 2006(1-150) Admission Denied 22 min early Total 2250 jobs 243 C.H. or 29.16$ 185 C.H. or 22.2$ * C.H. – computing hour 1C.H. = 0.12$ in Windows Azure
  • 18. Test: Terra & Aqua 2006(1-75) - total 1125 jobs 6min early theoretical cost - 93 C.H. or 11.16$ actual cost - 132 C.H. or 15.84$ Instance Acquisition and Release 40 38 36 34 32 30 28 26 Instance Number 24 22 20 18 16 14 12 10 8 6 4 2 0 0 1 2 3 4 5 Time (hour) Released Acquiring Ready
  • 19. Conclusions  More cost-efficient than fixed-size instance choice  VM startup delay can affect hugely in practice  Future works  More general cloud application model  Multiple job classes  Consider other instance types (e.g. spot instances & reserved instances)  Data transfer performance and storage cost