SlideShare ist ein Scribd-Unternehmen logo
1 von 51
Energy Efficiency in Large Scale Systems Gaurav Dhiman, Raid Ayoub Prof. Tajana ŠimunićRosing Dept. of Computer Science
Large scale systems: Clusters Power consumption is a critical  	design parameter: Operational costs ,[object Object]
CoolingBy 2010, US electricity bill for powering and cooling data centers ~$7B[1] Electricity input to data centers in the US exceeds electricity consumption of Italy! [1]: Meisner et al, ASPLOS 2008 2
Energy Savings with DVFS Reduction in CPU power Extra system power
Effectiveness of DVFS For energy savings ER > EE Factors in modern systems affecting this equation: Performance delay (tdelay) Idle CPU power consumption (PE) Power consumption of other devices (PE)
Performance Delay Lower tdelay=> higher energy savings Depends on memory/CPU intensiveness Experiments with SPEC CPU2000 mcf: highly memory intensive Expect low tdelay sixtrack: highly cache/CPU intensive Expect high tdelay Two state of the art processors AMD quad core Opteron On die memory controller (2.6GHz), DDR3 Intel quad core Xeon Off chip memory controller (1.3GHz), DDR2
Performance Delay mcf much closer to best case on Xeon mcf much closer to worst case on AMD Due to on die memory controller and fast DDR3 memory Due to slower memory controller and memory
Idle CPU power consumption Low power idle CPU states common now C1 state used be default Zero dynamic power consumption Support for deeper C-states appearing C6 on Nehalem Zero dynamic+leakage power ,[object Object]
Lower DVFS benefits,[object Object]
Lower DVFS benefits for memory intensive benchmarks,[object Object]
Methodology Run SPEC CPU2000 benchmarks at all v-f settings Estimate savings baselined against system with PM-(1:3) policies ,[object Object]
DVFS beneficial if:
%EsavingsPM-i > 0,[object Object]
On die memory controller,[object Object]
High perf delay ,[object Object]
Avg 7% savings
Avg 200% delay ,[object Object]
 Lower system idle power consumption,[object Object]
Server Power Breakdown
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  Doing nothing well …NOT! Energy Efficiency = Utilization/Power Figure 2. Server power usage and energy efficiency at varying utilization levels, from idle to  peak performance. Even an energy-efficient server still consumes about half its full power when doing virtually no work. 17
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  It is surprisingly hardto achieve high levelsof utilization of typical servers (and your homePC or laptop is even worse) Figure 1. Average CPU utilization of more than 5,000 servers during a six-month period. Servers  are rarely completely idle and seldom operate near their maximum utilization, instead operating  most of the time at between 10 and 50 percent of their maximum 18
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  Doing nothing  VERY well Design for  wide dynamic  power range and  active low power modes Energy Efficiency = Utilization/Power Figure 4. Power usage and energy efficiency in a more energy-proportional server. This  server has a power efficiency of more than 80 percent of its peak value for utilizations of  30 percent and above, with efficiency remaining above 50 percent for utilization levels as  low as 10 percent. 19
Why not consolidate servers? Security Isolation Must use the same OS Solution: Use virtualization!
Virtualization Benefits: ,[object Object]
Different OS in each VM
Better resource utilization21
Virtualization Benefits: ,[object Object]
Dynamic load management
Energy savings through VM consolidation!22
How to Save Energy? VM consolidation is a common practice: Increases resource utilization Turn idle machines into sleep mode Active machines? Active power management: e.g. DVFS less effective in newer line of server processors  Leakage, faster memories, low voltage range Make the workload run faster Similar average power across machines Exploit workload characteristics to share resources efficiently 23
Motivation: Workload Characterization VM1 VM2 PM1 mcf 60% PM2 eon 24
Motivation: Workload Characterization 50W Workload characteristics determine: Power/performance profile Power distribution Co-schedule/consolidate heterogeneous VMs 25
Motivation: Workload Characterization Co-schedule/consolidate heterogeneous VMs 26
What about DVFS? 80% 40% 9% Poor performance  Energy inefficient Only good if homogeneously high MPC workload 27
vGreen A system for VM scheduling across a cluster of physical machines Dynamic VM characterization: ,[object Object]
Instruction throughput
CPU utilizationCo-schedule VMs with heterogeneous characteristics for better:  Performance Energy efficiency Balanced thermal profile 28
Scheduling with VMs VM1 VM2 VM1 Dom0 VM2 Xen Scheduler ,[object Object]
Management
I/O
VM Creation:
Specify CPU, memory, I/O config
CPU of VM referred to as VCPU:
Fundamental unit of executionVCPU2 VCPU1 VCPU2 VCPU1 ,[object Object]
Xen schedules VCPUs across PCPUs29
vGreen Architecture Main Components: ,[object Object]
vgxen: Characterizes the running VMs
vgdom: Exports information to vgservvgserv vgpolicy Updates Updates Commands ,[object Object]
Collects and analyzes the characterization information
Issues scheduling commands based on balancing policyvgdom vgdom VM1 Dom0 VM2 VM1 Dom0 VM2 Xen vgxen Xen vgxen vgnode1 vgnode2 30
vgnode (client physical machine) vgdom vgxen: characterizes the VMs ,[object Object]

Weitere ähnliche Inhalte

Was ist angesagt?

Calculating total power
Calculating total powerCalculating total power
Calculating total power
jeet69
 
D1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patternsD1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patterns
Babak Sorkhpour
 

Was ist angesagt? (12)

DigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentalsDigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentals
 
E3 s binghamton
E3 s binghamtonE3 s binghamton
E3 s binghamton
 
Data Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure OptimizationData Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure Optimization
 
Data center power infrastructure
Data center power infrastructureData center power infrastructure
Data center power infrastructure
 
Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010
 
hp power capping and hp dynamic
hp power capping and hp dynamic hp power capping and hp dynamic
hp power capping and hp dynamic
 
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
 
Calculating total power
Calculating total powerCalculating total power
Calculating total power
 
D1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patternsD1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patterns
 
Frequency regulation of deregulated power system
Frequency regulation of deregulated power systemFrequency regulation of deregulated power system
Frequency regulation of deregulated power system
 
CA Nimsoft ecoMeter
CA Nimsoft ecoMeterCA Nimsoft ecoMeter
CA Nimsoft ecoMeter
 
Stmm white paper
Stmm white paperStmm white paper
Stmm white paper
 

Andere mochten auch

Designing Ultra Large Scale Systems List
Designing Ultra Large Scale Systems ListDesigning Ultra Large Scale Systems List
Designing Ultra Large Scale Systems List
Crafitti Consulting
 
Principles of microservices velocity
Principles of microservices   velocityPrinciples of microservices   velocity
Principles of microservices velocity
Sam Newman
 

Andere mochten auch (7)

Deployability
DeployabilityDeployability
Deployability
 
Architecting for the cloud elasticity security
Architecting for the cloud elasticity securityArchitecting for the cloud elasticity security
Architecting for the cloud elasticity security
 
Acm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale SystemsAcm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale Systems
 
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
 NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT... NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
 
Designing Ultra Large Scale Systems List
Designing Ultra Large Scale Systems ListDesigning Ultra Large Scale Systems List
Designing Ultra Large Scale Systems List
 
Architectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale SystemsArchitectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale Systems
 
Principles of microservices velocity
Principles of microservices   velocityPrinciples of microservices   velocity
Principles of microservices velocity
 

Ähnlich wie Energy Efficiency in Large Scale Systems

MRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud ComputingMRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud Computing
Roger Rafanell Mas
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing edited
m_hughes
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
Windows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper VWindows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper V
Amit Gatenyo
 

Ähnlich wie Energy Efficiency in Large Scale Systems (20)

CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304
 
Windows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalWindows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__final
 
Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9
 
Hitec Brochure QPS English_LR
Hitec Brochure QPS English_LRHitec Brochure QPS English_LR
Hitec Brochure QPS English_LR
 
Emerson Energy Logic
Emerson Energy LogicEmerson Energy Logic
Emerson Energy Logic
 
Runtime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC ApplicationsRuntime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC Applications
 
A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy
 
Virtualización para la Eficiencia
Virtualización para la EficienciaVirtualización para la Eficiencia
Virtualización para la Eficiencia
 
DataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure PresentationDataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure Presentation
 
E03403027030
E03403027030E03403027030
E03403027030
 
MRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud ComputingMRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud Computing
 
Green It
Green ItGreen It
Green It
 
KVM Tuning @ eBay
KVM Tuning @ eBayKVM Tuning @ eBay
KVM Tuning @ eBay
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing edited
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
GUI overhead
GUI overheadGUI overhead
GUI overhead
 
Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environments
 
Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...
 
Build Energy Saving into Your Datacenter
Build Energy Saving into Your DatacenterBuild Energy Saving into Your Datacenter
Build Energy Saving into Your Datacenter
 
Windows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper VWindows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper V
 

Mehr von Jerry Sheehan

Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5
Jerry Sheehan
 

Mehr von Jerry Sheehan (14)

Cisco educause
Cisco educauseCisco educause
Cisco educause
 
Educause-Presidential/CIO
Educause-Presidential/CIOEducause-Presidential/CIO
Educause-Presidential/CIO
 
Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...
 
Technology, Complexity, and Change: The Creative Frictions of Today
Technology, Complexity, and Change:  The Creative Frictions of TodayTechnology, Complexity, and Change:  The Creative Frictions of Today
Technology, Complexity, and Change: The Creative Frictions of Today
 
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
 
Best Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from ScratchBest Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from Scratch
 
Project GreenLight
Project GreenLightProject GreenLight
Project GreenLight
 
DC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy EfficiencyDC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy Efficiency
 
GreenLight Data Collection Architecture
GreenLight Data Collection ArchitectureGreenLight Data Collection Architecture
GreenLight Data Collection Architecture
 
GreenLight CENIC Award
GreenLight CENIC AwardGreenLight CENIC Award
GreenLight CENIC Award
 
Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5
 
GreenLight: GLIF 2009
GreenLight:  GLIF 2009GreenLight:  GLIF 2009
GreenLight: GLIF 2009
 
GreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute WorkshopGreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute Workshop
 
GreenLight Project Overview
GreenLight Project OverviewGreenLight Project Overview
GreenLight Project Overview
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Energy Efficiency in Large Scale Systems

  • 1. Energy Efficiency in Large Scale Systems Gaurav Dhiman, Raid Ayoub Prof. Tajana ŠimunićRosing Dept. of Computer Science
  • 2.
  • 3. CoolingBy 2010, US electricity bill for powering and cooling data centers ~$7B[1] Electricity input to data centers in the US exceeds electricity consumption of Italy! [1]: Meisner et al, ASPLOS 2008 2
  • 4. Energy Savings with DVFS Reduction in CPU power Extra system power
  • 5. Effectiveness of DVFS For energy savings ER > EE Factors in modern systems affecting this equation: Performance delay (tdelay) Idle CPU power consumption (PE) Power consumption of other devices (PE)
  • 6. Performance Delay Lower tdelay=> higher energy savings Depends on memory/CPU intensiveness Experiments with SPEC CPU2000 mcf: highly memory intensive Expect low tdelay sixtrack: highly cache/CPU intensive Expect high tdelay Two state of the art processors AMD quad core Opteron On die memory controller (2.6GHz), DDR3 Intel quad core Xeon Off chip memory controller (1.3GHz), DDR2
  • 7. Performance Delay mcf much closer to best case on Xeon mcf much closer to worst case on AMD Due to on die memory controller and fast DDR3 memory Due to slower memory controller and memory
  • 8.
  • 9.
  • 10.
  • 11.
  • 13.
  • 14.
  • 15.
  • 17.
  • 18.
  • 20. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 Doing nothing well …NOT! Energy Efficiency = Utilization/Power Figure 2. Server power usage and energy efficiency at varying utilization levels, from idle to peak performance. Even an energy-efficient server still consumes about half its full power when doing virtually no work. 17
  • 21. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 It is surprisingly hardto achieve high levelsof utilization of typical servers (and your homePC or laptop is even worse) Figure 1. Average CPU utilization of more than 5,000 servers during a six-month period. Servers are rarely completely idle and seldom operate near their maximum utilization, instead operating most of the time at between 10 and 50 percent of their maximum 18
  • 22. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 Doing nothing VERY well Design for wide dynamic power range and active low power modes Energy Efficiency = Utilization/Power Figure 4. Power usage and energy efficiency in a more energy-proportional server. This server has a power efficiency of more than 80 percent of its peak value for utilizations of 30 percent and above, with efficiency remaining above 50 percent for utilization levels as low as 10 percent. 19
  • 23. Why not consolidate servers? Security Isolation Must use the same OS Solution: Use virtualization!
  • 24.
  • 25. Different OS in each VM
  • 27.
  • 29. Energy savings through VM consolidation!22
  • 30. How to Save Energy? VM consolidation is a common practice: Increases resource utilization Turn idle machines into sleep mode Active machines? Active power management: e.g. DVFS less effective in newer line of server processors Leakage, faster memories, low voltage range Make the workload run faster Similar average power across machines Exploit workload characteristics to share resources efficiently 23
  • 31. Motivation: Workload Characterization VM1 VM2 PM1 mcf 60% PM2 eon 24
  • 32. Motivation: Workload Characterization 50W Workload characteristics determine: Power/performance profile Power distribution Co-schedule/consolidate heterogeneous VMs 25
  • 33. Motivation: Workload Characterization Co-schedule/consolidate heterogeneous VMs 26
  • 34. What about DVFS? 80% 40% 9% Poor performance Energy inefficient Only good if homogeneously high MPC workload 27
  • 35.
  • 37. CPU utilizationCo-schedule VMs with heterogeneous characteristics for better: Performance Energy efficiency Balanced thermal profile 28
  • 38.
  • 40. I/O
  • 42. Specify CPU, memory, I/O config
  • 43. CPU of VM referred to as VCPU:
  • 44.
  • 45. Xen schedules VCPUs across PCPUs29
  • 46.
  • 48.
  • 49. Collects and analyzes the characterization information
  • 50. Issues scheduling commands based on balancing policyvgdom vgdom VM1 Dom0 VM2 VM1 Dom0 VM2 Xen vgxen Xen vgxen vgnode1 vgnode2 30
  • 51.
  • 53.
  • 55. Exports to vgservwMPC wIPC util wMPC wIPC util wMPC wIPC util wMPC wIPC util 31 VCPU1 VCPU2 VCPU1 VCPU2
  • 56. Hierarchical Workload Characterization nMPC nIPC nutil Node Level Metrics (maintained by vgpolicy) VGNODE VM Level Metrics (maintained by vgpolicy and vgxen) vMPC vIPC vutil vMPC vIPC vutil VM1 VM2 VCPU Level Metrics (maintained by vgxen) wMPC wIPC util wMPC wIPC util wMPC wIPC util wMPC wIPC util 32 VCPU1 VCPU2 VCPU1 VCPU2
  • 57.
  • 58. MPC: performance and energy efficiency
  • 59.
  • 60.
  • 61. Migrate if it does not reverse imbalanceVM1 VM2 VM2 VM1 nMPC > nMPCth vgnodenMPCmin 34
  • 62. Implementation Xen 3.3.1 as the hypervisor vgxen implemented as part of the stock Xen credit scheduler vgdom implemented as a driver and application in Domain0 Communicates with vgxen through a shared page No modifications required to the guest OS! Used a testbed of Dual Intel Quad core Xeon based machines as vgnodes Linux based desktop used as vgserv vgdom VM1 Dom0 VM2 Xen vgxen 35
  • 63.
  • 64. Compare against ‘E+’: Eucalyptus + state of the art dynamic VM scheduling algorithms
  • 65. Perform VM consolidation based on CPU utilization36
  • 66. Weighted Speedup vs E+ Average 40% Weighted Speedup 20% speedup on average 37
  • 67. Energy Savings vs E+ Average 35% Energy Savings 38
  • 68. Balanced Thermal Profile Average power variance reduction of 30W 39
  • 69.
  • 70. How to minimize the cooling costs within a single machine?
  • 71. How to further reduce the cooling costs by creating a better temperature distribution across the physical machines1U server CPU CPU Fan subsystem
  • 72.
  • 73.
  • 74.
  • 75.
  • 77. Provides better stabilityReactive approach  Lowers cooling savings  Cannot minimize the noise level Impacts fan stability Challenge: Design of efficient proactive dynamic cooling aware workload management technique
  • 78.
  • 79. Migrate some of the active threads from the sockets with high fan speed to sockets with lower speed
  • 80. Swap some of the hot threads from sockets with high fan speed with colder threads from sockets with lower speed.VPW VPA VPA VPC VPw VPY VPC VPY VPX VPB VPD VPZ VPZ VPB VPX VPD High speed Low speed Moderate speed Moderate speed
  • 81.
  • 82. If Fan speedM≥Fan speedN, we can swap the hot thread from socket N with colder threads from socket MVPA VPW VPW VPY VPA VPC VPC VPY PW ≤ PC+PD VPB VPD VPD VPX VPB VPX Moderate speed Moderate speed Moderate speed Low speed 46 46
  • 83.
  • 84. VM management at the physical machine level
  • 85. VP management at the CPU socket levelVM migration Thread migration VP1 VP2 VP3 VP3 VP5 VP4 VP6 VP4 VP1 VP3 VP2 VP2 VP4 VP6 VP5 VP4 Low speed Moderate speed High speed Moderate speed Server i Server j 47
  • 86.
  • 87. VMs at the machine level
  • 88. VPs at the socket level
  • 90. Period ~ minutes @ the VM level
  • 91. Period ~ seconds @ the VP level Mark if savings exist Traverse VMs/VPs Schedule Evaluate Consolidation Savings Mark if savings exist 48
  • 92.
  • 93.
  • 94. Dynamic load balancing minimizes the differences in task queues across various levels49 K. Skadron, et al. Temperature-aware microarchitecture, ISCA 2003.
  • 95.

Hinweis der Redaktion

  1. This figure shows a typical fan controller that is based on a classical close-loop approach. The fan controller decides the required fan speed. The output of the controller is fed to the actuator to actually adjust the fan speed. The feedback is collected using thermal sensors (each CPU core has a dedicated thermal sensor) where the fan speed is in proportional to the highest temperature <click> The cooling optimizations techniques up until now focused mainly on the fan controller without including workload management which we show later that including workload management can results in a big cooling savings
  2. Current load balancing do not consider cooling costs <click> Read the example to the audience (stop when you reach the equation) [The figure is the visual representation of the example]. In this figure we show a case of dual sockets (each socket has 4 cores where each runs 1 (thr=workload thread or job)<click> Thermal imbalance leads to cooling inefficiencies due to “cubic relation between fan speed and power” <click> This indicate that better workload assignment can improve the thermal distribution and lower cooling cost. The question is HOW and WHEN to schedule the workload
  3. We utilize the freedom in migrating the workload around to perform cooling aware workload scheduling to minimize the cooling costs<click> The good news is that the migration overhead of the threads between sockets is minor since the temperature change is quite slow (order of sec) compared to the migration time (order of micro sec)<click> In this example we show a case of thermal imbalance between two sockets (one fan run at high speed while the other at low speed)<click> The challenge is which threads to migrate to get a better thermal and cooling balance. Then read the second bullet in the yellow box
  4. The question that we need to answer is “when we should trigger the workload rescheduling”One way is to employ a reactive approach that acts when the system is in cooling inefficiency condition. The problem with this approach is that mitigating the inefficiencies require time (temperature changes slowly) which impacts the cooling savings, noise and may generate instability in the fan system <click> The alternative way is to use proactive researching that predict then avoid cooling inefficiencies at earlier point in time and reschedule accordingly. Read quickly the benefits in the green box<click> Read the challenge sentence
  5. In this slide and the following one we illustrate the fundamental ways to deliver cooling savings: This slide explains “spreading the hot threads” concept to obtain cooling savings through creating a better temperature distribution across the CPU sockets. This technique needs to be applied when there is an imbalance in the heat sink temperature across the CPU sockets. To implement job spreading we can employ either job migration or swapping (read the two bullets briefly). <click> The example in the bottom clearly shows how spreading works. In the left side we have a case of big imbalance. To solve the imbalance we swap the hot threads (C,D) with the colder ones (W,X). The two fans now run at a moderate speed (savings is expected due to the cubic relation between fan power and speed)
  6. Here we illustrate the second way to obtain cooling savings. The motivation is to concentrate more hot threads into fewer sockets while keeping their fan speed in almost the same. We apply this method when the average temperature across sockets is in similar range (it should be noted that consolidation is not opposite to the spreading but it can be applies on top of it)Consolidation can be implemented in two ways:Squeezing more hot jobs to the fan that is running more that what it should be (fan speeds is discrete, e.g 8 or 16 speeds)<click> The other way is to trade a (hot thread) from the socket that have lower fan with (colder threads but have similar total power) from the socket with higher fan speed to maintain temperature balance. This help lowering the fan speed of the socket that receives the cold threads while keeping the higher fan speed almost the same. The example below illustrate this case