Interested in learning more about Cloud?

       Look at the Cloud sessions offered at the upcoming Fall 2012 Data Center World Conference at:

       www.datacenterworld.com.




This presentation was given during the Spring 2012 Data Center World Conference and Expo. Contents are owned by
AFCOM and Data Center World and can only be reused with the express permission of AFCOM. For questions or permission, contact:
jater@afcom.com.
How to Design a Scalable
Private Cloud

                Mark Sand
                Datacenter Architect
                Citrix Systems Inc.
Defining the Private & Public Clouds

• Private vs. Public Clouds (Infrastructure as a Service - IaaS)
 • The private cloud is a virtual environment deployed within an organization that
   is restricted to users within the company and usually resides behind the
   corporate firewall. The private cloud also includes an easy-to-use web portal
   that allows end users to auto-provision and manage the lifecycle of their VMs,
   and may or may not incorporate a chargeback model.

 • The public cloud is a virtual environment that is publicly available for any
   consumer to purchase computing resources, usually on a pay-per-use basis.
   Consumers purchase, manage, and monitor the lifecycle of their VMs through
   an easy-to-use web portal.
Designing the Cloud Infrastructure

• Proper planning and design are critical components to
  successfully implementing a scalable Cloud environment
• Here are some key design areas that we will address:
 • Capacity planning and sizing
 • Virtual Platform (hypervisor)
 • Datacenter locations (will this be a global Cloud or hosted from one DC?)
 • Networking
 • SAN (NAS/Fibre)
 • Server Hardware
 • Power
 • Monitoring & Management Solutions
 • Documenting the solution
Capacity Planning and Sizing the Environment

• Accurate capacity planning and sizing will ensure that you
  implement a scalable, supportable, and successful
  environment
• Key sizing criteria:
 • Number of VMs you are looking to host per virtual server
 • Number and types of clusters/pools
 • Estimated yearly growth for VMs
 • Amount of storage required to host all of the VMs for current and future growth
 • Amount of estimated network bandwidth required to host the VMs for current
   and future growth
Current Capacity and Sizing Example

• Cluster/pool(s) configuration:
 • We support a mix of 2, 4, 8, and 16 GB VMs in each of our cluster/pool(s)
 • We average approximately 20 VMs per host

             Cluster/Pool   Number of Hosts   Total Storage
             Production           20             20 TB
             QA                    8              9.5 TB
             Dev                  11             15 TB
             DMZ                   6              4 TB
             DR                   15              4 TB
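The totals implied by the table can be checked with a short calculation. This is only a sketch: the host and storage figures are copied from the table and the 20 VMs per host average from the bullet above it. The variable names are illustrative, and the VM count is a rough estimate that assumes the stated average holds in every pool.

```python
# Host counts and storage (TB) per cluster/pool, copied from the table above.
pools = {
    "Production": {"hosts": 20, "storage_tb": 20.0},
    "QA": {"hosts": 8, "storage_tb": 9.5},
    "Dev": {"hosts": 11, "storage_tb": 15.0},
    "DMZ": {"hosts": 6, "storage_tb": 4.0},
    "DR": {"hosts": 15, "storage_tb": 4.0},
}

# Stated average; assumed here (for illustration) to apply to every pool.
AVG_VMS_PER_HOST = 20

total_hosts = sum(p["hosts"] for p in pools.values())
total_storage_tb = sum(p["storage_tb"] for p in pools.values())
estimated_vms = total_hosts * AVG_VMS_PER_HOST

print(f"hosts={total_hosts}, storage={total_storage_tb} TB, est. VMs={estimated_vms}")
```

With these inputs the environment comes to 60 hosts and 52.5 TB of storage, or roughly 1,200 VMs if the average applies uniformly.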
Current Capacity and Sizing cont.

• Average Yearly Growth Statistics:
 • VMs account for approximately 85% of our yearly server growth
 • We add approximately 5-10 TB of storage per year (spread across all cluster/pool(s))
 • We have not needed to add any additional network bandwidth since the
   environment was implemented
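A growth range like this can be turned into a simple forecast. The sketch below assumes linear growth at the stated 5 to 10 TB per year and a starting point of 52.5 TB (the sum of the per-pool storage figures in the sizing example); the function name and the three-year horizon are illustrative choices, not part of the original design.

```python
def project_storage_tb(current_tb, yearly_low_tb, yearly_high_tb, years):
    """Return (low, high) projected storage in TB after `years` of linear growth."""
    low = current_tb + yearly_low_tb * years
    high = current_tb + yearly_high_tb * years
    return low, high

# 52.5 TB is the sum of the per-pool storage in the sizing example (assumed baseline).
low, high = project_storage_tb(52.5, 5, 10, years=3)
print(f"After 3 years: {low} TB to {high} TB")
```

Sizing new arrays against the high end of the range leaves headroom if growth accelerates.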
Virtual Platform & Datacenter Locations

• Selecting the proper virtual platform (hypervisor):
 • Several hypervisors are available, each with benefits and drawbacks, so
   each organization should choose the one that best fits its needs

• Datacenter Locations:
 • Determine if the cloud will be hosted from several global datacenters or if it will
   be hosted from one central datacenter
 • If the cloud will be hosted from different locations then it is also important to
   follow a set of standards for each of the areas we will be talking about (network,
   storage, server HW, etc.)
Datacenter Locations Example

• US Private Cloud
 • We currently have a large private cloud environment that is hosted out of our
   corporate datacenter as well as a smaller private cloud that is hosted in two
   additional datacenters in the US

• Global Private Cloud
 • We currently have a private cloud environment in three of our regional
   datacenters

• Global Standards
 • We have standardized on the same server hardware/configuration
   and networking devices for the global private cloud; however, we
   were required to create two different storage standards
Network Design

• Define the type of uplinks that will be used:
 • 1 Gb uplink
 • Multiple 1 Gb uplinks configured as a port channel
 • 10 Gb uplink

• Number/type of uplinks for each of the hosts functions:
 • Virtual Server Management Interface
 • VM traffic
 • NFS/iSCSI traffic for environments utilizing NAS

• Utilize redundant uplinks from separate switches
• Evaluate the proper size & number of VLANs required
Network Description Example

• Network Components
 • Management Network
   • 2 x switches with 2 x 1 Gb uplinks connected to each switch. Each switch is connected to a different
    distribution layer switch to ensure network redundancy
 • VM Traffic
   • 2 x blade switches with 4 x 1 Gb uplinks, with a 2 Gb port channel connected to each switch.
    We have three dedicated /24 VLANs for new VMs, and we also trunk existing VLANs to the switches in
    order to account for servers that were P2V'ed and are unable to change their IP address
 • Storage Traffic (regional datacenters only)
   • 2 x blade switches with 4 x 1 Gb uplinks, with a 2 Gb port channel connected to each switch
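To judge whether the VLANs and uplinks are sized appropriately, the address capacity and nominal bandwidth can be estimated as follows. This is a sketch under assumptions: the three subnets are hypothetical stand-ins for the three dedicated /24 VLANs, and the helper names are illustrative. It uses Python's standard `ipaddress` module.

```python
import ipaddress

def usable_hosts(cidr):
    """Usable IPv4 host addresses in a subnet (excludes network and broadcast)."""
    net = ipaddress.ip_network(cidr)
    return net.num_addresses - 2

def port_channel_gb(links, link_speed_gb=1):
    """Nominal aggregate bandwidth of a port channel, in Gb/s."""
    return links * link_speed_gb

# Hypothetical subnets standing in for the three dedicated /24 VLANs.
vm_vlans = ["10.10.1.0/24", "10.10.2.0/24", "10.10.3.0/24"]
total_vm_addresses = sum(usable_hosts(v) for v in vm_vlans)

print(f"Usable VM addresses across VLANs: {total_vm_addresses}")
print(f"2 x 1 Gb port channel: {port_channel_gb(2)} Gb/s nominal")
```

Gateway and other reserved addresses reduce the usable count further, and a port channel's aggregate figure is nominal: a single traffic flow is typically limited to the speed of one member link.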
Network Diagram Example – Corporate DC




       Note: Storage is connected via an HBA to our fibre channel SAN (not depicted here)
Network Diagram Example – Regional DCs




         Note: Storage for the regional servers is connected to our NAS via NFS
SAN/NAS Design
• NAS vs. Fibre arrays:
 • Each technology has benefits and drawbacks, so each organization should
   choose whichever option best fits their needs

• Define a standard LUN size
• Define a standard naming convention when creating
  LUN/volume(s)
• For NAS, configure a dedicated VLAN for VM
  storage traffic
• For fibre channel SANs, ensure that you have two
  independent SAN fabrics (A & B) and utilize multipathing
Storage Example – Corporate DC

• Two fully populated blade enclosures connect into our fibre
  channel SAN via 4 Gb SAN switches
• We standardized on 2 TB LUNs (storage repositories / data
  stores) in our corporate DC and 1 TB LUNs regionally
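A standard LUN size and naming convention make it easy to work out how many LUNs a pool needs and what to call them. The sketch below is illustrative only: the `site-pool-lunNN` naming pattern and the function name are hypothetical, not the convention used in this environment. The 2 TB size and the 20 TB Production figure come from the examples in this deck.

```python
import math

def plan_luns(pool, site, capacity_tb, lun_size_tb):
    """Return the LUN names needed to cover capacity_tb with fixed-size LUNs."""
    count = math.ceil(capacity_tb / lun_size_tb)
    # Hypothetical naming pattern: <site>-<pool>-lun<two-digit index>
    return [f"{site}-{pool}-lun{n:02d}" for n in range(1, count + 1)]

# 20 TB of Production storage carved into the 2 TB corporate standard.
luns = plan_luns("prod", "corp", capacity_tb=20, lun_size_tb=2)
print(len(luns), luns[0], luns[-1])
```

Rounding up means a pool such as QA (9.5 TB) would need five 2 TB LUNs, with the last one only partly used.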
Storage Diagram Example – Corporate DC
Server Hardware

• Scale Out vs. Scale Up Methodologies
• Scale Out - several host servers are configured with standard to moderate
  virtualization specs (2 x CPUs & 48 to 128 GB of RAM) that make up a pool/cluster
 • Pros: The servers are less expensive, so you can usually grow the pool faster, and fewer VMs
   go down if a server fails
 • Cons: There are more servers to manage in each pool/cluster
• Scale Up - only a few host servers are configured with large virtualization specs (4 CPUs
  or greater & 128 GB of RAM or greater) that can handle a large number of VMs
 • Pros: You can run a large number of VMs on the host server due to the vast resources each server has available
 • Cons: The servers are costly, so you will likely not be able to grow the pool/cluster as fast, and a host
   failure potentially takes down more VMs
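The outage trade-off between the two approaches can be illustrated with simple arithmetic. In the sketch below, the 400-VM total and the host counts (20 versus 4) are hypothetical numbers chosen for illustration, and VMs are assumed to be spread evenly across hosts.

```python
def vms_lost_per_host_failure(total_vms, hosts):
    """Average number of VMs affected when one host fails, assuming an even spread."""
    return total_vms / hosts

TOTAL_VMS = 400  # hypothetical pool size

scale_out = vms_lost_per_host_failure(TOTAL_VMS, hosts=20)  # many smaller hosts
scale_up = vms_lost_per_host_failure(TOTAL_VMS, hosts=4)    # a few large hosts

print(f"Scale out: {scale_out:.0f} VMs affected ({scale_out / TOTAL_VMS:.0%} of pool)")
print(f"Scale up:  {scale_up:.0f} VMs affected ({scale_up / TOTAL_VMS:.0%} of pool)")
```

Under these assumptions a single failure affects 5% of the pool in the scale-out layout and 25% in the scale-up layout.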
Server Hardware Cont.

• Minimum specs for virtualization (blade or rack mount):
 • 2 x Quad Core CPUs
 • 48 GB of RAM (96 GB or greater is preferred for large environments)
 • Enough 1 Gb/10 Gb NICs to provide two connections to each
   uplink so you can bond the NICs for redundancy
 • HBA for servers that will connect to the SAN via fibre

• Ensure you plan for an additional host server to account for failover
  (HA) for each cluster/pool
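The extra failover host can be built directly into the host count for each cluster/pool. The sketch below assumes sizing by VM count alone, using the roughly 20 VMs per host average mentioned earlier as the default; the function name and the sample VM counts are illustrative. Real sizing would also need to check RAM and CPU per host.

```python
import math

def hosts_required(vm_count, vms_per_host=20, spare_hosts=1):
    """Hosts needed to run vm_count VMs, plus spare capacity for failover (N+1)."""
    return math.ceil(vm_count / vms_per_host) + spare_hosts

print(hosts_required(380))  # 19 hosts for the load + 1 spare
print(hosts_required(381))  # one more VM forces an additional host
```

Keeping the spare as a separate parameter makes it easy to plan for N+2 in pools where a second simultaneous failure must be tolerated.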
Server Diagram Example – Corporate DC
• Server specs:
   • 2 x Six Core CPUs
   • 96 GB of RAM
   • 6 x NICs (2 x embedded & 1 quad
      port mezzanine card)
   • 1 x dual port HBA mezzanine card

• Interconnect specs:
    • 4 x network switches (1 Gb)
    • 2 x 4 Gb SAN switches
    • 1 x 1 Gb Ethernet pass-thru
       module (for backups)
Server Diagram Example – Regional DC
• Server specs:
   • 2 x Quad Core CPUs
   • 96 GB of RAM
   • 8 x NICs (2 x embedded, 1 quad
      port & 1 dual port NIC mezzanine
      card)

• Interconnect specs:
    • 6 x network switches (1 Gb)
Power Design

• It is important to properly size the power circuits the host
  servers will use since they draw more power than standard
  servers
• Ensure that the environment utilizes two load balanced
  circuits or two independent circuits for redundancy
• Ensure each circuit is terminated from a different feeder
• Separate the virtual host servers into at least two different
  racks
Power Diagram Example – Corporate DC
• Each rack contains:
   • 2 x L6-30 208 V (A & B) single-phase circuits
   • Each A & B circuit is load balanced
   • 4 x 30 A, 208 V single-phase PDUs

• The two blade enclosures that house all of the virtual hosts
  are located in two different racks
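A quick way to sanity-check circuit sizing is to confirm that one circuit can carry the whole rack if the other feed is lost. The sketch below assumes 30 A, 208 V single-phase circuits as in the example, plus an 80% derating for continuous load, which is a common practice in the US but is an assumption here rather than something stated in this deck. The rack loads are hypothetical.

```python
def circuit_capacity_va(volts=208, amps=30, derate_pct=80):
    """Usable single-phase capacity in VA after derating for continuous load."""
    return volts * amps * derate_pct / 100

def survives_feed_loss(rack_load_va, volts=208, amps=30, derate_pct=80):
    """True if one circuit alone can carry the entire rack load."""
    return rack_load_va <= circuit_capacity_va(volts, amps, derate_pct)

print(circuit_capacity_va())        # usable VA per circuit
print(survives_feed_loss(4500))     # hypothetical rack load in VA
print(survives_feed_loss(5500))     # hypothetical heavier rack load in VA
```

When the A and B circuits are load balanced, each normally carries about half the rack load, so the combined load should stay within the usable capacity of a single circuit.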
Monitoring Solution

• The health of the virtual environment is critical, so it is key to
  monitor and alert on the following areas:
 • Physical hardware - detect if a DIMM, disk, CPU, etc. goes bad
 • VMs - verify they are online and not overutilized/oversubscribed
 • Virtual Platform - detect failures within the hypervisor
 • Capacity - verify that each host/cluster/pool is not running out of resources
   (storage, RAM, CPUs, etc.) that would prevent provisioning new VMs

• It usually requires a mix of native and 3rd party tools to
  successfully monitor all aspects of a virtual environment
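The capacity check in particular lends itself to a simple threshold rule. The sketch below is a minimal illustration, not a description of any specific monitoring product: the 80% threshold, the data layout, and the usage figures are all hypothetical.

```python
def pools_near_capacity(pools, threshold=0.80):
    """Return (pool, resource, utilization) for every resource above the threshold."""
    flagged = []
    for name, resources in pools.items():
        for resource, (used, total) in resources.items():
            utilization = used / total
            if utilization > threshold:
                flagged.append((name, resource, round(utilization, 2)))
    return flagged

# Hypothetical (used, total) figures per pool and resource.
sample = {
    "Production": {"storage_tb": (17.0, 20.0), "ram_gb": (1200, 1920)},
    "QA": {"storage_tb": (5.0, 9.5), "ram_gb": (400, 768)},
}

alerts = pools_near_capacity(sample)
print(alerts)
```

In practice the usage figures would come from the hypervisor's management API or a third-party monitoring tool, and the alert would feed the same channel as hardware and VM alerts.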
Management Solution

• Centralized VM and host management is extremely
  important, and all of the major virtualization vendors
  provide a centralized management solution
• Auto-provisioning of VMs
 • This is a key component of the Cloud and is not always adequately addressed
   by the centralized management solution provided by the virtualization vendors
 • It often requires a combination of custom, internally developed
   applications and 3rd party products
 • A good provisioning tool will support a chargeback
   model for VMs and enforce proper approvals to control the growth of
   VMs
Management Solution cont.

• How to address VM sprawl?
 • Place proper controls/approvals on who can request VMs and how many each user can request
 • Automatically track the number, hostname, and type of VM a user creates via
   the self/auto provisioning process
 • Monitor the utilization of all VMs, and then either automatically power the
   underutilized VMs down or follow-up with the VM owner
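The tracking and utilization checks described above can be sketched against a simple VM inventory. Everything in this example is hypothetical: the inventory records, the per-user quota of two VMs, and the 5% average CPU threshold are placeholders for whatever the provisioning process actually records.

```python
from collections import Counter

def over_quota(vms, quota):
    """Return {owner: vm_count} for owners holding more VMs than the quota allows."""
    counts = Counter(vm["owner"] for vm in vms)
    return {owner: n for owner, n in counts.items() if n > quota}

def underutilized(vms, cpu_threshold_pct=5.0):
    """Return hostnames of VMs whose average CPU use is below the threshold."""
    return [vm["hostname"] for vm in vms if vm["avg_cpu_pct"] < cpu_threshold_pct]

# Hypothetical inventory as tracked by the self/auto provisioning process.
inventory = [
    {"hostname": "dev-web01", "owner": "alice", "type": "4GB", "avg_cpu_pct": 22.0},
    {"hostname": "dev-web02", "owner": "alice", "type": "4GB", "avg_cpu_pct": 1.5},
    {"hostname": "dev-db01", "owner": "alice", "type": "16GB", "avg_cpu_pct": 40.0},
    {"hostname": "qa-app01", "owner": "bob", "type": "8GB", "avg_cpu_pct": 3.0},
]

print(over_quota(inventory, quota=2))
print(underutilized(inventory))
```

The flagged hostnames would then be either powered down automatically or sent to the VM owner for follow-up.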

• We have had our own challenges with trying to implement a
  fully automated solution that incorporates all of our needs,
  and this is something that large companies within the IT
  industry have struggled with as well.
Documenting the solution

• During the design and implementation phase of the
  environment it is important to take detailed notes and
  diagram each of the phases
• A good design document will provide a clear and concise
  view of how all aspects of the environment are configured
• When we hand off any environment to our Operations team,
  we provide a detailed design doc and a runbook, and then hold
  an official handoff meeting to cover any questions or
  concerns the Operations team may have.
Questions?
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Recently uploaded (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

How to Design a Scalable Private Cloud

  • 1. Interested in learning more about Cloud? Look at the Cloud sessions offered at the upcoming Fall 2012 Data Center World Conference at: www.datacenterworld.com. This presentation was given during the Spring 2012 Data Center World Conference and Expo. Contents are owned by AFCOM and Data Center World and can only be reused with the express permission of AFCOM. For questions or permission, contact: jater@afcom.com.
  • 2. How to Design a Scalable Private Cloud Mark Sand Datacenter Architect Citrix Systems Inc.
  • 3. Defining the Private & Public Clouds • Private vs. Public Clouds (Infrastructure as a Service - IaaS) • The private cloud is a virtual environment deployed within an organization that is restricted to users within the company and usually resides behind the corporate firewall. The private cloud also includes an easy-to-use web portal that allows end users to auto-provision and manage the lifecycle of their VMs, and may or may not incorporate a chargeback model. • The public cloud is a virtual environment that is publicly available for any consumer to purchase computing resources, usually on a pay-per-use basis. Consumers purchase, manage, and monitor the lifecycle of their VMs through an easy-to-use web portal.
  • 4. Designing the Cloud Infrastructure • Proper planning and design are critical components to successfully implementing a scalable Cloud environment • Here are some key design areas that we will address: • Capacity planning and sizing • Virtual Platform (hypervisor) • Datacenter locations (will this be a global Cloud or hosted from one DC) • Networking • SAN (NAS/Fibre) • Server Hardware • Power • Monitoring & Management Solutions • Documenting the solution
  • 5. Capacity Planning and Sizing the Environment • Accurate capacity planning and sizing will ensure that you implement a scalable, supportable, and successful environment • Key sizing criteria: • Number of VMs you are looking to host per virtual server • Number and types of clusters/pools • Estimated yearly growth for VMs • Amount of storage required to host all of the VMs for current and future growth • Amount of estimated network bandwidth required to host the VMs for current and future growth
  • 6. Current Capacity and Sizing Example • Cluster/pool(s) configuration: • We support a mix of 2, 4, 8, and 16 GB VMs in each of our cluster/pool(s) • We average approximately 20 VMs per host
      Cluster/Pool   Number of Hosts   Total Storage
      Production     20                20 TB
      QA             8                 9.5 TB
      Dev            11                15 TB
      DMZ            6                 4 TB
      DR             15                4 TB
  • 7. Current Capacity and Sizing cont. • Average Yearly Growth Statistics: • VMs account for approximately 85% of our yearly server growth • We add approximately 5 to 10 TB of storage per year (spread across all cluster/pool(s)) • We have not needed to add any network bandwidth since the environment was implemented
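The figures on the two capacity slides can be turned into a rough projection. The following is a minimal sketch, not the presenter's method: it assumes storage grows linearly at the stated 5 to 10 TB per year and that the overall average of 20 VMs per host applies to every pool (the slides give only one average, and DR hosts may well run fewer). The function names are hypothetical.

```python
# Pool figures from the "Current Capacity and Sizing Example" slide.
POOLS = {
    "Production": {"hosts": 20, "storage_tb": 20.0},
    "QA": {"hosts": 8, "storage_tb": 9.5},
    "Dev": {"hosts": 11, "storage_tb": 15.0},
    "DMZ": {"hosts": 6, "storage_tb": 4.0},
    "DR": {"hosts": 15, "storage_tb": 4.0},
}
AVG_VMS_PER_HOST = 20            # approximate average from the slide
YEARLY_GROWTH_TB = (5.0, 10.0)   # low and high yearly storage growth from the slide


def totals(pools):
    """Return (total hosts, total storage in TB) across all pools."""
    hosts = sum(p["hosts"] for p in pools.values())
    storage = sum(p["storage_tb"] for p in pools.values())
    return hosts, storage


def project_storage(current_tb, years, growth=YEARLY_GROWTH_TB):
    """Assumption: linear growth. Returns a (low, high) estimate in TB."""
    low, high = growth
    return current_tb + low * years, current_tb + high * years


hosts, storage_tb = totals(POOLS)
estimated_vms = hosts * AVG_VMS_PER_HOST
low_tb, high_tb = project_storage(storage_tb, years=3)
print(f"{hosts} hosts, ~{estimated_vms} VMs, {storage_tb} TB today")
print(f"3-year storage estimate: {low_tb} to {high_tb} TB")
```

A range rather than a single number keeps the uncertainty in the growth figure visible when ordering storage.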
  • 8. Virtual Platform & Datacenter Locations • Selecting the proper virtual platform (hypervisor): • Several hypervisors are available, each with benefits and drawbacks, so each organization should choose the option that best fits its needs • Datacenter Locations: • Determine whether the cloud will be hosted from several global datacenters or from one central datacenter • If the cloud will be hosted from different locations, it is also important to follow a set of standards for each of the areas we will be talking about (network, storage, server HW, etc.)
  • 9. Datacenter Locations Example • US Private Cloud • We currently have a large private cloud environment that is hosted out of our corporate datacenter as well as a smaller private cloud that is hosted in two additional datacenters in the US • Global Private Cloud • We currently have a private cloud environment in three of our regional datacenters • Global Standards • We have standardized on the same server hardware/configuration and networking devices for the global private cloud; however, we were required to create two different storage standards
  • 10. Network Design • Define the type of uplinks that will be used: • 1 Gb uplink • Multiple 1 Gb uplinks configured as a port channel • 10 Gb uplink • Number/type of uplinks for each of the host's functions: • Virtual Server Management Interface • VM traffic • NFS/iSCSI traffic for environments utilizing NAS • Utilize redundant uplinks from separate switches • Evaluate the proper size & number of VLANs required
  • 11. Network Description Example • Network Components • Management Network • 2 x switches with 2 x 1 Gb uplinks connected to each switch. Each switch is connected to a different distribution layer switch to ensure network redundancy • VM Traffic • 2 x blade switches with 4 x 1 Gb uplinks, configured so that a 2 Gb port channel connects to each switch. We have three dedicated /24 VLANs for new VMs, and we also trunk existing VLANs to the switches to account for servers that were P2V’ed and are unable to change their IP address • Storage Traffic (regional datacenters only) • 2 x blade switches with 4 x 1 Gb uplinks, configured so that a 2 Gb port channel connects to each switch
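Evaluating VLAN size comes down to counting addresses. As a minimal sketch, the snippet below uses Python's standard `ipaddress` module to estimate how many new VMs three /24 VLANs can address. The subnet values and the one reserved gateway address per VLAN are assumptions for illustration; the slide does not say which subnets are used.

```python
import ipaddress

# Hypothetical subnets: the slide says three dedicated /24 VLANs, not which ones.
NEW_VM_VLANS = ["10.10.1.0/24", "10.10.2.0/24", "10.10.3.0/24"]


def usable_hosts(cidr):
    """Usable host addresses in an IPv4 subnet (network and broadcast excluded)."""
    return sum(1 for _ in ipaddress.ip_network(cidr).hosts())


def vm_address_capacity(cidrs, reserved_per_vlan=1):
    """Addresses left for VMs after reserving some per VLAN (e.g. the gateway)."""
    return sum(usable_hosts(c) - reserved_per_vlan for c in cidrs)


capacity = vm_address_capacity(NEW_VM_VLANS)
print(f"Addresses available for new VMs: {capacity}")
```

Comparing this number with the expected VM count and yearly growth shows whether more or larger VLANs are needed before the pool fills up.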
  • 12. Network Diagram Example – Corporate DC Note: Storage is connected via an HBA to our fibre channel SAN (not depicted here)
  • 13. Network Diagram Example – Regional DCs Note: Storage for the regional servers is connected to our NAS via NFS
  • 14. SAN/NAS Design • NAS vs. Fibre arrays: • Each technology has benefits and drawbacks, so each organization should choose whichever option best fits its needs • Define a standard LUN size • Define a standard naming convention when creating LUN/volume(s) • For NAS, be sure to configure a dedicated VLAN for VM storage traffic • For fibre channel SANs, ensure that you have two independent SAN fabrics (A&B) & utilize multipathing
  • 15. Storage Example – Corporate DC • Two fully populated blade enclosures connect into our fibre channel SAN via 4 Gb SAN switches • We standardized on 2 TB LUNs (storage repositories / data stores) in our corporate DC and 1 TB LUNs regionally
  • 16. Storage Diagram Example – Corporate DC
  • 17. Server Hardware • Scale Out vs. Scale Up Methodologies • Scale Out - several host servers are configured with standard to moderate virtualization specs (2 x CPUs & 48 to 128 GB of RAM) that make up a pool/cluster • Pros: The servers are less expensive so you can usually grow the pool faster, and fewer VMs are affected if a server fails • Cons: There are more servers to manage in each pool/cluster • Scale Up - only a few host servers are configured with large virtualization specs (4 CPUs or greater & 128 GB of RAM or greater) that can handle a large number of VMs • Pros: You can run a large number of VMs on the host server due to the vast resources each server has available • Cons: The servers are costly so you will likely not be able to grow the pool/cluster as fast, and you will potentially have a larger outage for VMs if a host fails
  • 18. Server Hardware Cont. • Minimum specs for virtualization (blade or rack mount): • 2 x Quad Core CPUs • 48 GB of RAM (96 GB or greater is preferred for large environments) • Enough 1 Gb/10 Gb NICs to allow two connections to each uplink so you can bond the NICs for redundancy • HBA for servers that will connect to the SAN via fibre • Ensure you plan for an additional host server to account for failover (HA) for each cluster/pool
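The "one additional host for failover" rule can be expressed as a small sizing calculation. This is a minimal sketch under stated assumptions: the function names are hypothetical, the hypervisor memory reserve of 8 GB is an assumed figure that does not come from the slides, and memory overcommit is ignored.

```python
import math


def hosts_required(vm_count, vms_per_host, spare_hosts=1):
    """Hosts needed to run vm_count VMs, plus spare hosts for failover (HA)."""
    if vm_count < 0 or vms_per_host <= 0:
        raise ValueError("vm_count must be >= 0 and vms_per_host > 0")
    return math.ceil(vm_count / vms_per_host) + spare_hosts


def vms_per_host_by_ram(host_ram_gb, avg_vm_ram_gb, hypervisor_reserve_gb=8):
    """RAM-bound VM count per host without memory overcommit.

    hypervisor_reserve_gb is an assumed figure, not one from the slides.
    """
    return (host_ram_gb - hypervisor_reserve_gb) // avg_vm_ram_gb


# 150 VMs at 20 per host: 8 hosts to run them plus 1 spare.
print(hosts_required(vm_count=150, vms_per_host=20))
# How many 4 GB VMs fit in a 96 GB host after the assumed reserve.
print(vms_per_host_by_ram(host_ram_gb=96, avg_vm_ram_gb=4))
```

With scale-up hardware the same formula applies, but each spare host is more expensive and each failure displaces more VMs, which is the trade-off described on the previous slide.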
  • 19. Server Diagram Example – Corporate DC • Server specs: • 2 x Six Core CPUs • 96 GB of RAM • 6 x NICs (2 x embedded & 1 quad port mezzanine card) • 1 x dual port HBA mezzanine card • Interconnect specs: • 4 x network switches (1 Gb) • 2 x 4 Gb SAN switches • 1 x 1 Gb Ethernet pass-thru module (for backups)
  • 20. Server Diagram Example – Regional DC • Server specs: • 2 x Quad Core CPUs • 96 GB of RAM • 8 x NICs (2 x embedded, 1 quad port & 1 dual port NIC mezzanine card) • Interconnect specs: • 6 x network switches (1 Gb)
  • 21. Power Design • It is important to properly size the power circuits the host servers will use since they draw more power than standard servers • Ensure that the environment utilizes two load balanced circuits or two independent circuits for redundancy • Ensure each circuit is terminated from a different feeder • Separate the virtual host servers into at least two different racks
  • 22. Power Diagram Example – Corporate DC • Each rack contains: • 2 x L6-30 208 V (A&B) single phase circuits • Each A&B circuit is load balanced • 4 x 30 A 208 V single phase PDUs • The two blade enclosures that house all of the virtual hosts are located in two different racks
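Sizing the A and B circuits is simple arithmetic. The sketch below is illustrative only and rests on two assumptions that are not on the slide: a circuit is loaded to no more than 80% of its breaker rating for continuous loads (a common rule of thumb), and the rack must keep running on a single circuit if the other feed is lost. The rack load values are hypothetical.

```python
def usable_va(volts, amps, derate=0.8):
    """Continuous capacity of one circuit in volt-amperes.

    derate=0.8 is the assumed 80% continuous-load rule of thumb.
    """
    return volts * amps * derate


def survives_feed_loss(rack_load_va, volts=208, amps=30):
    """True if one circuit alone can carry the whole rack load.

    With load-balanced A and B circuits, each normally carries about half,
    so the full load must still fit on a single circuit if the other fails.
    """
    return rack_load_va <= usable_va(volts, amps)


capacity = usable_va(208, 30)
print(f"One 30 A circuit at 208 V: {capacity:.0f} VA usable")
print(survives_feed_loss(4500))  # hypothetical rack load in VA
print(survives_feed_loss(5500))  # hypothetical rack load in VA
```

Measured draw from the PDUs is a better input than nameplate ratings, which usually overstate what a host actually pulls.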
  • 23. Monitoring Solution • The health of the virtual environment is critical, so it is key to monitor and alert on the following areas: • Physical hardware - detect if a DIMM, disk, CPU, etc. goes bad • VMs - verify they are online and not overutilized/oversubscribed • Virtual Platform - detect failures within the hypervisor • Capacity - verify that each host/cluster/pool is not running out of resources (storage, RAM, CPUs, etc.) that would prevent provisioning new VMs • It usually requires a mix of native and 3rd party tools to successfully monitor all aspects of a virtual environment
  • 24. Management Solution • Centralized VM and host management is extremely important; fortunately, all of the major virtualization vendors provide a centralized management solution • Auto provisioning of VMs • This is a key component of the Cloud and is not always adequately addressed by the centralized management solution provided by the virtualization vendors • This often requires a combination of custom (internally developed) applications and 3rd party products • A good provisioning tool will support a chargeback model for VMs as well as proper approvals to control the growth of VMs
  • 25. Management Solution cont. • How to address VM sprawl? • Place proper controls/approvals on who can request VMs and how many each user can request • Automatically track the number, hostname, and type of VMs a user creates via the self/auto provisioning process • Monitor the utilization of all VMs, and then either automatically power down the underutilized VMs or follow up with the VM owner • We have had our own challenges with trying to implement a fully automated solution that incorporates all of our needs, and this is something that large companies within the IT industry have struggled with as well.
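The sprawl controls above (per-user limits, tracking, and utilization checks) can be sketched in a few lines. This is an illustration under assumptions, not the tooling described in the talk: the record fields, the 5% CPU threshold, the 14-sample minimum, and the per-owner quota are all hypothetical values, and a real system would pull this data from the provisioning portal and monitoring tools.

```python
from collections import Counter
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class VmRecord:
    hostname: str
    owner: str
    vm_type: str
    cpu_samples: list = field(default_factory=list)  # e.g. average CPU % per day


def owners_over_quota(vms, max_vms_per_owner=5):
    """Owners holding more VMs than the (assumed) per-user limit."""
    counts = Counter(vm.owner for vm in vms)
    return sorted(o for o, n in counts.items() if n > max_vms_per_owner)


def underutilized(vms, cpu_threshold=5.0, min_samples=14):
    """Hostnames whose mean CPU is below the threshold over enough samples."""
    return [
        vm.hostname
        for vm in vms
        if len(vm.cpu_samples) >= min_samples and mean(vm.cpu_samples) < cpu_threshold
    ]


inventory = [
    VmRecord("dev-web-01", "alice", "4GB", [2.0] * 14),
    VmRecord("dev-db-01", "alice", "8GB", [35.0] * 14),
    VmRecord("qa-app-01", "bob", "2GB", [1.0] * 3),  # too few samples to judge
]
print(underutilized(inventory))
print(owners_over_quota(inventory, max_vms_per_owner=1))
```

Requiring a minimum number of samples avoids flagging freshly provisioned VMs; the flagged list can feed either an automatic power-down or a follow-up with the owner.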
  • 26. Documenting the solution • During the design and implementation phase of the environment it is important to take detailed notes and diagram each of the phases • A good design document will provide a clear and concise view of how all aspects of the environment are configured • When we hand off any environment to our Operations team we provide a detailed design doc and a runbook, and then hold an official handoff meeting to cover any questions or concerns the Operations team may have.