SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
Xen @ Google
  Iustin Pop, <iustin@google.com>
          Google Switzerland




Sponsored by:

                &     &
Introduction       Corporate infrastructure       Technology   Workflows   Outlook



                                              Outline
       Introduction
           Talk overview
       Corporate infrastructure
          Overview
          Use cases
       Technology
          Open source components
          Internal components
       Workflows
         Common workflows
       Outlook
          Open-source road map
          Internal deployment road map
Introduction          Corporate infrastructure        Technology   Workflows           Outlook



                                                 Overview


       I will talk about. . .
           • virtualization at Google:
                • in the corporate (internal) infrastructure
                • this is not used for user-facing products (search, gmail, . . . )
           • use cases, software used, tools and infrastructure

       Terminology I might use (accidentally):
                  node physical machine (dom0)
               instance virtual machine (domU)
Introduction       Corporate infrastructure       Technology   Workflows   Outlook



                                              Outline
       Introduction
           Talk overview
       Corporate infrastructure
          Overview
          Use cases
       Technology
          Open source components
          Internal components
       Workflows
         Common workflows
       Outlook
          Open-source road map
          Internal deployment road map
Introduction         Corporate infrastructure   Technology   Workflows   Outlook



                               Corporate infrastructure


           • comprises servers located in offices
               • support local office infrastructure
               • low-latency services (e.g. DNS, caches)
               • (very) small numbers of machines
               • spread across many offices
           • and servers located in datacenters
               • various purposes
               • just a few datacenters
               • but many machines per datacenter
           • note that we virtualise mostly Linux servers
Introduction          Corporate infrastructure   Technology    Workflows        Outlook



                                     Office deployments


           • how to provide (redundant) services with as few machines as
               possible?
                 • some offices are remote enough that timely replacement of
                   parts is not a given
                 • other offices are big enough to need multiple, redundant copies
                   of services
           • initial use of Xen (early 2006), and start of tool development
           • this allowed shrinking footprint down to 2-4 machines
           • and improved reliability against hardware failures
           • however it somewhat decreased software reliability
Introduction         Corporate infrastructure   Technology     Workflows   Outlook



                               Datacenter deployments



           • in DCs we have multiple use cases:
                • again corporate infrastructure, e.g. DNS, LDAP, etc.
                • team servers/one-off applications
                • virtual workstations
           • and the challenges are different:
                • scaling management software to many machines
                • capacity planning
                • redundancy across racks
                • intra and inter-DC VM moves
Introduction         Corporate infrastructure   Technology    Workflows        Outlook



                        Use case: server consolidation


           • the “standard” way to use virtualisation
           • reduces hardware/footprint/power
           • services owned by dedicated services team or by a small team
                • redundancy can be implemented at VM level or at service level
                • resource guarantees needs can vary from “very strict” to
                  relaxed
           • interesting corner cases
                • services which cannot afford the downtime of live migration
                • services which cannot afford the performance penalty of
                  virtualisation
Introduction          Corporate infrastructure   Technology   Workflows        Outlook



                       Use case: desktop virtualisation
           • internal project named Ubiquity
           • allows every engineer to have a virtual workstation in the
               “internal cloud” (a nearby datacenter)
           • accessible over either SSH or NX
           • advantages:
               • workstation state stored in the “cloud”, not on (less managed)
                 physical workstation
               • workstation closer to datacenter-based services
               • easier to provision more hardware in a datacenter than in a
                 (possibly space-restricted) office
               • workstations can follow people as they travel
           • potential issues:
               • depending on hardware refresh cycles, a dedicated physical
                 workstation can be more powerful than a shared virtual one
               • latency to datacenter can sometimes be a problem
Introduction         Corporate infrastructure   Technology   Workflows          Outlook



                   Use case: machine management layer

           • some workloads are too big for a shared environment
           • but virtualisation has other advantages beside consolidation:
               • independence from hardware (well, storage. . . )
               • the hypervisor layer can abstract/unify hardware monitoring
               • much easier to move to new platforms
           • hence the use of virtualisation in single-VM-per-machine
               model, aka “dedicated” model
           • still in testing
           • what do to when size of VM smaller than size of HW?
           • currently investigating a “hard-partitioned” model:
               • share machines, but do not oversubscribe any resource
               • try to isolate CPU cores, disk spindles, network, RAM
Introduction       Corporate infrastructure       Technology   Workflows   Outlook



                                              Outline
       Introduction
           Talk overview
       Corporate infrastructure
          Overview
          Use cases
       Technology
          Open source components
          Internal components
       Workflows
         Common workflows
       Outlook
          Open-source road map
          Internal deployment road map
Introduction         Corporate infrastructure        Technology   Workflows      Outlook



                                                Overview
           • we deploy Xen. . .
               • on standard (off-the-shelf) x86 hardware (amd64)
               • on top of standard operating systems (Debian and Ubuntu)
               • in paravirtualised mode
           • no SAN/NAS: compute nodes are storage nodes as well
           • layered software model: machine ⇒ cluster ⇒ fleet
           • machine level handles hardware and hypervisor management
           • cluster level abstracts machines:
                • all resources are internal to, and managed by the cluster
                • software scales from one to a few hundred physical machines
                • upper level deals with clusters, not machines
           • fleet level abstracts clusters:
                • end-users do not care about specific clusters (maybe
                  geographic location)
Introduction          Corporate infrastructure   Technology   Workflows         Outlook



                          Machine level: OS/hypervisor


           • we use Xen as KVM still has some disadvantages for us:
               • mostly performance-related, but note that also Xen pvops is
                 not as good as Xen “native” (2.6.18 patch)
               • but the field is still evolving
               • and we could convert easily from one to the other
           • we use Debian stable/Ubuntu LTS as base OS:
               • choice of OS is due to many Debian developers in the team
               • standard OS install, just trimmed down
               • standard tools for base OS configuration (cfengine/puppet)
               • all machine installs are fully automated
           • at this level, it’s mostly what you would get from a plain
               Debian + Xen install
Introduction         Corporate infrastructure       Technology   Workflows        Outlook



                                            Cluster level

           • we use Ganeti as virtualisation manager
                • it supports other hypervisors but we only use Xen
                • for storage, we use mostly DRBD (network-level RAID 1), and
                  also simple LVM storage
           • cluster layout
                • physical machines (“nodes”) are organised in “node groups”
                • multiple node groups constitute a cluster
                • the node group is the default mobility domain for the VMs
           • no single point of failure for the cluster
                • one machine acts as “cluster master”, but this role can be
                  moved
                • no external resource dependencies (especially storage)
                • no network-level services required for the cluster operation
           • all software at cluster level is open-sourced
Cluster diagram

                         SSH access
Remote API




                                        Ganeti node                                               Ganeti node                                             Ganeti node



                                                          Per machine monitoring (external)




                                                                                                                    Per machine monitoring (external)




                                                                                                                                                                            Per machine monitoring (external)
                                        Ganeti node                                               Ganeti node                                             Ganeti node

                                        Ganeti node                                               Ganeti node                                             Ganeti node

                                        Ganeti node                                               Ganeti node                                             Ganeti node
             current master node




                                        Ganeti node                                               Ganeti node                                             Ganeti node

                                        Ganeti node                                               Ganeti node                                             Ganeti node


                                      Ganeti node group                                         Ganeti node group                                       Ganeti node group



  Ganeti cluster
Introduction          Corporate infrastructure          Technology   Workflows    Outlook



                                                 Fleet level
           • we integrate with internal systems at this level
           • this is done via internal software (not open source)
                • both generic (to Google):
                      • monitoring
                      • machine database
                 • and specific to Ganeti-in-Google:
                     • web interface to the clusters (code name Virgil)
                     • cluster-level configuration management (Dradis)
                     • machine (repair) workflow manager (Euripides)
                 • these are related to hardware work-flows, not virtualisation
           • the generic components have open-source alternatives
           • “Ganeti Web Manager” is an open source web console
           • no known equivalents for Dradis and Euripides
           • large deployments of Ganeti will most likely need to
               reimplement them
Introduction          Corporate infrastructure   Technology       Workflows          Outlook



                                       Fleet organisation

           • clusters are split (categorised) according to customer type
           • clusters of the same type and in the same region are split in
               two “maintenance windows”
                 • allow for maintenance work on only half of the clusters in a
                    region
                 • compensates for the fact that the cluster is a single point of
                    failure for a given VM
           • Virgil talks to all the clusters and provides fleet overview
           • such meta-level organisation is implemented at Ganeti level
               via cluster tags
           • tags are used for many other tasks that cannot be expressed
               directly at Ganeti level
Fleet diagram
   Ganeti cluster                                                 Ganeti cluster
   type Office                                                    type General
   no maint window                                                maint window A
                                            Virgil
                                            Euripides
        Office ZURICH                       Dradis
                                                                  Ganeti cluster
                                                                  type General
                                                                  maint window B


Ganeti cluster
type General                                                      Ganeti cluster
maint window A                                                    type Ubiquity
                                                                  no maint window
                                       Ganeti cluster
                                       type Dedicated
Ganeti cluster                         maint window A
type General
maint window B                                                            Datacenter Z
                        VM




                                       Ganeti cluster
                         tra




                                       type Dedicated
                             ns




Ganeti cluster                         maint window B
                               fe
                                  r




type Ubiquity
no maint window

                                       Ganeti cluster
                                       type General
                                       maint window A
        Datacenter X
                                                   Datacenter Y
Introduction         Corporate infrastructure   Technology       Workflows           Outlook



                                    Other internal tools

           • machine history console
               • displays physical machine history
               • ties into monitoring, hardware repairs process, life-cycle, etc.
           • rolling-reboot tool
               • allows rebooting an entire cluster without VM impact
               • uses live migration and sequential reboots
           • ganeti-capacity: a capacity planning tool
               • computes simulated cluster capacity
               • VM specs versus physical resources, space, power
               • soon to be open sourced, not related to internal systems
           • and many other small tools
               • notification of owners per cluster/physical machine
               • monitoring and resource dashboards
               • etc.
Introduction         Corporate infrastructure     Technology   Workflows         Outlook



                                           Known issues



           • VM clock issues:
               • a long-standing problem
               • we still see cases where VM clocks are rolled back 3000s due
                  to machine clock-source problems
           • IO issues
               • DRBD + Xen much lower performance than just DRBD or
                  just Xen
               • In general, hard to model I/O performance
Introduction       Corporate infrastructure       Technology   Workflows   Outlook



                                              Outline
       Introduction
           Talk overview
       Corporate infrastructure
          Overview
          Use cases
       Technology
          Open source components
          Internal components
       Workflows
         Common workflows
       Outlook
          Open-source road map
          Internal deployment road map
Introduction          Corporate infrastructure    Technology            Workflows                                             Outlook



                                           VM allocation

      1. Virgil gets an allocation                     Ganeti cluster                                        Alloc request
                                                       type General




                                                                            RAPI interface
         request (region, cluster type)
      2. creates machine record
         (DNS, other systems)                          Ganeti cluster
                                                       type General
      3. selects “best” cluster based
         on VM spec, capacity data                                                                           Virgil
                                                       Ganeti cluster
      4. and tells it to create the VM                 type Ubiquity




                                                                                             scan capacity
      5. cluster selects best physical
         machine(s) to host the VM
                                                       Ganeti cluster                                           Machine DB
      6. VM is created, and OS                         type Dedicated
         installation scripts are run
               • install software                                                                     Monitoring
                                                      gather capacity
               • configure authentication
Introduction                                   Corporate infrastructure                       Technology          Workflows                                                   Outlook



                                                       Handling machine failures

                                        Euripides                                   Send machine              Virgil
                                                                                    to repairs (2)




                                                                                                                             Mark machine broken (3)
       Monitoring − detects fault (1)




                                                                                                                                                       Send to repairs (5)
                                                                                           Tell cluster to evacuate
                                           Ganeti HW
                                                                                           the broken machine (4)

                                           Ganeti HW
                                                                          Ganeti cluster



                                           broken HW

                                           Ganeti HW                                                          Machine database
Introduction        Corporate infrastructure   Technology       Workflows        Outlook



                            Handling machine failures

          1. monitoring detects a HW problem (e.g. disk error, memory
             problem, etc.)
          2. Euripides (for non-critical problems) tells Virgil a machine
             needs to be sent to repairs
               • for critical events (machine dead), on-call person is paged,
                  instructs Euripides how to proceed
          3. Virgil first marks the machine as “not in production”
          4. then tells the cluster to evacuate the VMs from it
          5. finally requests repairs by local tech

       Note
           • for “known” errors, the process if fully automated
           • otherwise, an “exception” case is created for investigation
Introduction                                  Corporate infrastructure             Technology       Workflows                        Outlook



                                               Handling repaired/new machines

                                       Euripides                           Detects machine      Machine DB
                                                                           was repaired (1)

                                                              Tells Virgil to reintegrate machine (3)




                                                                                                                      serving (6)
       Watches machine for 24hrs (2)




                                                                                                               Mark
                                                                          Add to cluster (5)
                                          Ganeti HW
                                                                          Rebalance (7)         Virgil
                                          Ganeti HW
                                                                         Configure (4)
                                          repaired HW
                                                                                                Dradis
                                          Ganeti HW
Introduction          Corporate infrastructure   Technology      Workflows        Outlook



                      Handling repaired/new machines

          1. Euripides detects new or repaired machine in Machine DB
                 • at first, it’s being kept “under watch” for a period of time
          2. it tells Virgil to integrate new machine
          3. Virgil calls Dradis to configure the machine appropriately
          4. Virgil tells the cluster to add the new machine
          5. finally the new machine is marked as serving
          6. the cluster will be rebalanced in order to utilise the machine

       Note
           • assuming no errors in the OS installation, configuration, etc.,
               the process is fully automated
Introduction         Corporate infrastructure   Technology     Workflows        Outlook



                             Internal cluster workflows


           • htools component shipped with Ganeti can
                • balance the cluster
                • compute cluster capacity
                • compute node evacuation strategy
                • do automatic selection of nodes for VM placement
           • plugin versus API
                • node evacuation and instance placement use internal
                  “IAllocator” plugin framework
                • the other are command line tools that run talk to Ganeti using
                  its external APIs
           • the documentation explains how to use all of these
Introduction       Corporate infrastructure       Technology   Workflows   Outlook



                                              Outline
       Introduction
           Talk overview
       Corporate infrastructure
          Overview
          Use cases
       Technology
          Open source components
          Internal components
       Workflows
         Common workflows
       Outlook
          Open-source road map
          Internal deployment road map
Introduction        Corporate infrastructure   Technology   Workflows       Outlook



                                  Software road map I


           • improve the cluster resource model
               • currently models only RAM/VCPUs/disk space as resources
               • will add spindles/networking (IO) resources
               • will add dynamic memory (ballooning/tmem) support
               • will improve support for non-Xen hypervisors
               • addressing these in the next releases
           • improve remote API: eliminate the need for SSH
               • some operations not available over RAPI
               • will integrate e.g. cluster capacity reporting
               • ongoing effort towards full parity
               • eventually SSH will not be needed for operations
           • will add “hard-partitioning” model (still being designed)
Introduction         Corporate infrastructure   Technology      Workflows    Outlook



                                  Software road map II


           • improve the VM OS deployment model:
               • currently OS scripts run on the physical machine
               • this requires trusted source for OS templates
               • Ganeti users must write their own installation scripts
           • improve Xen CPU scheduler control
           • add smart LVM allocation
               • currently based on simple biggest-free-space model
               • optimisations possible for DRBD layout, flash usage, etc.
           • better handling of SAN/NAS storage
Introduction        Corporate infrastructure   Technology    Workflows           Outlook



                              Deployment road map I

           • looking at using ballooning/tmem
                • first need support in Ganeti and capacity planning tools
                • plan to start testing early next year
           • fleet refresh in progress
                • current fleet a mix of older and newer hardware
                • many machines still using only 1GbE
                • machine heterogeneity makes cluster algorithms more complex
           • investigating “embedded OS” dom0 model
                • current dom0 is a regular Linux distribution
                • sub-optimal with many machines performing the same role
                • all machines should be (roughly) identical at all times
                • aiming at an image-based setup in order to eliminate the
                  installation step and individual package upgrades
Introduction         Corporate infrastructure   Technology    Workflows       Outlook



                              Deployment road map II


       dom0 kernel versions unification:
         • currently running a mix of kernel versions
                • originally we used 2.6.18 + “native” Xen patch
                • “native” Xen had better performance than “pvops”
                • for us, 2.6.3x kernels have I/O performance problems
           • still trying to identify a current, well performing kernel
           • this prevents us from properly cooperating with upstreams
                • would like to give feedback on stability and performance
                • but hard to track down patterns across multiple versions
Introduction   Corporate infrastructure         Technology   Workflows   Outlook



                                          Questions?




                                    Thanks!
Appendix



                                      Links



      Ganeti homepage http://code.google.com/p/ganeti
      Code repositories http://git.ganeti.org/
      Documentation http://docs.ganeti.org/ganeti/current/html/
      Ganeti Web Manager http://code.osuosl.org/projects/ganeti-webmgr
      Image-based OS template
                    http://code.osuosl.org/projects/ganeti-image
      Presentation on virtual workstations http://neatx.googlecode.com/files/
                     herding-virtual-workstations-fisl-2009.pdf

Weitere ähnliche Inhalte

Was ist angesagt?

Xen: Hypervisor for the Cloud - CCC13
Xen: Hypervisor for the Cloud - CCC13Xen: Hypervisor for the Cloud - CCC13
Xen: Hypervisor for the Cloud - CCC13The Linux Foundation
 
OWF: Xen - Open Source Hypervisor Designed for Clouds
OWF: Xen - Open Source Hypervisor Designed for CloudsOWF: Xen - Open Source Hypervisor Designed for Clouds
OWF: Xen - Open Source Hypervisor Designed for CloudsThe Linux Foundation
 
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary session
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary sessionLinaro Connect Asia 13 : Citrix - Xen on ARM plenary session
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary sessionThe Linux Foundation
 
Scale11x : Virtualization with Xen and XCP
Scale11x : Virtualization with Xen and XCP Scale11x : Virtualization with Xen and XCP
Scale11x : Virtualization with Xen and XCP The Linux Foundation
 
Getting Started with XenServer and OpenStack.pptx
Getting Started with XenServer and OpenStack.pptxGetting Started with XenServer and OpenStack.pptx
Getting Started with XenServer and OpenStack.pptxOpenStack Foundation
 
Xen Project Hypervisor for the Cloud
Xen Project Hypervisor for the CloudXen Project Hypervisor for the Cloud
Xen Project Hypervisor for the CloudThe Linux Foundation
 
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...The Linux Foundation
 
S4 xen hypervisor_20080622
S4 xen hypervisor_20080622S4 xen hypervisor_20080622
S4 xen hypervisor_20080622Todd Deshane
 
LCEU13: Securing your cloud with Xen's advanced security features - George Du...
LCEU13: Securing your cloud with Xen's advanced security features - George Du...LCEU13: Securing your cloud with Xen's advanced security features - George Du...
LCEU13: Securing your cloud with Xen's advanced security features - George Du...The Linux Foundation
 
BACD July 2012 : The Xen Cloud Platform
BACD July 2012 : The Xen Cloud Platform BACD July 2012 : The Xen Cloud Platform
BACD July 2012 : The Xen Cloud Platform The Linux Foundation
 
Xenalyze: Finding meaning in the chaos
Xenalyze: Finding meaning in the chaosXenalyze: Finding meaning in the chaos
Xenalyze: Finding meaning in the chaosThe Linux Foundation
 

Was ist angesagt? (20)

Why xen slides
Why xen slidesWhy xen slides
Why xen slides
 
Xen: Hypervisor for the Cloud - CCC13
Xen: Hypervisor for the Cloud - CCC13Xen: Hypervisor for the Cloud - CCC13
Xen: Hypervisor for the Cloud - CCC13
 
OWF: Xen - Open Source Hypervisor Designed for Clouds
OWF: Xen - Open Source Hypervisor Designed for CloudsOWF: Xen - Open Source Hypervisor Designed for Clouds
OWF: Xen - Open Source Hypervisor Designed for Clouds
 
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary session
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary sessionLinaro Connect Asia 13 : Citrix - Xen on ARM plenary session
Linaro Connect Asia 13 : Citrix - Xen on ARM plenary session
 
Skylark: Easy Cloud Computing
Skylark: Easy Cloud ComputingSkylark: Easy Cloud Computing
Skylark: Easy Cloud Computing
 
Scale11x : Virtualization with Xen and XCP
Scale11x : Virtualization with Xen and XCP Scale11x : Virtualization with Xen and XCP
Scale11x : Virtualization with Xen and XCP
 
Xen and Apache cloudstack
Xen and Apache cloudstack  Xen and Apache cloudstack
Xen and Apache cloudstack
 
Getting Started with XenServer and OpenStack.pptx
Getting Started with XenServer and OpenStack.pptxGetting Started with XenServer and OpenStack.pptx
Getting Started with XenServer and OpenStack.pptx
 
Xen Project Hypervisor for the Cloud
Xen Project Hypervisor for the CloudXen Project Hypervisor for the Cloud
Xen Project Hypervisor for the Cloud
 
Openstack Xen and XCP
Openstack Xen and XCPOpenstack Xen and XCP
Openstack Xen and XCP
 
Xen.org Overview 2009
Xen.org Overview 2009Xen.org Overview 2009
Xen.org Overview 2009
 
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...
XPDS14: Xen and the Art of Certification - Nathan Studer & Robert VonVossen, ...
 
BSDcon Asia 2015: Xen on FreeBSD
BSDcon Asia 2015: Xen on FreeBSDBSDcon Asia 2015: Xen on FreeBSD
BSDcon Asia 2015: Xen on FreeBSD
 
Art of Using Xen at Scale
Art of Using Xen at ScaleArt of Using Xen at Scale
Art of Using Xen at Scale
 
S4 xen hypervisor_20080622
S4 xen hypervisor_20080622S4 xen hypervisor_20080622
S4 xen hypervisor_20080622
 
LCEU13: Securing your cloud with Xen's advanced security features - George Du...
LCEU13: Securing your cloud with Xen's advanced security features - George Du...LCEU13: Securing your cloud with Xen's advanced security features - George Du...
LCEU13: Securing your cloud with Xen's advanced security features - George Du...
 
BACD July 2012 : The Xen Cloud Platform
BACD July 2012 : The Xen Cloud Platform BACD July 2012 : The Xen Cloud Platform
BACD July 2012 : The Xen Cloud Platform
 
Xenalyze: Finding meaning in the chaos
Xenalyze: Finding meaning in the chaosXenalyze: Finding meaning in the chaos
Xenalyze: Finding meaning in the chaos
 
Xen summit amd_2010v3
Xen summit amd_2010v3Xen summit amd_2010v3
Xen summit amd_2010v3
 
Ian Pratt Nsdi Keynote Apr2008
Ian Pratt Nsdi Keynote Apr2008Ian Pratt Nsdi Keynote Apr2008
Ian Pratt Nsdi Keynote Apr2008
 

Andere mochten auch

Windsor: Domain 0 Disaggregation for XenServer and XCP
	Windsor: Domain 0 Disaggregation for XenServer and XCP	Windsor: Domain 0 Disaggregation for XenServer and XCP
Windsor: Domain 0 Disaggregation for XenServer and XCPThe Linux Foundation
 
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServer
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServerUnder the Hood: Open vSwitch & OpenFlow in XCP & XenServer
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServerThe Linux Foundation
 
Mirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliancesMirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliancesThe Linux Foundation
 
Xen PV Performance Status and Optimization Opportunities
Xen PV Performance Status and Optimization OpportunitiesXen PV Performance Status and Optimization Opportunities
Xen PV Performance Status and Optimization OpportunitiesThe Linux Foundation
 
Xen.org: The past, the present and exciting Future
Xen.org: The past, the present and exciting FutureXen.org: The past, the present and exciting Future
Xen.org: The past, the present and exciting FutureThe Linux Foundation
 
PV-Drivers for SeaBIOS using Upstream Qemu
PV-Drivers for SeaBIOS using Upstream QemuPV-Drivers for SeaBIOS using Upstream Qemu
PV-Drivers for SeaBIOS using Upstream QemuThe Linux Foundation
 
Keynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationKeynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationThe Linux Foundation
 
Hardware accelerated Virtualization in the ARM Cortex™ Processors
Hardware accelerated Virtualization in the ARM Cortex™ ProcessorsHardware accelerated Virtualization in the ARM Cortex™ Processors
Hardware accelerated Virtualization in the ARM Cortex™ ProcessorsThe Linux Foundation
 
LinuxCon Japan 13 : 10 years of Xen and Beyond
LinuxCon Japan 13 : 10 years of Xen and BeyondLinuxCon Japan 13 : 10 years of Xen and Beyond
LinuxCon Japan 13 : 10 years of Xen and BeyondThe Linux Foundation
 
Red Hat Enterprise Linux 5.2 Virtualization Guide
Red Hat Enterprise Linux 5.2 Virtualization GuideRed Hat Enterprise Linux 5.2 Virtualization Guide
Red Hat Enterprise Linux 5.2 Virtualization GuideRishi Sharma
 
Engaging the Xen Developer Comminity
Engaging the Xen Developer ComminityEngaging the Xen Developer Comminity
Engaging the Xen Developer ComminityThe Linux Foundation
 
Xen and Art of Virtualization (Xen Architecture)
Xen and Art of Virtualization (Xen Architecture)Xen and Art of Virtualization (Xen Architecture)
Xen and Art of Virtualization (Xen Architecture)Mr Cracker
 

Andere mochten auch (20)

Windsor: Domain 0 Disaggregation for XenServer and XCP
	Windsor: Domain 0 Disaggregation for XenServer and XCP	Windsor: Domain 0 Disaggregation for XenServer and XCP
Windsor: Domain 0 Disaggregation for XenServer and XCP
 
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServer
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServerUnder the Hood: Open vSwitch & OpenFlow in XCP & XenServer
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServer
 
Mirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliancesMirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliances
 
Xen PV Performance Status and Optimization Opportunities
Xen PV Performance Status and Optimization OpportunitiesXen PV Performance Status and Optimization Opportunities
Xen PV Performance Status and Optimization Opportunities
 
Engaging the xen community
Engaging the xen communityEngaging the xen community
Engaging the xen community
 
Xen.org: The past, the present and exciting Future
Xen.org: The past, the present and exciting FutureXen.org: The past, the present and exciting Future
Xen.org: The past, the present and exciting Future
 
PV-Drivers for SeaBIOS using Upstream Qemu
PV-Drivers for SeaBIOS using Upstream QemuPV-Drivers for SeaBIOS using Upstream Qemu
PV-Drivers for SeaBIOS using Upstream Qemu
 
Keynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationKeynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM Virtualization
 
Xen in Linux 3.x (or PVOPS)
Xen in Linux 3.x (or PVOPS)Xen in Linux 3.x (or PVOPS)
Xen in Linux 3.x (or PVOPS)
 
Hardware accelerated Virtualization in the ARM Cortex™ Processors
Hardware accelerated Virtualization in the ARM Cortex™ ProcessorsHardware accelerated Virtualization in the ARM Cortex™ Processors
Hardware accelerated Virtualization in the ARM Cortex™ Processors
 
µ-Xen
µ-Xenµ-Xen
µ-Xen
 
LinuxCon Japan 13 : 10 years of Xen and Beyond
LinuxCon Japan 13 : 10 years of Xen and BeyondLinuxCon Japan 13 : 10 years of Xen and Beyond
LinuxCon Japan 13 : 10 years of Xen and Beyond
 
Deep Dive into OpenStack
Deep Dive into OpenStackDeep Dive into OpenStack
Deep Dive into OpenStack
 
Xen brandeis case study
Xen brandeis case studyXen brandeis case study
Xen brandeis case study
 
Xen revisited
Xen revisitedXen revisited
Xen revisited
 
Red Hat Enterprise Linux 5.2 Virtualization Guide
Red Hat Enterprise Linux 5.2 Virtualization GuideRed Hat Enterprise Linux 5.2 Virtualization Guide
Red Hat Enterprise Linux 5.2 Virtualization Guide
 
Engaging the Xen Developer Comminity
Engaging the Xen Developer ComminityEngaging the Xen Developer Comminity
Engaging the Xen Developer Comminity
 
Pivot3 case study
Pivot3 case studyPivot3 case study
Pivot3 case study
 
Linux PV on HVM
Linux PV on HVMLinux PV on HVM
Linux PV on HVM
 
Xen and Art of Virtualization (Xen Architecture)
Xen and Art of Virtualization (Xen Architecture)Xen and Art of Virtualization (Xen Architecture)
Xen and Art of Virtualization (Xen Architecture)
 

Ähnlich wie Xen @ Google, 2011

Operating OpenStack on a Budget
Operating OpenStack on a BudgetOperating OpenStack on a Budget
Operating OpenStack on a BudgetSusan Wu
 
Operating OpenStack on a Budget
Operating OpenStack on a BudgetOperating OpenStack on a Budget
Operating OpenStack on a BudgetSamir Ibradzic
 
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing Environment
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing EnvironmentDCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing Environment
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing EnvironmentDocker, Inc.
 
How to Build a Compute Cluster
How to Build a Compute ClusterHow to Build a Compute Cluster
How to Build a Compute ClusterRamsay Key
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...Splunk
 
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2P
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2PAccelerating Server Hardware Upgrades with PlateSpin Migrate P2P
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2PNovell
 
The Foundation of the Software Defined Data Center
The Foundation of the Software Defined Data CenterThe Foundation of the Software Defined Data Center
The Foundation of the Software Defined Data CenterArraya Solutions
 
Virtualisation at Ringo
Virtualisation at RingoVirtualisation at Ringo
Virtualisation at RingoJeremy Brown
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateNovell
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateNovell
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateNovell
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateNovell
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateNovell
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterpriseBert Poller
 
The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)Simon Haslam
 
High Performance Cloud Computing
High Performance Cloud ComputingHigh Performance Cloud Computing
High Performance Cloud ComputingNephoScale
 

Ähnlich wie Xen @ Google, 2011 (20)

Operating OpenStack on a Budget
Operating OpenStack on a BudgetOperating OpenStack on a Budget
Operating OpenStack on a Budget
 
Operating OpenStack on a Budget
Operating OpenStack on a BudgetOperating OpenStack on a Budget
Operating OpenStack on a Budget
 
Elatt Presentation
Elatt PresentationElatt Presentation
Elatt Presentation
 
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing Environment
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing EnvironmentDCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing Environment
DCSF19 Transforming a 15+ Year Old Semiconductor Manufacturing Environment
 
How to Build a Compute Cluster
How to Build a Compute ClusterHow to Build a Compute Cluster
How to Build a Compute Cluster
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
 
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2P
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2PAccelerating Server Hardware Upgrades with PlateSpin Migrate P2P
Accelerating Server Hardware Upgrades with PlateSpin Migrate P2P
 
The Foundation of the Software Defined Data Center
The Foundation of the Software Defined Data CenterThe Foundation of the Software Defined Data Center
The Foundation of the Software Defined Data Center
 
Virtualisation at Ringo
Virtualisation at RingoVirtualisation at Ringo
Virtualisation at Ringo
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin Orchestrate
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin Orchestrate
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin Orchestrate
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin Orchestrate
 
Run Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin OrchestrateRun Book Automation with PlateSpin Orchestrate
Run Book Automation with PlateSpin Orchestrate
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterprise
 
resume_new
resume_newresume_new
resume_new
 
The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)
 
unit 2.ppt
unit 2.pptunit 2.ppt
unit 2.ppt
 
Windows Azure introduction
Windows Azure introductionWindows Azure introduction
Windows Azure introduction
 
High Performance Cloud Computing
High Performance Cloud ComputingHigh Performance Cloud Computing
High Performance Cloud Computing
 

Mehr von The Linux Foundation

ELC2019: Static Partitioning Made Simple
ELC2019: Static Partitioning Made SimpleELC2019: Static Partitioning Made Simple
ELC2019: Static Partitioning Made SimpleThe Linux Foundation
 
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...The Linux Foundation
 
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...The Linux Foundation
 
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...The Linux Foundation
 
XPDDS19 Keynote: Unikraft Weather Report
XPDDS19 Keynote:  Unikraft Weather ReportXPDDS19 Keynote:  Unikraft Weather Report
XPDDS19 Keynote: Unikraft Weather ReportThe Linux Foundation
 
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...The Linux Foundation
 
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, Xilinx
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, XilinxXPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, Xilinx
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, XilinxThe Linux Foundation
 
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...The Linux Foundation
 
XPDDS19: Memories of a VM Funk - Mihai Donțu, Bitdefender
XPDDS19: Memories of a VM Funk - Mihai Donțu, BitdefenderXPDDS19: Memories of a VM Funk - Mihai Donțu, Bitdefender
XPDDS19: Memories of a VM Funk - Mihai Donțu, BitdefenderThe Linux Foundation
 
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...The Linux Foundation
 
OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...
 OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making... OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...
OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...The Linux Foundation
 
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, Citrix
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, CitrixXPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, Citrix
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, CitrixThe Linux Foundation
 
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltd
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltdXPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltd
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltdThe Linux Foundation
 
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...The Linux Foundation
 
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&D
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&DXPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&D
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&DThe Linux Foundation
 
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM Systems
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM SystemsXPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM Systems
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM SystemsThe Linux Foundation
 
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...The Linux Foundation
 
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...The Linux Foundation
 
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...The Linux Foundation
 
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSE
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSEXPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSE
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSEThe Linux Foundation
 

Mehr von The Linux Foundation (20)

ELC2019: Static Partitioning Made Simple
ELC2019: Static Partitioning Made SimpleELC2019: Static Partitioning Made Simple
ELC2019: Static Partitioning Made Simple
 
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...
XPDDS19: How TrenchBoot is Enabling Measured Launch for Open-Source Platform ...
 
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...
XPDDS19 Keynote: Xen in Automotive - Artem Mygaiev, Director, Technology Solu...
 
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...
XPDDS19 Keynote: Xen Project Weather Report 2019 - Lars Kurth, Director of Op...
 
XPDDS19 Keynote: Unikraft Weather Report
XPDDS19 Keynote:  Unikraft Weather ReportXPDDS19 Keynote:  Unikraft Weather Report
XPDDS19 Keynote: Unikraft Weather Report
 
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...
XPDDS19 Keynote: Secret-free Hypervisor: Now and Future - Wei Liu, Software E...
 
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, Xilinx
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, XilinxXPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, Xilinx
XPDDS19 Keynote: Xen Dom0-less - Stefano Stabellini, Principal Engineer, Xilinx
 
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...
XPDDS19 Keynote: Patch Review for Non-maintainers - George Dunlap, Citrix Sys...
 
XPDDS19: Memories of a VM Funk - Mihai Donțu, Bitdefender
XPDDS19: Memories of a VM Funk - Mihai Donțu, BitdefenderXPDDS19: Memories of a VM Funk - Mihai Donțu, Bitdefender
XPDDS19: Memories of a VM Funk - Mihai Donțu, Bitdefender
 
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
 
OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...
 OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making... OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...
OSSJP/ALS19: The Road to Safety Certification: How the Xen Project is Making...
 
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, Citrix
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, CitrixXPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, Citrix
XPDDS19: Speculative Sidechannels and Mitigations - Andrew Cooper, Citrix
 
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltd
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltdXPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltd
XPDDS19: Keeping Coherency on Arm: Reborn - Julien Grall, Arm ltd
 
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...
XPDDS19: QEMU PV Backend 'qdevification'... What Does it Mean? - Paul Durrant...
 
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&D
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&DXPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&D
XPDDS19: Status of PCI Emulation in Xen - Roger Pau Monné, Citrix Systems R&D
 
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM Systems
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM SystemsXPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM Systems
XPDDS19: [ARM] OP-TEE Mediator in Xen - Volodymyr Babchuk, EPAM Systems
 
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...
XPDDS19: Bringing Xen to the Masses: The Story of Building a Community-driven...
 
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...
XPDDS19: Will Robots Automate Your Job Away? Streamlining Xen Project Contrib...
 
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...
XPDDS19: Client Virtualization Toolstack in Go - Nick Rosbrook & Brendan Kerr...
 
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSE
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSEXPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSE
XPDDS19: Core Scheduling in Xen - Jürgen Groß, SUSE
 

Kürzlich hochgeladen

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Kürzlich hochgeladen (20)

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Xen @ Google, 2011

  • 1. Xen @ Google Iustin Pop, <iustin@google.com> Google Switzerland Sponsored by: & &
  • 2. Introduction Corporate infrastructure Technology Workflows Outlook Outline Introduction Talk overview Corporate infrastructure Overview Use cases Technology Open source components Internal components Workflows Common workflows Outlook Open-source road map Internal deployment road map
  • 3. Introduction Corporate infrastructure Technology Workflows Outlook Overview I will talk about. . . • virtualization at Google: • in the corporate (internal) infrastructure • this is not used for user-facing products (search, gmail, . . . ) • use cases, software used, tools and infrastructure Terminology I might use (accidentally): node physical machine (dom0) instance virtual machine (domU)
  • 4. Introduction Corporate infrastructure Technology Workflows Outlook Outline Introduction Talk overview Corporate infrastructure Overview Use cases Technology Open source components Internal components Workflows Common workflows Outlook Open-source road map Internal deployment road map
  • 5. Introduction Corporate infrastructure Technology Workflows Outlook Corporate infrastructure • comprises servers located in offices • support local office infrastructure • low-latency services (e.g. DNS, caches) • (very) small numbers of machines • spread across many offices • and servers located in datacenters • various purposes • just a few datacenters • but many machines per datacenter • note that we virtualise mostly Linux servers
  • 6. Introduction Corporate infrastructure Technology Workflows Outlook Office deployments • how to provide (redundant) services with as few machines as possible? • some offices are remote enough that timely replacement of parts is not a given • other offices are big enough to need multiple, redundant copies of services • initial use of Xen (early 2006), and start of tool development • this allowed shrinking footprint down to 2-4 machines • and improved reliability against hardware failures • however it somewhat decreased software reliability
  • 7. Introduction Corporate infrastructure Technology Workflows Outlook Datacenter deployments • in DCs we have multiple use cases: • again corporate infrastructure, e.g. DNS, LDAP, etc. • team servers/one-off applications • virtual workstations • and the challenges are different: • scaling management software to many machines • capacity planning • redundancy across racks • intra and inter-DC VM moves
  • 8. Introduction Corporate infrastructure Technology Workflows Outlook Use case: server consolidation • the “standard” way to use virtualisation • reduces hardware/footprint/power • services owned by dedicated services team or by a small team • redundancy can be implemented at VM level or at service level • resource guarantees needs can vary from “very strict” to relaxed • interesting corner cases • services which cannot afford the downtime of live migration • services which cannot afford the performance penalty of virtualisation
  • 9. Introduction Corporate infrastructure Technology Workflows Outlook Use case: desktop virtualisation • internal project named Ubiquity • allows every engineer to have a virtual workstation in the “internal cloud” (a nearby datacenter) • accessible over either SSH or NX • advantages: • workstation state stored in the “cloud”, not on (less managed) physical workstation • workstation closer to datacenter-based services • easier to provision more hardware in a datacenter than in a (possibly space-restricted) office • workstations can follow people as they travel • potential issues: • depending on hardware refresh cycles, a dedicated physical workstation can be more powerful than a shared virtual one • latency to datacenter can sometimes be a problem
  • 10. Introduction Corporate infrastructure Technology Workflows Outlook Use case: machine management layer • some workloads are too big for a shared environment • but virtualisation has other advantages beside consolidation: • independence from hardware (well, storage. . . ) • the hypervisor layer can abstract/unify hardware monitoring • much easier to move to new platforms • hence the use of virtualisation in single-VM-per-machine model, aka “dedicated” model • still in testing • what do to when size of VM smaller than size of HW? • currently investigating a “hard-partitioned” model: • share machines, but do not oversubscribe any resource • try to isolate CPU cores, disk spindles, network, RAM
  • 11. Introduction Corporate infrastructure Technology Workflows Outlook Outline Introduction Talk overview Corporate infrastructure Overview Use cases Technology Open source components Internal components Workflows Common workflows Outlook Open-source road map Internal deployment road map
  • 12. Introduction Corporate infrastructure Technology Workflows Outlook Overview • we deploy Xen. . . • on standard (off-the-shelf) x86 hardware (amd64) • on top of standard operating systems (Debian and Ubuntu) • in paravirtualised mode • no SAN/NAS: compute nodes are storage nodes as well • layered software model: machine ⇒ cluster ⇒ fleet • machine level handles hardware and hypervisor management • cluster level abstracts machines: • all resources are internal to, and managed by the cluster • software scales from one to a few hundred physical machines • upper level deals with clusters, not machines • fleet level abstracts clusters: • end-users do not care about specific clusters (maybe geographic location)
  • 13. Introduction Corporate infrastructure Technology Workflows Outlook Machine level: OS/hypervisor • we use Xen as KVM still has some disadvantages for us: • mostly performance-related, but note that also Xen pvops is not as good as Xen “native” (2.6.18 patch) • but the field is still evolving • and we could convert easily from one to the other • we use Debian stable/Ubuntu LTS as base OS: • choice of OS is due to many Debian developers in the team • standard OS install, just trimmed down • standard tools for base OS configuration (cfengine/puppet) • all machine installs are fully automated • at this level, it’s mostly what you would get from a plain Debian + Xen install
  • 14. Introduction Corporate infrastructure Technology Workflows Outlook Cluster level • we use Ganeti as virtualisation manager • it supports other hypervisors but we only use Xen • for storage, we use mostly DRBD (network-level RAID 1), and also simple LVM storage • cluster layout • physical machines (“nodes”) are organised in “node groups” • multiple node groups constitute a cluster • the node group is the default mobility domain for the VMs • no single point of failure for the cluster • one machine acts as “cluster master”, but this role can be moved • no external resource dependencies (especially storage) • no network-level services required for the cluster operation • all software at cluster level is open-sourced
  • 15. Cluster diagram SSH access Remote API Ganeti node Ganeti node Ganeti node Per machine monitoring (external) Per machine monitoring (external) Per machine monitoring (external) Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node current master node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node Ganeti node group Ganeti node group Ganeti node group Ganeti cluster
  • 16. Introduction Corporate infrastructure Technology Workflows Outlook Fleet level • we integrate with internal systems at this level • this is done via internal software (not open source) • both generic (to Google): • monitoring • machine database • and specific to Ganeti-in-Google: • web interface to the clusters (code name Virgil) • cluster-level configuration management (Dradis) • machine (repair) workflow manager (Euripides) • these are related to hardware work-flows, not virtualisation • the generic components have open-source alternatives • “Ganeti Web Manager” is an open source web console • no known equivalents for Dradis and Euripides • large deployments of Ganeti will most likely need to reimplement them
  • 17. Introduction Corporate infrastructure Technology Workflows Outlook Fleet organisation • clusters are split (categorised) according to customer type • clusters of the same type and in the same region are split in two “maintenance windows” • allow for maintenance work on only half of the clusters in a region • compensates for the fact that the cluster is a single point of failure for a given VM • Virgil talks to all the clusters and provides fleet overview • such meta-level organisation is implemented at Ganeti level via cluster tags • tags are used for many other tasks that cannot be expressed directly at Ganeti level
  • 18. Fleet diagram Ganeti cluster Ganeti cluster type Office type General no maint window maint window A Virgil Euripides Office ZURICH Dradis Ganeti cluster type General maint window B Ganeti cluster type General Ganeti cluster maint window A type Ubiquity no maint window Ganeti cluster type Dedicated Ganeti cluster maint window A type General maint window B Datacenter Z VM Ganeti cluster tra type Dedicated ns Ganeti cluster maint window B fe r type Ubiquity no maint window Ganeti cluster type General maint window A Datacenter X Datacenter Y
  • 19. Introduction Corporate infrastructure Technology Workflows Outlook Other internal tools • machine history console • displays physical machine history • ties into monitoring, hardware repairs process, life-cycle, etc. • rolling-reboot tool • allows rebooting an entire cluster without VM impact • uses live migration and sequential reboots • ganeti-capacity: a capacity planning tool • computes simulated cluster capacity • VM specs versus physical resources, space, power • soon to be open sourced, not related to internal systems • and many other small tools • notification of owners per cluster/physical machine • monitoring and resource dashboards • etc.
  • 20. Introduction Corporate infrastructure Technology Workflows Outlook Known issues • VM clock issues: • a long-standing problem • we still see cases where VM clocks are rolled back 3000s due to machine clock-source problems • IO issues • DRBD + Xen much lower performance than just DRBD or just Xen • In general, hard to model I/O performance
  • 21. Introduction Corporate infrastructure Technology Workflows Outlook Outline Introduction Talk overview Corporate infrastructure Overview Use cases Technology Open source components Internal components Workflows Common workflows Outlook Open-source road map Internal deployment road map
  • 22. Introduction Corporate infrastructure Technology Workflows Outlook VM allocation 1. Virgil gets an allocation Ganeti cluster Alloc request type General RAPI interface request (region, cluster type) 2. creates machine record (DNS, other systems) Ganeti cluster type General 3. selects “best” cluster based on VM spec, capacity data Virgil Ganeti cluster 4. and tells it to create the VM type Ubiquity scan capacity 5. cluster selects best physical machine(s) to host the VM Ganeti cluster Machine DB 6. VM is created, and OS type Dedicated installation scripts are run • install software Monitoring gather capacity • configure authentication
  • 23. Introduction Corporate infrastructure Technology Workflows Outlook Handling machine failures Euripides Send machine Virgil to repairs (2) Mark machine broken (3) Monitoring − detects fault (1) Send to repairs (5) Tell cluster to evacuate Ganeti HW the broken machine (4) Ganeti HW Ganeti cluster broken HW Ganeti HW Machine database
  • 24. Introduction Corporate infrastructure Technology Workflows Outlook Handling machine failures 1. monitoring detects a HW problem (e.g. disk error, memory problem, etc.) 2. Euripides (for non-critical problems) tells Virgil a machine needs to be sent to repairs • for critical events (machine dead), on-call person is paged, instructs Euripides how to proceed 3. Virgil first marks the machine as “not in production” 4. then tells the cluster to evacuate the VMs from it 5. finally requests repairs by local tech Note • for “known” errors, the process if fully automated • otherwise, an “exception” case is created for investigation
  • 25. Introduction Corporate infrastructure Technology Workflows Outlook Handling repaired/new machines Euripides Detects machine Machine DB was repaired (1) Tells Virgil to reintegrate machine (3) serving (6) Watches machine for 24hrs (2) Mark Add to cluster (5) Ganeti HW Rebalance (7) Virgil Ganeti HW Configure (4) repaired HW Dradis Ganeti HW
  • 26. Introduction Corporate infrastructure Technology Workflows Outlook Handling repaired/new machines 1. Euripides detects new or repaired machine in Machine DB • at first, it’s being kept “under watch” for a period of time 2. it tells Virgil to integrate new machine 3. Virgil calls Dradis to configure the machine appropriately 4. Virgil tells the cluster to add the new machine 5. finally the new machine is marked as serving 6. the cluster will be rebalanced in order to utilise the machine Note • assuming no errors in the OS installation, configuration, etc., the process is fully automated
  • 27. Introduction Corporate infrastructure Technology Workflows Outlook Internal cluster workflows • htools component shipped with Ganeti can • balance the cluster • compute cluster capacity • compute node evacuation strategy • do automatic selection of nodes for VM placement • plugin versus API • node evacuation and instance placement use internal “IAllocator” plugin framework • the other are command line tools that run talk to Ganeti using its external APIs • the documentation explains how to use all of these
  • 28. Introduction Corporate infrastructure Technology Workflows Outlook Outline Introduction Talk overview Corporate infrastructure Overview Use cases Technology Open source components Internal components Workflows Common workflows Outlook Open-source road map Internal deployment road map
  • 29. Introduction Corporate infrastructure Technology Workflows Outlook Software road map I • improve the cluster resource model • currently models only RAM/VCPUs/disk space as resources • will add spindles/networking (IO) resources • will add dynamic memory (ballooning/tmem) support • will improve support for non-Xen hypervisors • addressing these in the next releases • improve remote API: eliminate the need for SSH • some operations not available over RAPI • will integrate e.g. cluster capacity reporting • ongoing effort towards full parity • eventually SSH will not be needed for operations • will add “hard-partitioning” model (still being designed)
  • 30. Introduction Corporate infrastructure Technology Workflows Outlook Software road map II • improve the VM OS deployment model: • currently OS scripts run on the physical machine • this requires trusted source for OS templates • Ganeti users must write their own installation scripts • improve Xen CPU scheduler control • add smart LVM allocation • currently based on simple biggest-free-space model • optimisations possible for DRBD layout, flash usage, etc. • better handling of SAN/NAS storage
  • 31. Introduction Corporate infrastructure Technology Workflows Outlook Deployment road map I • looking at using ballooning/tmem • first need support in Ganeti and capacity planning tools • plan to start testing early next year • fleet refresh in progress • current fleet a mix of older and newer hardware • many machines still using only 1GbE • machine heterogeneity makes cluster algorithms more complex • investigating “embedded OS” dom0 model • current dom0 is a regular Linux distribution • sub-optimal with many machines performing the same role • all machines should be (roughly) identical at all times • aiming at an image-based setup in order to eliminate the installation step and individual package upgrades
  • 32. Introduction Corporate infrastructure Technology Workflows Outlook Deployment road map II dom0 kernel versions unification: • currently running a mix of kernel versions • originally we used 2.6.18 + “native” Xen patch • “native” Xen had better performance than “pvops” • for us, 2.6.3x kernels have I/O performance problems • still trying to identify a current, well performing kernel • this prevents us from properly cooperating with upstreams • would like to give feedback on stability and performance • but hard to track down patterns across multiple versions
  • 33. Introduction Corporate infrastructure Technology Workflows Outlook Questions? Thanks!
  • 34. Appendix Links Ganeti homepage http://code.google.com/p/ganeti Code repositories http://git.ganeti.org/ Documentation http://docs.ganeti.org/ganeti/current/html/ Ganeti Web Manager http://code.osuosl.org/projects/ganeti-webmgr Image-based OS template http://code.osuosl.org/projects/ganeti-image Presentation on virtual workstations http://neatx.googlecode.com/files/ herding-virtual-workstations-fisl-2009.pdf