SlideShare ist ein Scribd-Unternehmen logo
1 von 51
HP Insight CMU
Cluster Management Utility Tour
Sébastien Cabaniols, CMU WW Team lead / EMEA HPC Presales consultant
7th HPC Day conference, Kiev, Ukraine
October 2012

© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Agenda
   HP Insight CMU Introduction & Review
               Introduction, History / Customers
               Product mindset


   Insight CMU v7.0 tour
               Provisioning (Cloning / Autoinstall / Diskless)
               Monitoring ( TimeView / Collectl / GPGPUs…)
               Scalable/Frictionless administration ( cmudiff…)
               Custom GUI & partners integration
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU

Introduction & History




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU introduction
                                                         CMU = Cluster Management Utility
                                           ‚CMU optimizes the TCO of compute farms‛

   CMU scaling specification: 4k nodes

   CMU has lots of industrial clusters in production with 2k/3k+ nodes

   CMU has a strong presence in the TOP500 (www.top500.org)

   CMU at customer site since 2000
    4   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU major milestones

2000: initial implementation for Tru64 Unix (Alphaserver)‫‏‬

2001: port to Alpha Linux, 1600 servers commercial cluster

2002: port to x86 & IA64 Linux / HPUX Itanium

2004: port to x86_64 Linux. (only port maintained*)

2007: Swedish gov, 6th @ TOP500

2010: Tsubame 2, HP first public 1+ PFlop cluster, 5th @ TOP500

*ARM port in progress...
 5   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Worldwide CMU Deployments
                             HP ships >2 CMU clusters per week WW
          UNIVERSITIES                                                                                                                 ENGINEERING




              GOVERNMENT and RESEARCH LABS                                                                                              ENERGY
6
    6
              April 2009
    © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU project mindset




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU project mindset
                     CMU provides the core functionalities for a farm/cloud
   runs any HP* server (even mix) / any Linux distribution (even mix)‫‏‬

   independent of many architectural aspects of the system:

        interconnects / GPGPUs / IO accelerators...
        network topology (open cluster, guarded cluster, WAN…)

        batch/job schedulers, MPI stacks, math libraries, compilers....



                                    CMU is not a ‘predefined’ SW supercomputer appliance


 >90% systems delivered as ‚turn-key solutions‛
 CMU can also be purchased standalone with support and manuals
    8   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU v7.0 tour

      CMU functionalities / typical CMU implementation
      CMU Provisioning
      CMU Monitoring
      CMU Scalable / Frictionless administration
      CMU Custom GUI & partners integration




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU starts here: typical { farm / HPC cluster } implementation

                                                                                                    { high speed Interconnect }




  { Highly Avail.}Head node


© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight Cluster Management Utility Basics
 CMU is a single package running on the head node (upgrade is trivial)
CMU mgt node can be an HA cluster (HP service guard,Redhat Cluster, SLES HA)

 Provides a full fledged interactive CLI


 Provides cmu_* commands as an API (for scripting)
For integration with other software or command-line activity (see partner’s integration)


 Provides GUI client for single dashboard control
launch from a web page served from head node (JAVA© webstart technology)
run on a local laptop/desktop
user mode for monitoring
admin mode for remote administrationcontained herein is subject to change without notice.
  11 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information
The three pillars of HP Insight CMU
              Provision                                                                     Monitor                                      Control
     • Simplified discovery,                                                   • ‘At a glance’ view of                             • GUI & CLI & API
        firmware audits                                                           entire system /                                  • Easy GUI, friction-
     • Fast & scalable                                                            partition                                         less control of
        cloning                                                                • Customizable                                       remote servers
     • Legacy support of                                                       • Lightweight                                       • Scalable pdsh with
        Kickstart/Autoyast/D                                                   • Instant 2D view                                    cmudiff analyser
        ebian Preseed                                                          • TimeView, 3D live
     • Diskless support                                                           history


© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU v7.0 Tour
       CMU Provisioning


                   Scalable Cloning
                   Legacy/Compatible Autoinstall (Kickstart/Autoyast/Preseed)
                   Diskless
                   Firmware audit
                   Bare metal netboot low level tools (hpacucli / hponcf / ipmitool)



© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU provisioning engine
 backup/cloning (up to 4k nodes)‫‏‬

 RHEL (& clones) / SLES ( & clones) / Debian / Ubuntu (only compute nodes)‫‏‬
 performance only depends on the image size & harddrive speed
     no architectural dependence on trunked/IB/10gig networks, 1Gig ethernet is sufficient.

 22 minutes to clone 1000 nodes with SAS drives and 10 GB image
      continuous cloning for reprovisionning from batch schedulers

 autoinstall ‫ ‏‬CMU bridge to legacy/standard tools
              :

 Redhat Kickstart / SLES Autoyast / Debian Preseed
      do not use above 100 nodes

 diskless (advised if improving the density of the solution and/or data security)

 statefull diskless engine (hybrid NFS ro + rw personalities)
 14   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU firmware / bare metal netboot tools

 firmware hooks:

 firmware version checks (HP conrep based currently / HP rcu soon)
 firmware settings audit (HP conrep + cmudiff)
 firmware flashing engine (to feed with SCEXE HP files)

 bare metal netboot tools (available from ‚pre_reconf ‚/ ‚reconf‛ )

 hpacucli : configure HP smartarray controllers
 locfg.pl/ hponcfg: configure HP ILO from the CMU netboot environment
 ipmitools: configure IPMI capable BMC from the CMU netboot environment




 19   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU v7.0 Tour
   CMU Monitoring
               scalable / ‘HPC aware’ monitoring engine (collectl, GPGPUS)
               2D Instant View / 3D Time View (Live History)




    © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU monitoring
   Backend: ‚HPC aware‛ monitoring since years
       Scalable monitoring ( proven on 4k nodes system )
       Non intrusive (leverage collectl + ‚HPC synchro‛ mode)
       Programmable (monitor anything you can script )
           Nvidia & AMD GPGPUs monitoring tool
       Extended Monitoring to inject arbitrary monitoring data
       Alerting system & CMU Reactions


   Frontend GUI (JAVA client/server) / CLI
       GUI: Instant view 2D / TimeView 3D (Live History)
       cmu_dynamic_user groups (see later in presentation)
    21 CLI/API: cmu_monstat and flat human readable files
         © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
‚Instant View*‛ CMU Display




* renamed « Instant View » since CMU v7.0
22   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
23   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
TimeView (Live History)




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Existing ‘well known’ CMU Display since 2004




25   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
3D Display of Sensor Histories
readability, efficiency, precision




26
     2
     © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
3D Display of Sensor Histories
global job overview




27   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
28   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
29   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
30   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
31   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
GPGPUs monitoring




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU GPGPU Support
CMU provides a tool for extracting GPU metric data
from GPU driver

‚cmu_get_nvidia_gpu‛ monitors:

load, mem_util, mem_alloc, power_state, and ECC_double_bit
alerts by default
Power_usage, various clock speeds, fan speeds, and
temperature also configured but commented out by default


 33   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
37   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

F
3ooter goes here
Extended monitoring




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU Extended Monitoring
Inject monitoring data from another source into CMU

Extended metrics will be used for:
Server hardware metrics (ILO4 out-of-band & agentless monitoring)
• Temperatures, fan speeds, power usage
• Gathered out-of-band
• OS-neutral
Cluster peripherals
• MCS temperatures, switch status
Workload schedulers
 39   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU alerts & reactions




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU alerts & CMU reactions

CMU monitoring engine can trigger alerts
CMU alerts can trigger scripts as reactions to alerts

reaction examples:
• SNMP traps (send all alerts to an SNMP sink such as HPSIM)
• Send an email
• Remove a compute node from a batch scheduler…


44   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU v7.0 Tour
     Scalable / Frictionless monitoring
                 Interactive command broadcast (ssh, BMC interfaces)
                 cmudiff non interactive scalable command output analyzer
                 GUI accelerators (power off / UID leds/ three clicks ‘en masse’ cloning….)




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU GUI basics
Cluster mgmt
panel
displays all
nodes in
selected
groupings: by
switch
location; by
image; or by
custom
grouping

     node states
     display
     current state
     of each node                                                              CMU Main                                                 Alerts displayed
47
                                                                               Display Panel
     © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
                                                                                                                                        along the bottom
CMU GUI Basics
–   Right-click to select sensors to
    display

–   CMU pre-configured with standard
    sensors: CPU and memory usage,
    and disk and network I/O

–   Simple to add any sensor or alert

–   CMU provides simple support for
    monitoring GPU temp and ECC
    errors

–   Three clicks to clone compute nodes
    !

     48   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Friction-less remote control of target nodes
                Selected                                       Power
                nodes                                                                                     Broadcast
                                                               commands
                                                                                                          commands
                                                                                                                                        Provisioning
                                                                                                                                        commands




                                                                                                                                                 User-defined
                                                                                                                                                 commands




49   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU remote management commands
•    Multi-window broadcast command (access OS or console)




51
      type here…
     © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
                                                                                                                                        ...and see it there
cmudiff

scaling the command line.




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Compare node outputs with Scalable Text Analyser (cmudiff)

                                                                                                                                         Single-window pdsh with
                                                                                                                                           cmu_diff example
                                                                                                                                            One command
                                                                                                                                            executed across a
                                                                                                                                            set of selected
                                                                                                                                            nodes…

                                                                                                                                           …finds one node
                                                                                                                                           running with an old
                                                                                                                                           BIOS version!




 53   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
dshbak vs cmudiff: round #1…. ‘date’ on five hosts


                                                                                                                                                 cmudiff
dshbak




         57   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
dshbak vs cmudiff: round #2..‘ifconfig’ on 3 hosts


                                                                                                                                                  cmudiff
dshbak




         58    © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
              58 HP Confidential
Partners software integration
& Custom menu GUI




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
62   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
63   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
CMU Custom Menu Support

                                               /opt/cmu/etc/cmu_custom_menu




64   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
cmu_dynamic_user_groups




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU as a (job) power monitor




70   © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Insight CMU Partner Integrations

   Moab – Dynamic Provisioning
   PBS Pro – Green Scheduling & OS Provisioning
   LSF – Platform HPC
   ScaleMP – create large virtual SMP nodes
   StackIQ – CMU part of HP ‚roll‛

   HP Matrix CMU CloudMap




© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Thank you for your interest in
HP Insight CMU




 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Weitere ähnliche Inhalte

Was ist angesagt?

Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Linaro
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
Edge AI and Vision Alliance
 

Was ist angesagt? (20)

Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you
 
Understanding the endianess and the benefits of RHEL for Power, little endian
Understanding the endianess and the benefits of RHEL for Power, little endianUnderstanding the endianess and the benefits of RHEL for Power, little endian
Understanding the endianess and the benefits of RHEL for Power, little endian
 
RedHat Linux
RedHat LinuxRedHat Linux
RedHat Linux
 
Red Hat for IBM System z IBM Enterprise2014 Las Vegas
Red Hat for IBM System z IBM Enterprise2014 Las Vegas Red Hat for IBM System z IBM Enterprise2014 Las Vegas
Red Hat for IBM System z IBM Enterprise2014 Las Vegas
 
Linux Containers and Docker SHARE.ORG Seattle 2015
Linux Containers and Docker SHARE.ORG Seattle 2015Linux Containers and Docker SHARE.ORG Seattle 2015
Linux Containers and Docker SHARE.ORG Seattle 2015
 
UplinQ - ubuntu linux on the qualcomm® snapdragon™ 600 processor
UplinQ - ubuntu linux on the qualcomm® snapdragon™ 600 processorUplinQ - ubuntu linux on the qualcomm® snapdragon™ 600 processor
UplinQ - ubuntu linux on the qualcomm® snapdragon™ 600 processor
 
IPMI is dead, Long live Redfish
IPMI is dead, Long live RedfishIPMI is dead, Long live Redfish
IPMI is dead, Long live Redfish
 
All in one
All in oneAll in one
All in one
 
Picking a distro_1_
Picking a distro_1_Picking a distro_1_
Picking a distro_1_
 
Ugif 09 2013 psm
Ugif 09 2013   psmUgif 09 2013   psm
Ugif 09 2013 psm
 
IBM Systems Technical Symposium Melbourne, 2015
IBM Systems Technical Symposium Melbourne, 2015IBM Systems Technical Symposium Melbourne, 2015
IBM Systems Technical Symposium Melbourne, 2015
 
SHARE.ORG Orlando 2015
SHARE.ORG Orlando 2015SHARE.ORG Orlando 2015
SHARE.ORG Orlando 2015
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
 
Develop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyDevelop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster Ready
 
Introduction to nfv movilforum
Introduction to nfv   movilforumIntroduction to nfv   movilforum
Introduction to nfv movilforum
 
5 pipeline arch_rationale
5 pipeline arch_rationale5 pipeline arch_rationale
5 pipeline arch_rationale
 
ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...
ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...
ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...
 
Simple Virtualization Overview
Simple Virtualization OverviewSimple Virtualization Overview
Simple Virtualization Overview
 
NFV features in kubernetes
NFV features in kubernetesNFV features in kubernetes
NFV features in kubernetes
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
 

Andere mochten auch (8)

SGI HPC DAY 2011 Kiev
SGI HPC DAY 2011 KievSGI HPC DAY 2011 Kiev
SGI HPC DAY 2011 Kiev
 
Itpe brief
Itpe briefItpe brief
Itpe brief
 
Secuirty based hellman protocols
Secuirty based hellman protocolsSecuirty based hellman protocols
Secuirty based hellman protocols
 
SGI - HPC-29mai2012
SGI - HPC-29mai2012SGI - HPC-29mai2012
SGI - HPC-29mai2012
 
Extreme networks - network design principles for hpc @ hpcday 2012 kiev
Extreme networks - network design principles for hpc @ hpcday 2012 kievExtreme networks - network design principles for hpc @ hpcday 2012 kiev
Extreme networks - network design principles for hpc @ hpcday 2012 kiev
 
Fujifilm - where zettabytes lives @ hpc day 2012 kiev
Fujifilm - where zettabytes lives @ hpc day 2012 kievFujifilm - where zettabytes lives @ hpc day 2012 kiev
Fujifilm - where zettabytes lives @ hpc day 2012 kiev
 
SGI HPC Update for June 2013
SGI HPC Update for June 2013SGI HPC Update for June 2013
SGI HPC Update for June 2013
 
Hp kiev hpcday_20121012
Hp kiev hpcday_20121012Hp kiev hpcday_20121012
Hp kiev hpcday_20121012
 

Ähnlich wie Hp cmu – easy to use cluster management utility @ hpcday 2012 kiev

4 metals workshop igor quintao
4   metals workshop igor quintao4   metals workshop igor quintao
4 metals workshop igor quintao
GE_Energy
 
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
IBM Switzerland
 
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"..."Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
Edge AI and Vision Alliance
 
Pre-configured Cloud Solutions
Pre-configured Cloud Solutions Pre-configured Cloud Solutions
Pre-configured Cloud Solutions
Sougata Mitra
 

Ähnlich wie Hp cmu – easy to use cluster management utility @ hpcday 2012 kiev (20)

Hp moonshot Server
Hp moonshot Server Hp moonshot Server
Hp moonshot Server
 
HP Moonshot system
HP Moonshot systemHP Moonshot system
HP Moonshot system
 
4 metals workshop igor quintao
4   metals workshop igor quintao4   metals workshop igor quintao
4 metals workshop igor quintao
 
Nagios Conference 2014 - Dave Williams - Multi-Tenant Nagios Monitoring
Nagios Conference 2014 - Dave Williams - Multi-Tenant Nagios MonitoringNagios Conference 2014 - Dave Williams - Multi-Tenant Nagios Monitoring
Nagios Conference 2014 - Dave Williams - Multi-Tenant Nagios Monitoring
 
z/VM and OpenStack
z/VM and OpenStackz/VM and OpenStack
z/VM and OpenStack
 
DHPA Techday 2015 - Johan Benning - HP Mobility
DHPA Techday 2015 - Johan Benning - HP MobilityDHPA Techday 2015 - Johan Benning - HP Mobility
DHPA Techday 2015 - Johan Benning - HP Mobility
 
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
Integrierte Experten Systeme_Erik-Werner Radtke_IBM Symposium 2013
 
Pivotal: Operationalizing 1000 Node Hadoop Cluster - Analytics Workbench
Pivotal: Operationalizing 1000 Node Hadoop Cluster - Analytics WorkbenchPivotal: Operationalizing 1000 Node Hadoop Cluster - Analytics Workbench
Pivotal: Operationalizing 1000 Node Hadoop Cluster - Analytics Workbench
 
Automation Evolution with Junos
Automation Evolution with JunosAutomation Evolution with Junos
Automation Evolution with Junos
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIs
 
Multi-OS Continuous Packaging with docker and Project-Builder.org
Multi-OS Continuous Packaging with docker and Project-Builder.orgMulti-OS Continuous Packaging with docker and Project-Builder.org
Multi-OS Continuous Packaging with docker and Project-Builder.org
 
Pivotal Container Service Overview
Pivotal Container Service Overview Pivotal Container Service Overview
Pivotal Container Service Overview
 
Überwachung virtueller Umgebungen
Überwachung virtueller UmgebungenÜberwachung virtueller Umgebungen
Überwachung virtueller Umgebungen
 
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"..."Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
 
Using GPUs to Handle Big Data with Java
Using GPUs to Handle Big Data with JavaUsing GPUs to Handle Big Data with Java
Using GPUs to Handle Big Data with Java
 
Innovate 2014: Get an A+ on Testing Your Enterprise Applications with Rationa...
Innovate 2014: Get an A+ on Testing Your Enterprise Applications with Rationa...Innovate 2014: Get an A+ on Testing Your Enterprise Applications with Rationa...
Innovate 2014: Get an A+ on Testing Your Enterprise Applications with Rationa...
 
Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012
 
HP Converged System One (CSO)
HP Converged System One (CSO)HP Converged System One (CSO)
HP Converged System One (CSO)
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
 
Pre-configured Cloud Solutions
Pre-configured Cloud Solutions Pre-configured Cloud Solutions
Pre-configured Cloud Solutions
 

Mehr von Volodymyr Saviak

Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
Volodymyr Saviak
 
Altair - compute manager your gateway to hpc cloud computing with pbs profess...
Altair - compute manager your gateway to hpc cloud computing with pbs profess...Altair - compute manager your gateway to hpc cloud computing with pbs profess...
Altair - compute manager your gateway to hpc cloud computing with pbs profess...
Volodymyr Saviak
 
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
Volodymyr Saviak
 
Mellanox hpc update @ hpcday 2012 kiev
Mellanox hpc update @ hpcday 2012 kievMellanox hpc update @ hpcday 2012 kiev
Mellanox hpc update @ hpcday 2012 kiev
Volodymyr Saviak
 
Alekseev hpc day 2011 Kiev
Alekseev hpc day 2011 KievAlekseev hpc day 2011 Kiev
Alekseev hpc day 2011 Kiev
Volodymyr Saviak
 
Petrenko hpc day 2011 Kiev
Petrenko hpc day 2011 KievPetrenko hpc day 2011 Kiev
Petrenko hpc day 2011 Kiev
Volodymyr Saviak
 
Kindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 KievKindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 Kiev
Volodymyr Saviak
 
Mellanox hpc day 2011 kiev
Mellanox hpc day 2011 kievMellanox hpc day 2011 kiev
Mellanox hpc day 2011 kiev
Volodymyr Saviak
 
Massive solutions hpc day 2011 kiev
Massive solutions hpc day 2011 kievMassive solutions hpc day 2011 kiev
Massive solutions hpc day 2011 kiev
Volodymyr Saviak
 

Mehr von Volodymyr Saviak (12)

Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
Technical supercomputers laboratory. & insitute of cybernetics of ukraine @ h...
 
Altair - compute manager your gateway to hpc cloud computing with pbs profess...
Altair - compute manager your gateway to hpc cloud computing with pbs profess...Altair - compute manager your gateway to hpc cloud computing with pbs profess...
Altair - compute manager your gateway to hpc cloud computing with pbs profess...
 
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
Nvidia kepler architecture performance efficiency availability @ hpcday 2012 ...
 
Mellanox hpc update @ hpcday 2012 kiev
Mellanox hpc update @ hpcday 2012 kievMellanox hpc update @ hpcday 2012 kiev
Mellanox hpc update @ hpcday 2012 kiev
 
Apc hpc day 2011 kiev
Apc hpc day 2011 kievApc hpc day 2011 kiev
Apc hpc day 2011 kiev
 
Golovinskiy hpc day 2011
Golovinskiy hpc day 2011Golovinskiy hpc day 2011
Golovinskiy hpc day 2011
 
Alekseev hpc day 2011 Kiev
Alekseev hpc day 2011 KievAlekseev hpc day 2011 Kiev
Alekseev hpc day 2011 Kiev
 
Petrenko hpc day 2011 Kiev
Petrenko hpc day 2011 KievPetrenko hpc day 2011 Kiev
Petrenko hpc day 2011 Kiev
 
Kindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 KievKindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 Kiev
 
Mellanox hpc day 2011 kiev
Mellanox hpc day 2011 kievMellanox hpc day 2011 kiev
Mellanox hpc day 2011 kiev
 
Massive solutions hpc day 2011 kiev
Massive solutions hpc day 2011 kievMassive solutions hpc day 2011 kiev
Massive solutions hpc day 2011 kiev
 
Nvidia hpc day 2011 kiev
Nvidia hpc day 2011 kievNvidia hpc day 2011 kiev
Nvidia hpc day 2011 kiev
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Hp cmu – easy to use cluster management utility @ hpcday 2012 kiev

  • 1. HP Insight CMU Cluster Management Utility Tour Sébastien Cabaniols, CMU WW Team lead / EMEA HPC Presales consultant 7th HPC Day conference, Kiev, Ukraine October 2012 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 2. Agenda  HP Insight CMU Introduction & Review  Introduction, History / Customers  Product mindset  Insight CMU v7.0 tour  Provisioning (Cloning / Autoinstall / Diskless)  Monitoring ( TimeView / Collectl / GPGPUs…)  Scalable/Frictionless administration ( cmudiff…)  Custom GUI & partners integration © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 3. Insight CMU Introduction & History © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 4. Insight CMU introduction CMU = Cluster Management Utility ‚CMU optimizes the TCO of compute farms‛  CMU scaling specification: 4k nodes  CMU has lots of industrial clusters in production with 2k/3k+ nodes  CMU has a strong presence in the TOP500 (www.top500.org)  CMU at customer site since 2000 4 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 5. Insight CMU major milestones 2000: initial implementation for Tru64 Unix (Alphaserver)‫‏‬ 2001: port to Alpha Linux, 1600 servers commercial cluster 2002: port to x86 & IA64 Linux / HPUX Itanium 2004: port to x86_64 Linux. (only port maintained*) 2007: Swedish gov, 6th @ TOP500 2010: Tsubame 2, HP first public 1+ PFlop cluster, 5th @ TOP500 *ARM port in progress... 5 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 6. Worldwide CMU Deployments HP ships >2 CMU clusters per week WW UNIVERSITIES ENGINEERING GOVERNMENT and RESEARCH LABS ENERGY 6 6 April 2009 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 7. Insight CMU project mindset © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 8. Insight CMU project mindset CMU provides the core functionalities for a farm/cloud  runs any HP* server (even mix) / any Linux distribution (even mix)‫‏‬  independent of many architectural aspects of the system: interconnects / GPGPUs / IO accelerators... network topology (open cluster, guarded cluster, WAN…) batch/job schedulers, MPI stacks, math libraries, compilers.... CMU is not a ‘predefined’ SW supercomputer appliance  >90% systems delivered as ‚turn-key solutions‛  CMU can also be purchased standalone with support and manuals 8 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 9. Insight CMU v7.0 tour  CMU functionalities / typical CMU implementation  CMU Provisioning  CMU Monitoring  CMU Scalable / Frictionless administration  CMU Custom GUI & partners integration © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 10. CMU starts here: typical { farm / HPC cluster } implementation { high speed Interconnect } { Highly Avail.}Head node © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 11. Insight Cluster Management Utility Basics  CMU is a single package running on the head node (upgrade is trivial) CMU mgt node can be an HA cluster (HP service guard,Redhat Cluster, SLES HA)  Provides a full fledged interactive CLI  Provides cmu_* commands as an API (for scripting) For integration with other software or command-line activity (see partner’s integration)  Provides GUI client for single dashboard control launch from a web page served from head node (JAVA© webstart technology) run on a local laptop/desktop user mode for monitoring admin mode for remote administrationcontained herein is subject to change without notice. 11 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information
  • 12. The three pillars of HP Insight CMU Provision Monitor Control • Simplified discovery, • ‘At a glance’ view of • GUI & CLI & API firmware audits entire system / • Easy GUI, friction- • Fast & scalable partition less control of cloning • Customizable remote servers • Legacy support of • Lightweight • Scalable pdsh with Kickstart/Autoyast/D • Instant 2D view cmudiff analyser ebian Preseed • TimeView, 3D live • Diskless support history © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 13. Insight CMU v7.0 Tour  CMU Provisioning  Scalable Cloning  Legacy/Compatible Autoinstall (Kickstart/Autoyast/Preseed)  Diskless  Firmware audit  Bare metal netboot low level tools (hpacucli / hponcf / ipmitool) © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 14. CMU provisioning engine  backup/cloning (up to 4k nodes)‫‏‬  RHEL (& clones) / SLES ( & clones) / Debian / Ubuntu (only compute nodes)‫‏‬  performance only depends on the image size & harddrive speed  no architectural dependence on trunked/IB/10gig networks, 1Gig ethernet is sufficient.  22 minutes to clone 1000 nodes with SAS drives and 10 GB image  continuous cloning for reprovisionning from batch schedulers  autoinstall ‫ ‏‬CMU bridge to legacy/standard tools :  Redhat Kickstart / SLES Autoyast / Debian Preseed  do not use above 100 nodes  diskless (advised if improving the density of the solution and/or data security)  statefull diskless engine (hybrid NFS ro + rw personalities) 14 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 15. CMU firmware / bare metal netboot tools  firmware hooks:  firmware version checks (HP conrep based currently / HP rcu soon)  firmware settings audit (HP conrep + cmudiff)  firmware flashing engine (to feed with SCEXE HP files)  bare metal netboot tools (available from ‚pre_reconf ‚/ ‚reconf‛ )  hpacucli : configure HP smartarray controllers  locfg.pl/ hponcfg: configure HP ILO from the CMU netboot environment  ipmitools: configure IPMI capable BMC from the CMU netboot environment 19 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 16. Insight CMU v7.0 Tour  CMU Monitoring  scalable / ‘HPC aware’ monitoring engine (collectl, GPGPUS)  2D Instant View / 3D Time View (Live History) © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 17. CMU monitoring  Backend: ‚HPC aware‛ monitoring since years  Scalable monitoring ( proven on 4k nodes system )  Non intrusive (leverage collectl + ‚HPC synchro‛ mode)  Programmable (monitor anything you can script )  Nvidia & AMD GPGPUs monitoring tool  Extended Monitoring to inject arbitrary monitoring data  Alerting system & CMU Reactions  Frontend GUI (JAVA client/server) / CLI  GUI: Instant view 2D / TimeView 3D (Live History)  cmu_dynamic_user groups (see later in presentation) 21 CLI/API: cmu_monstat and flat human readable files © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 18. ‚Instant View*‛ CMU Display * renamed « Instant View » since CMU v7.0 22 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 19. 23 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 20. TimeView (Live History) © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 21. Existing ‘well known’ CMU Display since 2004 25 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 22. 3D Display of Sensor Histories readability, efficiency, precision 26 2 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 23. 3D Display of Sensor Histories global job overview 27 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 24. 28 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 25. 29 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 26. 30 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 27. 31 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 28. GPGPUs monitoring © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 29. CMU GPGPU Support CMU provides a tool for extracting GPU metric data from GPU driver ‚cmu_get_nvidia_gpu‛ monitors: load, mem_util, mem_alloc, power_state, and ECC_double_bit alerts by default Power_usage, various clock speeds, fan speeds, and temperature also configured but commented out by default 33 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 30. 37 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. F 3ooter goes here
  • 31. Extended monitoring © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 32. CMU Extended Monitoring Inject monitoring data from another source into CMU Extended metrics will be used for: Server hardware metrics (ILO4 out-of-band & agentless monitoring) • Temperatures, fan speeds, power usage • Gathered out-of-band • OS-neutral Cluster peripherals • MCS temperatures, switch status Workload schedulers 39 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 33. CMU alerts & reactions © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 34. CMU alerts & CMU reactions CMU monitoring engine can trigger alerts CMU alerts can trigger scripts as reactions to alerts reaction examples: • SNMP traps (send all alerts to an SNMP sink such as HPSIM) • Send an email • Remove a compute node from a batch scheduler… 44 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 35. Insight CMU v7.0 Tour  Scalable / Frictionless monitoring  Interactive command broadcast (ssh, BMC interfaces)  cmudiff non interactive scalable command output analyzer  GUI accelerators (power off / UID leds/ three clicks ‘en masse’ cloning….) © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 36. Insight CMU GUI basics Cluster mgmt panel displays all nodes in selected groupings: by switch location; by image; or by custom grouping node states display current state of each node CMU Main Alerts displayed 47 Display Panel © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. along the bottom
  • 37. CMU GUI Basics – Right-click to select sensors to display – CMU pre-configured with standard sensors: CPU and memory usage, and disk and network I/O – Simple to add any sensor or alert – CMU provides simple support for monitoring GPU temp and ECC errors – Three clicks to clone compute nodes ! 48 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 38. Friction-less remote control of target nodes Selected Power nodes Broadcast commands commands Provisioning commands User-defined commands 49 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 39. CMU remote management commands • Multi-window broadcast command (access OS or console) 51 type here… © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. ...and see it there
  • 40. cmudiff scaling the command line. © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 41. Compare node outputs with Scalable Text Analyser (cmudiff) Single-window pdsh with cmu_diff example One command executed across a set of selected nodes… …finds one node running with an old BIOS version! 53 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 42. dshbak vs cmudiff: round #1…. ‘date’ on five hosts cmudiff dshbak 57 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 43. dshbak vs cmudiff: round #2..‘ifconfig’ on 3 hosts cmudiff dshbak 58 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 58 HP Confidential
  • 44. Partners software integration & Custom menu GUI © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 45. 62 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 46. 63 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 47. CMU Custom Menu Support /opt/cmu/etc/cmu_custom_menu 64 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 48. cmu_dynamic_user_groups © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 49. Insight CMU as a (job) power monitor 70 © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 50. Insight CMU Partner Integrations  Moab – Dynamic Provisioning  PBS Pro – Green Scheduling & OS Provisioning  LSF – Platform HPC  ScaleMP – create large virtual SMP nodes  StackIQ – CMU part of HP ‚roll‛  HP Matrix CMU CloudMap © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
  • 51. Thank you for your interest in HP Insight CMU © Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.