©ARM 2017
ARM HPC Ecosystem
Darren Cepulis
HPC Segment Manager
ARM Business Segment Group
HPC Forum, Santa Fe, NM
19th April 2017
ARM Collaboration for Exascale Programs
Japan
Fujitsu and RIKEN announced that the
Post-K system, targeted at Exascale, will
be based on ARMv8 with the new Scalable
Vector Extension (SVE).
China
James Lin, vice director of the Center
for HPC at Shanghai Jiao Tong University,
says China will build three pre-Exascale
prototypes to select the architecture
for its Exascale system.
The three prototypes are based on
AMD, Sunway TaihuLight, and ARMv8.
United States
ARM is currently a participant in
two Department of Energy funded
pre-Exascale projects: Data
Movement Dominates and Fast
Forward 2.
European Union
Through FP7 and Horizon 2020,
ARM has been involved in several
funded pre-Exascale projects
including the Mont Blanc program
which deployed one of the first
ARM prototype HPC systems.
ARM HPC deployments starting in 2H2017
Two recent announcements about ARM in HPC in Europe:
Japan Exascale
slides from Fujitsu at ISC’16
Foundational SW Ecosystem for HPC
• Linux OSes – Red Hat, SUSE, CentOS, Ubuntu,…
• Compilers – ARM, GNU, LLVM,…
• Libraries – ARM, OpenBLAS, BLIS, ATLAS, FFTW,…
• Parallelism – OpenMP, OpenMPI, MVAPICH2,…
• Debugging – Allinea, Rogue Wave TotalView, GDB,…
• Analysis – ARM, Allinea, HPCToolkit, TAU,…
• Job schedulers – LSF, PBS Pro, SLURM,…
• Cluster mgmt – Bright, CMU, Warewulf,…
Legend: open-source and commercial components; predictable baseline
OpenHPC defines a baseline. It is a community effort to
provide a common, verified set of open source packages for
HPC deployments
ARM’s participation:
• Silver member of OpenHPC
• ARM is on the OpenHPC Technical Steering Committee in order to drive ARM architecture build support
Status (April 2017):
• 1.3.0 release out now
• All packages built on ARMv8 for both CentOS and SUSE
• ARM-based machines are being used for builds in the OpenHPC build infrastructure at TACC
• https://github.com/arm-hpc/ohpc/
OpenHPC: now on ARM
Functional area | Supported packages / components
Base OS | RHEL/CentOS 7.1, SLES 12
Administrative tools | Conman, Ganglia, Lmod, LosF, ORCM, Nagios, pdsh, prun
Provisioning | Warewulf
Resource mgmt. | SLURM, Munge, Altair PBS Pro
I/O services | Lustre client (community version)
Numerical, scientific libraries | Boost, GSL, FFTW, Metis, PETSc, Trilinos, Hypre, SuperLU, Mumps
I/O libraries | HDF5 (pHDF5), NetCDF (including C++ and Fortran interfaces), Adios
Compiler families | GNU (gcc, g++, gfortran)
MPI families | OpenMPI, MVAPICH2
Development tools | Autotools (autoconf, automake, libtool), Valgrind, R, SciPy/NumPy
Performance tools | PAPI, Intel IMB, mpiP, pdtoolkit, TAU
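
As a rough illustration of how one might check which pieces of such a baseline are present on a node, the sketch below looks up a few commands on the PATH. The selection of commands (`gcc`, `gfortran`, `mpicc`, `sbatch`) and the names `BASELINE_COMMANDS` and `find_baseline_tools` are assumptions made for this example; they are not part of OpenHPC, and a real installation may expose its tools only after loading environment modules.

```python
import shutil

# Hypothetical selection of commands, chosen for illustration only:
# GNU compilers, an MPI compiler wrapper, and the SLURM submit command.
BASELINE_COMMANDS = {
    "GNU C compiler": "gcc",
    "GNU Fortran compiler": "gfortran",
    "MPI C compiler wrapper": "mpicc",
    "SLURM batch submission": "sbatch",
}


def find_baseline_tools(commands=BASELINE_COMMANDS):
    """Map each label to the full path of its command, or None if absent."""
    return {label: shutil.which(cmd) for label, cmd in commands.items()}


if __name__ == "__main__":
    for label, path in find_baseline_tools().items():
        print(f"{label}: {path or 'not found on PATH'}")
```

A missing entry only means the command is not on the current PATH; it does not show that the package is not installed.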
ARM HPC tools commercial portfolio
• ARM C/C++ Compiler: commercially supported for HPC applications
• ARM Code Advisor: actionable advice to optimize your code
• ARM SVE C/C++ Compiler: compiler support for the ARM Scalable Vector Extension
• ARM Instruction Emulator: develop software for tomorrow’s hardware today
• ARM Performance Libraries: BLAS, LAPACK and FFT, micro-architecturally tuned
• Allinea Forge (DDT+MAP): code debugging and profiling tools for HPC applications
• Allinea Performance Reports: monitor and tune application performance in the field
ARM Compiler
Commercially supported C/C++ compiler for Linux user-space HPC applications
LLVM-based
• ARM-on-ARM compiler
• For application development (not bare-metal/embedded)
Fortran in the works
• Leveraging ongoing PGI FLANG collaboration
OpenMP
• Uses latest open source (now ARM-optimized) LLVM OpenMP runtime
ARM Performance Libraries
Optimized BLAS, LAPACK and FFT
Commercial 64-bit ARMv8 math libraries
• Commonly used low-level math routines: BLAS, LAPACK and FFT.
• Validated with NAG’s test suite, a de facto standard.
Best-in-class performance with commercial support
• Tuned by ARM for Cortex-A72, Cortex-A57 and Cortex-A53.
• Maintained and supported by ARM for a wide range of ARM-based SoCs.
• Regular benchmarking against open source alternatives.
Silicon partners can provide tuned micro-kernels for their SoCs
• Partners can collaborate directly, working with our source code and test suite.
• Alternatively, they can contribute through the open source route.
Commercially supported by ARM
Validated with NAG test suite
Performance on par with best-in-class math libraries
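
For readers unfamiliar with BLAS, the sketch below spells out what the level-3 routine DGEMM computes in its non-transposed form, C ← αAB + βC, as a plain reference loop. It is not the ARM Performance Libraries interface and says nothing about how a tuned library implements the operation; the function name `dgemm_reference` and the list-of-lists matrix layout are assumptions made for this example.

```python
def dgemm_reference(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for matrices given as lists of rows.

    Reference semantics only: a tuned BLAS computes the same result using
    blocked, vectorized micro-kernels rather than this triple loop.
    """
    m, k = len(a), len(b)
    n = len(b[0]) if b else 0
    shapes_ok = (
        all(len(row) == k for row in a)
        and all(len(row) == n for row in b)
        and len(c) == m
        and all(len(row) == n for row in c)
    )
    if not shapes_ok:
        raise ValueError("incompatible matrix shapes")
    return [
        [
            alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
            for j in range(n)
        ]
        for i in range(m)
    ]


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    c = [[1.0, 1.0], [1.0, 1.0]]
    print(dgemm_reference(2.0, a, b, 3.0, c))
```

A reference loop of this kind is mainly useful for checking the output of an optimized library on small inputs.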
Open source in the ARM HPC ecosystem
• Many open-source applications have been ported to ARMv8 for a variety of HPC sectors:
Further reading:
www.arm.com/hpc
https://www.nextplatform.com/2017/03/15/arm-antes-hpc-software-stack/
Server OS support solidifies for ARM in 2016
• SUSE (SLES 12) providing first commercial enterprise Linux distribution optimized for ARM-based 64-bit servers - https://www.suse.com/products/arm
• Red Hat Enterprise Linux Server for ARM (RHELSA) 7.3 release with single-host virtualization based on KVM, Ceph for prototyping and deploying, and initial support for Red Hat Developer Toolset (latest stable open source C and C++) - http://red.ht/2flVOH7
• CentOS 7.3 for ARM64 released with greatly improved support for Cavium ThunderX SoC family - https://lists.centos.org/pipermail/centos-announce/2017-January/022193.html
• Canonical and ARM collaborate on commercial Ubuntu OpenStack and Ceph for ARMv8 servers - https://insights.ubuntu.com/2016/10/17/canonical-and-arm-collaborate-on-openstack/
• FreeBSD 11 released - ARM joins x86 as a primary supported architecture - https://www.freebsdfoundation.org/blog/freebsd-takes-open-source-to-11-with-latest-release/ and https://www.freebsd.org/releases/11.0R/relnotes.html
• Xen hypervisor 4.8 released - Updates focus on ARM server enhancements, security hardening and quality code - http://www.marketwired.com/press-release/xen-project-hypervisor-continues-to-expand-into-embedded-use-cases-in-latest-release-2181499.htm
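
Operating systems do not all report 64-bit ARM under the same name: Linux distributions typically use `aarch64`, while some other systems use `arm64`. The sketch below is one way a portable script might detect a 64-bit ARM host. The helper name `is_arm64` and the set of accepted names are assumptions made for this example, not something the listed distributions prescribe.

```python
import platform

# Machine names assumed, for this sketch, to indicate 64-bit ARM (ARMv8 AArch64).
ARM64_NAMES = {"aarch64", "arm64"}


def is_arm64(machine=None):
    """Return True if the given (or current) machine name denotes 64-bit ARM."""
    name = platform.machine() if machine is None else machine
    return name.lower() in ARM64_NAMES


if __name__ == "__main__":
    print(f"{platform.machine()}: 64-bit ARM = {is_arm64()}")
```

Passing the machine name explicitly, as in `is_arm64("aarch64")`, makes the check easy to exercise on a non-ARM development machine.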
