SlideShare a Scribd company logo
1 of 28
Download to read offline
Microsof t Project Olympus
Hyperscale GPU Accelerator
(HGX-1)
Siamak Tavallaei
⎻ Principal Architect, Microsoft Azure Cloud Hardware Infrastructure
Robert Ober
⎻ Tesla Chief Platform Architect, NVIDIA Corp.
• Project Olympus Modular Architecture
• nVidia SXM2 with NVLink
• Collaborative Chassis Design with Ingrasys
• Enabling Components
• High-level Feature List
• Use cases
• Performance Advantages for various Workloads
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
Talk Outline
PROJECT OLYMPUS BASE
PROJECT OLYMPUS MODULAR ARCHITECTURE
Establishes a baseline for cloud-scale standard deployment
Datacenter management, power, cooling, performance
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links)
Industry-standard
Accelerated CLOUD COMPUTING
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) CHASSIS
Industry-standard
Accelerated CLOUD COMPUTING
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP (CNTK)
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric (CNTK)
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
Industry-standard
Accelerated CLOUD COMPUTING
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP (CNTK)
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric (CNTK)
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
Industry-standard
Accelerated CLOUD COMPUTING
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP (CNTK)
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric (CNTK)
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
Industry-standard
Accelerated CLOUD COMPUTING
Video
Transcoding
HPC
• Configurable and Flexible Accelerators
⎻ 8 x NVIDIA P100_SXM2 & NVLink
⎻ 8 x GPGPUs in PCIe Card Form Factor
• Expandable to Scale UP
⎻ From one to four Chassis
⎻ Internal PCIe Fabric Interconnect
• Scale Out via InfiniBand Fabric
• Host Head Node Options
⎻ 2S Project Olympus Server
⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links)
⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
Industry-standard
Accelerated CLOUD COMPUTING
Enabling Components
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
• Flexible PCIe Interconnect Topology
• GPGPU-to-Host via high-BW PCIe Links
• Peer-to-peer without Host interaction
⎻ GPGPU peer-to-peer via NVLink
⎻ GPGPU peer-to-peer to IB NICs via x16 PCIe
Enabling Components
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
• Riser Boards
⎻ Plug into the Server Head Node
⎻ x16, x8 Type-A, x8 Type-B
• X8 OCuLink Cable/Connector
⎻ For Chassis-to-Chassis Interconnect
• Mezzanines
⎻ MEZZ1x16
⎻ Various PCIe Slot Configs.
Enabling Components
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
• Flexible PCIe Interconnect Topology
• Great peer-to-peer bandwidth
• Extensible as Chassis-to-Chassis Interconnect
Enabling Components
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
• Flexible PCIe Interconnect Topology
• Great peer-to-peer bandwidth
• Extensible as Chassis-to-Chassis Interconnect
Enabling Components
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
• Flexible Inter-Chassis PCIe Interconnect
Topology
Specification Highlights
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS HIGHLIGHTS
• 4U Chassis Form Factor
• Six 1600W PSUs (N+N)
• Twelve Fans (N+2)
• Sixteen x8 OCuLink Cables for External PCIe Interconnect (8 x16)
• 4 x FH¾L PCIe Cards + 8 x 300W GPGPUs (SXM2 or double-width FH¾L PCIe Form Factors)
• Node Management (AST2500/2400 BMC family, 1GbE Link to Rack Manager)
• Rack Management Sideband: 2x RJ45 Ports for OoB Power Management
• PCIe Fabric Management for multi-Chassis Configurations, multi-Hosting, and IO-Sharing
Specification Highlights
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS HIGHLIGHTS
• Flexible choice of GPGPUs
⎻ Eight Pascal P100 SXM2_NVLink
⎻ Various GPGPUs in double-width, 300W PCIe Card form factor
• Such as P100, P40, P4, M40, K80, M60 etc.
• High PCIe Bandwidth to Host Memory and for peer-to-peer
• Up to 4 PCIe-interconnected Chassis (with a dedicated PCIe Fabric Management Network)
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS
Use Cases &
Performance Advantage
For Various Workloads
19NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
CLOUD CHALLENGES
1 SKU, Multiple Instances
Integration into Existing Datacenter
INSTANCES
Granular, Latency Sensitive
High Throughput Batch
HPC: different CPU:GPU ratios
DevOps / Development
Production Deployment
PROJECT OLYMPUS HGX-1
HYPERSCALE GPU
ACCELERATOR
PARTNERSHIP + INTEROPERABILITY
20NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
Project Olympus
HGX-1
Hyperscale GPU
Accelerator
Configurable PCIe Cable to host + Expansion slots
NVIDIA P100 GPU
NVLink Hybrid Cube Mesh Fabric
20 Gbyte/sec per link Duplex
Adapters for other GPUs
21NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
DEEP LEARNING
2 CPU : 8 GPU
8x P100 SXM2 | 4x x16 PCIe
2 CPU : 16 GPU
16x P100 SXM2 | 4x x16 PCIe
CPUCPU CPUCPU
22NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
HPC
4 CPU : 8 GPU
8x P100 SXM2 | 4x x16 PCIe
8 CPU : 8 GPU
8x P100 SXM2 | 8x x16 PCIe
23NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
WORKLOAD OPTIMIZED PERFORMANCE
HPC : QUDA
High Energy Physics Application
CPU 8X K80 8X
P100
X4 2X
P100
8 CPU : 8 GPU
8x P100 SXM2 | 8x x16 PCIe
2 CPU : 8 GPU
8x P100 SXM2 | 4x x16 PCIe
• To augment the performance of Project Olympus Servers, we have
collaborated with Ingrasys and nVidia on a PCIe Expansion Box we call:
⎻ Project Olympus Hyperscale GPU Accelerator (HGX-1)
• We are contributing this specification and its associated product/design
to OCP
PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
Summary
SpecificationsMechanical CAD Schematics &
Board Files
OCP Contributions
https://github.com/opencomputeproject/Project_Olympus
Project Olympus
Hyperscale GPU
Accelerator (HGX-1)
Author:
Siamak Tavallaei, Principal Architect
Siamak Tavallaei
Principal Architect, Microsoft
Siamak Tavallaei is a Principal Architect at Microsoft’s Azure division. Collaborating with industry
partners, he drives a number of initiatives in research, design, and deployment of hardware for
Microsoft’s cloud-scale services such as Azure, Bing, Office 365, Exchange, and SQL across a global
datacenter footprint. With over 30 patents and 27 years of computer industry experience, he has
been instrumental in development and evolution of innovative multi-processor servers and
technology initiatives in areas of storage and memory hierarchy as well as heterogeneous,
distributed computing. He held the rank of Principal Member Technical Staff at Compaq and was a
Distinguished Technologist at Hewlett-Packard before joining Microsoft. He is interested in Big
Compute, Big Data, and Artificial Intelligence solutions based on distributed, heterogeneous,
accelerated, and energy-efficient computing. His current focus is the optimization of large-scale,
mega-datacenters for general-purpose computing and accelerated, tightly-connected, problem-
solving machines built on collaborative designs of hardware, software, and management.
Rob Ober
Chief Platform Architect, Tesla Datacenter Products
At NVIDIA Rob works with hyperscales like Microsoft to define the Tesla GPU platforms. Previously
Rob was Senior Fellow at SanDisk, FusionIO, LSI, AMD and Chief Architect at Infineon. Rob has more
than 30 years experience in computer architecture, has more than 40 international patents in
processors and systems, and has a degree in Systems Design Engineering from the University of
Waterloo in Canada.
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)

More Related Content

What's hot

Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 

What's hot (20)

Lenovo HPC Strategy Update
Lenovo HPC Strategy UpdateLenovo HPC Strategy Update
Lenovo HPC Strategy Update
 
Trends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient PerformanceTrends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient Performance
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stack
 
A Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei EnterpriseA Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei Enterprise
 
TAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platformTAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platform
 
BXI: Bull eXascale Interconnect
BXI: Bull eXascale InterconnectBXI: Bull eXascale Interconnect
BXI: Bull eXascale Interconnect
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWER
 
DOME 64-bit μDataCenter
DOME 64-bit μDataCenterDOME 64-bit μDataCenter
DOME 64-bit μDataCenter
 
IBM HPC Transformation with AI
IBM HPC Transformation with AI IBM HPC Transformation with AI
IBM HPC Transformation with AI
 
Programming Models for Exascale Systems
Programming Models for Exascale SystemsProgramming Models for Exascale Systems
Programming Models for Exascale Systems
 
Manycores for the Masses
Manycores for the MassesManycores for the Masses
Manycores for the Masses
 
Utilizing AMD GPUs: Tuning, programming models, and roadmap
Utilizing AMD GPUs: Tuning, programming models, and roadmapUtilizing AMD GPUs: Tuning, programming models, and roadmap
Utilizing AMD GPUs: Tuning, programming models, and roadmap
 
MIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platformMIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platform
 
Nvidia SC16: The Greatest Challenges Can't Wait
Nvidia SC16: The Greatest Challenges Can't WaitNvidia SC16: The Greatest Challenges Can't Wait
Nvidia SC16: The Greatest Challenges Can't Wait
 
Tesla Accelerated Computing Platform
Tesla Accelerated Computing PlatformTesla Accelerated Computing Platform
Tesla Accelerated Computing Platform
 
The Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACCThe Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACC
 
State of Containers and the Convergence of HPC and BigData
State of Containers and the Convergence of HPC and BigDataState of Containers and the Convergence of HPC and BigData
State of Containers and the Convergence of HPC and BigData
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
 
SCFE 2020 OpenCAPI presentation as part of OpenPWOER Tutorial
SCFE 2020 OpenCAPI presentation as part of OpenPWOER TutorialSCFE 2020 OpenCAPI presentation as part of OpenPWOER Tutorial
SCFE 2020 OpenCAPI presentation as part of OpenPWOER Tutorial
 

Viewers also liked

Mobile Finance: 2017 Trends and Innovations
Mobile Finance: 2017 Trends and InnovationsMobile Finance: 2017 Trends and Innovations
Mobile Finance: 2017 Trends and Innovations
Corporate Insight
 
The HPE Machine and Gen-Z - BUD17-503
The HPE Machine and Gen-Z - BUD17-503The HPE Machine and Gen-Z - BUD17-503
The HPE Machine and Gen-Z - BUD17-503
Linaro
 
RDMA on ARM
RDMA on ARMRDMA on ARM
RDMA on ARM
inside-BigData.com
 
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
inside-BigData.com
 
Multi-Physics Methods, Modeling, Simulation & Analysis
Multi-Physics Methods, Modeling, Simulation & AnalysisMulti-Physics Methods, Modeling, Simulation & Analysis
Multi-Physics Methods, Modeling, Simulation & Analysis
inside-BigData.com
 

Viewers also liked (20)

Video: The State of the Solid State Drive SSD
Video: The State of the Solid State Drive SSDVideo: The State of the Solid State Drive SSD
Video: The State of the Solid State Drive SSD
 
HPC Computing Trends
HPC Computing TrendsHPC Computing Trends
HPC Computing Trends
 
Introduction to GPUs in HPC
Introduction to GPUs in HPCIntroduction to GPUs in HPC
Introduction to GPUs in HPC
 
ARM and Machine Learning
ARM and Machine LearningARM and Machine Learning
ARM and Machine Learning
 
Mobile Finance: 2017 Trends and Innovations
Mobile Finance: 2017 Trends and InnovationsMobile Finance: 2017 Trends and Innovations
Mobile Finance: 2017 Trends and Innovations
 
The HPE Machine and Gen-Z - BUD17-503
The HPE Machine and Gen-Z - BUD17-503The HPE Machine and Gen-Z - BUD17-503
The HPE Machine and Gen-Z - BUD17-503
 
Next Generation Data Centers – Are you ready for scale?
Next Generation Data Centers – Are you ready for scale?Next Generation Data Centers – Are you ready for scale?
Next Generation Data Centers – Are you ready for scale?
 
RDMA on ARM
RDMA on ARMRDMA on ARM
RDMA on ARM
 
Exascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing WorldExascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing World
 
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
 
Ibm systems servidor linux power8 de 1u ibm power system s821 lc un servidor ...
Ibm systems servidor linux power8 de 1u ibm power system s821 lc un servidor ...Ibm systems servidor linux power8 de 1u ibm power system s821 lc un servidor ...
Ibm systems servidor linux power8 de 1u ibm power system s821 lc un servidor ...
 
Intersect360 Top of All Things in HPC Snapshot Analysis
Intersect360 Top of All Things in HPC Snapshot AnalysisIntersect360 Top of All Things in HPC Snapshot Analysis
Intersect360 Top of All Things in HPC Snapshot Analysis
 
IDC HPC Market Update
IDC HPC Market UpdateIDC HPC Market Update
IDC HPC Market Update
 
SC17 Exhibitor Prospectus
SC17 Exhibitor ProspectusSC17 Exhibitor Prospectus
SC17 Exhibitor Prospectus
 
Application Profiling at the HPCAC High Performance Center
Application Profiling at the HPCAC High Performance CenterApplication Profiling at the HPCAC High Performance Center
Application Profiling at the HPCAC High Performance Center
 
2016 IDC HPC Market Update
2016 IDC HPC Market Update2016 IDC HPC Market Update
2016 IDC HPC Market Update
 
HPC Market Update from IDC
HPC Market Update from IDCHPC Market Update from IDC
HPC Market Update from IDC
 
Multi-Physics Methods, Modeling, Simulation & Analysis
Multi-Physics Methods, Modeling, Simulation & AnalysisMulti-Physics Methods, Modeling, Simulation & Analysis
Multi-Physics Methods, Modeling, Simulation & Analysis
 
AgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSqlAgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSql
 
Re-Architecting Spark For Performance Understandability
Re-Architecting Spark For Performance UnderstandabilityRe-Architecting Spark For Performance Understandability
Re-Architecting Spark For Performance Understandability
 

Similar to Microsoft Project Olympus AI Accelerator Chassis (HGX-1)

"Performance Evaluation, Scalability Analysis, and Optimization Tuning of A...
"Performance Evaluation,  Scalability Analysis, and  Optimization Tuning of A..."Performance Evaluation,  Scalability Analysis, and  Optimization Tuning of A...
"Performance Evaluation, Scalability Analysis, and Optimization Tuning of A...
Altair
 
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
OpenStack
 
GTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWERGTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWER
Achronix
 

Similar to Microsoft Project Olympus AI Accelerator Chassis (HGX-1) (20)

Introduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AI
 
Evolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionEvolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server Solution
 
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
 
Heterogeneous Computing on POWER - IBM and OpenPOWER technologies to accelera...
Heterogeneous Computing on POWER - IBM and OpenPOWER technologies to accelera...Heterogeneous Computing on POWER - IBM and OpenPOWER technologies to accelera...
Heterogeneous Computing on POWER - IBM and OpenPOWER technologies to accelera...
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
"Performance Evaluation, Scalability Analysis, and Optimization Tuning of A...
"Performance Evaluation,  Scalability Analysis, and  Optimization Tuning of A..."Performance Evaluation,  Scalability Analysis, and  Optimization Tuning of A...
"Performance Evaluation, Scalability Analysis, and Optimization Tuning of A...
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
 
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
Building a GPU-enabled OpenStack Cloud for HPC - Blair Bethwaite, Monash Univ...
 
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC ComputingHPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
 
POWER9 for AI & HPC
POWER9 for AI & HPCPOWER9 for AI & HPC
POWER9 for AI & HPC
 
Demystify OpenPOWER
Demystify OpenPOWERDemystify OpenPOWER
Demystify OpenPOWER
 
ODSA Sub-Project Launch
 ODSA Sub-Project Launch ODSA Sub-Project Launch
ODSA Sub-Project Launch
 
ODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Sub-Project Launch
ODSA Sub-Project Launch
 
IBM Power8 announce
IBM Power8 announceIBM Power8 announce
IBM Power8 announce
 
GTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWERGTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWER
 
Infrastructure optimization for seismic processing (eng)
Infrastructure optimization for seismic processing (eng)Infrastructure optimization for seismic processing (eng)
Infrastructure optimization for seismic processing (eng)
 
E3MV - Embedded Vision - Sundance
E3MV - Embedded Vision - SundanceE3MV - Embedded Vision - Sundance
E3MV - Embedded Vision - Sundance
 
QNAP NAS打造私有雲平台
QNAP NAS打造私有雲平台QNAP NAS打造私有雲平台
QNAP NAS打造私有雲平台
 
Sven Vogel: Running CloudStack and OpenShift with NetApp on KVM
Sven Vogel: Running CloudStack and OpenShift with NetApp on KVMSven Vogel: Running CloudStack and OpenShift with NetApp on KVM
Sven Vogel: Running CloudStack and OpenShift with NetApp on KVM
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 

Microsoft Project Olympus AI Accelerator Chassis (HGX-1)

  • 1.
  • 2. Microsof t Project Olympus Hyperscale GPU Accelerator (HGX-1) Siamak Tavallaei ⎻ Principal Architect, Microsoft Azure Cloud Hardware Infrastructure Robert Ober ⎻ Tesla Chief Platform Architect, NVIDIA Corp.
  • 3. • Project Olympus Modular Architecture • nVidia SXM2 with NVLink • Collaborative Chassis Design with Ingrasys • Enabling Components • High-level Feature List • Use cases • Performance Advantages for various Workloads PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) Talk Outline
  • 4. PROJECT OLYMPUS BASE PROJECT OLYMPUS MODULAR ARCHITECTURE Establishes a baseline for cloud-scale standard deployment Datacenter management, power, cooling, performance
  • 5. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links) Industry-standard Accelerated CLOUD COMPUTING PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1)
  • 6. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links) PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) CHASSIS Industry-standard Accelerated CLOUD COMPUTING
  • 7. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP (CNTK) ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric (CNTK) • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links) PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) Industry-standard Accelerated CLOUD COMPUTING
  • 8. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP (CNTK) ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric (CNTK) • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links) PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) Industry-standard Accelerated CLOUD COMPUTING
  • 9. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP (CNTK) ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric (CNTK) • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (via sixteen x8 PCIe Links) PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) Industry-standard Accelerated CLOUD COMPUTING Video Transcoding HPC
  • 10. • Configurable and Flexible Accelerators ⎻ 8 x NVIDIA P100_SXM2 & NVLink ⎻ 8 x GPGPUs in PCIe Card Form Factor • Expandable to Scale UP ⎻ From one to four Chassis ⎻ Internal PCIe Fabric Interconnect • Scale Out via InfiniBand Fabric • Host Head Node Options ⎻ 2S Project Olympus Server ⎻ 1S, 2S, 4S Server Head Nodes (eight x16 PCIe Links) ⎻ Up to 16 Head Nodes (sixteen x8 PCIe Links) PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS Industry-standard Accelerated CLOUD COMPUTING
  • 11. Enabling Components PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS • Flexible PCIe Interconnect Topology • GPGPU-to-Host via high-BW PCIe Links • Peer-to-peer without Host interaction ⎻ GPGPU peer-to-peer via NVLink ⎻ GPGPU peer-to-peer to IB NICs via x16 PCIe
  • 12. Enabling Components PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS • Riser Boards ⎻ Plug into the Server Head Node ⎻ x16, x8 Type-A, x8 Type-B • X8 OCuLink Cable/Connector ⎻ For Chassis-to-Chassis Interconnect • Mezzanines ⎻ MEZZ1x16 ⎻ Various PCIe Slot Configs.
  • 13. Enabling Components PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS • Flexible PCIe Interconnect Topology • Great peer-to-peer bandwidth • Extensible as Chassis-to-Chassis Interconnect
  • 14. Enabling Components PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS • Flexible PCIe Interconnect Topology • Great peer-to-peer bandwidth • Extensible as Chassis-to-Chassis Interconnect
  • 15. Enabling Components PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS • Flexible Inter-Chassis PCIe Interconnect Topology
  • 16. Specification Highlights PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS HIGHLIGHTS • 4U Chassis Form Factor • Six 1600W PSUs (N+N) • Twelve Fans (N+2) • Sixteen x8 OCuLink Cables for External PCIe Interconnect (8 x16) • 4 x FH¾L PCIe Cards + 8 x 300W GPGPUs (SXM2 or double-width FH¾L PCIe Form Factors) • Node Management (AST2500/2400 BMC family, 1GbE Link to Rack Manager) • Rack Management Sideband: 2x RJ45 Ports for OoB Power Management • PCIe Fabric Management for multi-Chassis Configurations, multi-Hosting, and IO-Sharing
  • 17. Specification Highlights PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS HIGHLIGHTS • Flexible choice of GPGPUs ⎻ Eight Pascal P100 SXM2_NVLink ⎻ Various GPGPUs in double-width, 300W PCIe Card form factor • Such as P100, P40, P4, M40, K80, M60 etc. • High PCIe Bandwidth to Host Memory and for peer-to-peer • Up to 4 PCIe-interconnected Chassis (with a dedicated PCIe Fabric Management Network)
  • 18. PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR CHASSIS Use Cases & Performance Advantage For Various Workloads
  • 19. 19NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. CLOUD CHALLENGES 1 SKU, Multiple Instances Integration into Existing Datacenter INSTANCES Granular, Latency Sensitive High Throughput Batch HPC: different CPU:GPU ratios DevOps / Development Production Deployment PROJECT OLYMPUS HGX-1 HYPERSCALE GPU ACCELERATOR PARTNERSHIP + INTEROPERABILITY
  • 20. 20NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. Project Olympus HGX-1 Hyperscale GPU Accelerator Configurable PCIe Cable to host + Expansion slots NVIDIA P100 GPU NVLink Hybrid Cube Mesh Fabric 20 Gbyte/sec per link Duplex Adapters for other GPUs
  • 21. 21NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. DEEP LEARNING 2 CPU : 8 GPU 8x P100 SXM2 | 4x x16 PCIe 2 CPU : 16 GPU 16x P100 SXM2 | 4x x16 PCIe CPUCPU CPUCPU
  • 22. 22NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. HPC 4 CPU : 8 GPU 8x P100 SXM2 | 4x x16 PCIe 8 CPU : 8 GPU 8x P100 SXM2 | 8x x16 PCIe
  • 23. 23NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. WORKLOAD OPTIMIZED PERFORMANCE HPC : QUDA High Energy Physics Application CPU 8X K80 8X P100 X4 2X P100 8 CPU : 8 GPU 8x P100 SXM2 | 8x x16 PCIe 2 CPU : 8 GPU 8x P100 SXM2 | 4x x16 PCIe
  • 24. • To augment the performance of Project Olympus Servers, we have collaborated with Ingrasys and nVidia on a PCIe Expansion Box we call: ⎻ Project Olympus Hyperscale GPU Accelerator (HGX-1) • We are contributing this specification and its associated product/design to OCP PROJECT OLYMPUS HYPERSCALE GPU ACCELERATOR (HGX-1) Summary
  • 25. SpecificationsMechanical CAD Schematics & Board Files OCP Contributions https://github.com/opencomputeproject/Project_Olympus Project Olympus Hyperscale GPU Accelerator (HGX-1) Author: Siamak Tavallaei, Principal Architect
  • 26. Siamak Tavallaei Principal Architect, Microsoft Siamak Tavallaei is a Principal Architect at Microsoft’s Azure division. Collaborating with industry partners, he drives a number of initiatives in research, design, and deployment of hardware for Microsoft’s cloud-scale services such as Azure, Bing, Office 365, Exchange, and SQL across a global datacenter footprint. With over 30 patents and 27 years of computer industry experience, he has been instrumental in development and evolution of innovative multi-processor servers and technology initiatives in areas of storage and memory hierarchy as well as heterogeneous, distributed computing. He held the rank of Principal Member Technical Staff at Compaq and was a Distinguished Technologist at Hewlett-Packard before joining Microsoft. He is interested in Big Compute, Big Data, and Artificial Intelligence solutions based on distributed, heterogeneous, accelerated, and energy-efficient computing. His current focus is the optimization of large-scale, mega-datacenters for general-purpose computing and accelerated, tightly-connected, problem- solving machines built on collaborative designs of hardware, software, and management.
  • 27. Rob Ober Chief Platform Architect, Tesla Datacenter Products At NVIDIA Rob works with hyperscales like Microsoft to define the Tesla GPU platforms. Previously Rob was Senior Fellow at SanDisk, FusionIO, LSI, AMD and Chief Architect at Infineon. Rob has more than 30 years experience in computer architecture, has more than 40 international patents in processors and systems, and has a degree in Systems Design Engineering from the University of Waterloo in Canada.