SlideShare ist ein Scribd-Unternehmen logo
1 von 48
Downloaden Sie, um offline zu lesen
1© 2018 Mellanox Technologies | Confidential
The Critical Path for HPC, Cloud and Machine Learning
Lugano April 9 2018
Advanced Networking
2© 2018 Mellanox Technologies | Confidential
Modern Data Centers
Orchestration SDSSDN
Multi-
Tenancy
On Demand
3© 2018 Mellanox Technologies | Confidential
The Challenge – Software Implementation
Performance
Programmability
4© 2018 Mellanox Technologies | Confidential
The Solution - Hardware Acceleration
Software defined everything is a key for modern data centers
With today’s software based solutions, functionality is (almost) there
To gain flexibility, performance and cost efficiency, Hardware Acceleration is needed
High Performance Workloads Can’t Deliver Without HW Acceleration
Software Software + Hardware Acceleration
5© 2018 Mellanox Technologies | Confidential
Advance Network
Technologies For
OpenStack
6© 2018 Mellanox Technologies | Confidential
Server
VM1 VM2 VM3 VM4
Overlay Networks
Overlay Network Advantages: Isolation, Simplicity, Scalability
Virtual Domain 3
Virtual Domain 2
Virtual Domain 1
Physical
View
Server
VM5 VM6 VM7 VM8
Virtual
View
NVGRE/VXLAN/Geneve Overlay Networks
7© 2018 Mellanox Technologies | Confidential
Turbocharge Overlay Networks
Overlay tunnels add network
processing
 Limits bandwidth
 Consumes CPU
System efficiency drops 10s of
percents
For penalty free overlays, at bare-
metal performance use NIC with
overlay Network HW offloads
 ConnectX-4 and ConnectX-5 family
Mellanox adapters also supports
VxLAN VTEP (encap/decap)
37.5
17.62
36.21
0.7
3.5
0.7
0
0.5
1
1.5
2
2.5
3
3.5
4
Physical VXLAN No Offloads VXLAN HW Offloads
0
5
10
15
20
25
30
35
40
CPU%Per1Gb/s
Bendwith(Gb/s)
40G/s ConnectX-3 Pro
8 VM Pairs BW 8 VM Pairs CPU
8© 2018 Mellanox Technologies | Confidential
Para-Virtualized SR-IOV
Single Root I/O Virtualization (SR-IOV)
PCIe device presents multiple
instances to the OS/Hypervisor
Enables Application Direct
Access
 Bare metal performance for VM
 Reduces CPU overhead
Enables many advanced NIC
features (e.g. DPDK, RDMA, ASAP2)
NIC
Hypervisor
vSwitch
VM VM
SR-IOV NIC
Hypervisor VM VM
eSwitch
Physical Function
(PF)
Virtual Function
(VF)
Fully Integrated And Upstream With OpenStack
9© 2018 Mellanox Technologies | Confidential
Per VF (SR-IOV) Quality of Service (QoS)
New Neutron API for
 Per VF Rate Limiting
 Per VF BW Guarantee
 Packet Pacing
Same model for ParaVirt and SR-
IOV
In SR-IOV mode, QoS is enforced
by HW
 Finer grain
 More predictable
 Less Jitter
 Less CPU utilization
Rate ShaperRate Shaper
QoS
Queue
Work
Queue
Work
Queue
Work
Queue
Priority 0
Arbiter
QoS
Queue
Work
Queue
Work
Queue
Work
Queue
QoS
Queue
Work
Queue
Work
Queue
Work
Queue
QoS
Queue
Work
Queue
Work
Queue
Work
Queue
Priority 1
Arbiter
RR
arbiter
RR
arbiter
RR
arbiter
RR
arbiter
Strict Priority
TC Group 0
DWRR
TC Group 1
DWRR
TC Group 7
DWRR
TC0
TC1
Flow
Ctrl
Flow
Ctrl
TC2
TC3
Flow
Ctrl
Flow
Ctrl
TC7
Flow
Ctrl...
HL
...
...Priority
0
Priority
1
Priority
2
Priority
3
Priority
7
Rate Limiter
Rate Limiter
Enhanced ETSPer VF Rate Limiter
Mellanox Advance HW QoS Implementation
10© 2018 Mellanox Technologies | Confidential
SR-IOV High Availability / VF LAG
 SR-IOV VMs don’t support bonding/HA
 Mellanox enable transparent SR-IOV HA on a single NIC
 LAG will be implemented on Mellanox NIC so VM will
only see a single Virtual Function (VF)
 Mode supported
 Active Passive (Single port BW)
 Active Active (Double port BW)
 LACP
NIC
Host
Virtual Function
VM
VF driver
User
Kernel
Virtual Function
Port 1 Port 2
LAG
11© 2018 Mellanox Technologies | Confidential
Tradeoffs Between Virtual Switch and SR-IOV
Virtual
Switch
SR-IOV
12© 2018 Mellanox Technologies | Confidential
Open Virtual Switch (OVS) Challenges
 Virtual switches such as Open vSwitch (OVS) are used as the
forwarding plane in the hypervisor
 Virtual switches implement extensive support for SDN (e.g.
enforce policies) and are widely used by the industry
 Supports L2-L3 networking features:
 L2 & L3 Forwarding, NAT, ACL, Connection Tracking etc.
 Flow based
 OVS Challenges:
 Awful Packet Performance: <1M w/ 2-4 cores,
 Burns CPU like Hell : Even w/ 12 cores, can’t get 1/3rd 100G NIC Speed
 Bad User Experience: High and unpredictable latency, packet drops
 Solution
 Offload OVS data plane into Mellanox NIC using ASAP2 technology
13© 2018 Mellanox Technologies | Confidential
ConnectX-5 Packet Processing Offload Capabilities
 Flow Tables
 Multiple, Programmable tables
 Dedicate, isolated tables for hypervisor and/or VMs
 Practically unlimited table size
 Can support million of rules/flows
 Classification
 Match on all header fields including encapsulated packets
 Flexible fields extraction by “Flexparse”
 Actions
 Steering
 Encap/Decap
 VXLAN, NVGRE, Geneve, MPLSoGRE/UDP, NSH
 Flex encap/decap
 Drop / Allow
 Mirror
 Flow ID
 Header rewrite
 Hairpin mode
14© 2018 Mellanox Technologies | Confidential
Accelerated Switching And Packet Processing (ASAP2)
 ASAP2 take advantage of ConnectX-5 capability to accelerate or offload “in host” network stack
 Family of solutions
ASAP2 Direct
Full vSwitch offload
ASAP2 Flex
vSwitch acceleration
ASAP2 Flex
VNF/VM acceleration
15© 2018 Mellanox Technologies | Confidential
ASAP2 Direct: Full OVS Offload
 Enable SR-IOV data path with OVS control plane
 In other words, enable support for most SDN controllers with SR-IOV data plane
 Use Open vSwitch to be the management interface and offload OVS data-
plane to Mellanox embedded Switch (eSwitch) using ASAP2 Direct
OVS-eSwitch
Netdev
Representor
Netdev
Representor
Netdev
Representor
Netdev
Representor
eSwitch
PF (wire)
Host IP interface Host exception path (user-space)
VF VF VF
netdev netdev
Para-virt Para-virt
Hypervisor
Representor Ports
VM
ConnectX-5 eSwitch
VM
Hypervisor
OVS
SR-IOV
VF
SR-IOV
VF
DataPath
PF
16© 2018 Mellanox Technologies | Confidential
OVS over DPDK VS. OVS Offload
ConnectX-5 provide significant performance
boost
 Without adding CPU resources
7.6
MPPS
66
MPPS
4 Cores
0 Cores
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
0
10
20
30
40
50
60
70
OVS over DPDK OVS Offload
NumberofDedicatedCores
MillionPacketPerSecond
Message Rate Dedicated Hypervisor Cores
Test ASAP2
Direct
OVS
DPDK
Benefit
1 Flow
VXLAN
66M PPS 7.6M PPS
(VLAN)
8.6X
60K flows
VXLAN
19.8M PPS 1.9M PPS 10.4X
17© 2018 Mellanox Technologies | Confidential
Remote Direct Memory Access (RDMA)
ZERO Copy Remote Data Transfer
Low Latency, High Performance Data Transfers
InfiniBand - 100Gb/s RoCE* – 100Gb/s
Kernel Bypass Protocol Offload
Application ApplicationUSER
KERNEL
HARDWARE
Buffer Buffer
18© 2018 Mellanox Technologies | Confidential
RDMA In Cloud
Enable RDMA applications to run on cloud
 Scientific
 HPC
 Machine Learning and AI
 Data bases
Accelerate cloud infrastructure
 VM migration over RDMA
 Message queue over RDMA (e.g. gRPC)
Accelerate cloud storage
 iSER
 NVMf
Cognitive Toolkit
19© 2018 Mellanox Technologies | Confidential
RDMA Provide Fastest OpenStack Block Storage Access
Using OpenStack Built-in components and management (Open-iSCSI, tgt target, Cinder), no additional
software is required, RDMA is already inbox and used by our OpenStack customers !
Hypervisor (KVM)
OS
VM
OS
VM
OS
VM
Adapter
Open-iSCSI w iSER
Compute Servers
Switching Fabric
iSCSI/iSER Target (tgt)
Adapter Local Disks
RDMA Cache
Storage Servers
OpenStack (Cinder)
Using RDMA
to accelerate
iSCSI storage
0
1000
2000
3000
4000
5000
6000
7000
1 2 4 8 16 32 64 128 256
Bandwidth[MB/s]
I/O Size [KB]
iSER 4 VMs Write
iSER 8 VMs Write
iSER 16 VMs Write
iSCSI Write 8 vms
iSCSI Write 16 VMs
PCIe Limit
6X
RDMA enables 6x More Bandwidth, 5x lower I/O latency, and lower CPU%
20© 2018 Mellanox Technologies | Confidential
NVMe Over Fabrics
Sharing NVMe based storage across multiple servers
 Better utilization: capacity, rack space, power
 Scalability, management, fault isolation
RDMA protocol is part of the standard
 InfiniBand or Ethernet (RoCE)
OpenStack Integration
 Cinder driver
* Roadmap
21© 2018 Mellanox Technologies | Confidential
Data Plane Development Kit (DPDK)
 What is DPDK?
 Set of open source libraries and drivers for fast packet processing
 What is the main usage and benefits of DPDK?
 Receive and send packets within the minimum number of CPU
cycles (usually less than 80 cycles)
 Develop fast packet capture algorithms
 Run third-party fast path stacks
 Can be used as an abstraction layer that will enable application
porting between CPU architectures
 DPDK in the cloud
 Accelerate virtual switches (i.e., OVS over DPDK)
 Enable Virtual Network Functions (VNFs)
22© 2018 Mellanox Technologies | Confidential
DPDK with Mellanox - Industry Leading Performance
Mellanoxwith
66% lower latency compared to competition
Highest Performance and Message Rate in the Market!!!
139.22
84.46
45.29
23.50
11.97 9.62 8.13
0.00
20.00
40.00
60.00
80.00
100.00
120.00
140.00
160.00
64 128 256 512 1024 1280 1518
Framerate[mpps]
Frame Size [B]
Lossless ConnectX-5 Ex 100GbE Frame Rate [Mpps]
16 cores
100GbE line rate
23© 2018 Mellanox Technologies | Confidential
DPDK with Mellanox – Secure & Cost Effective
S E C U R E
NIC based hardware memory protection
and translation by memory registration
and isolation per application
Benefits:
 Better Secured
 Supports Containerized DPDK
applications without SR-IOV
THROUGH
MEMORY
PROTECTION
In hardware
Allows concurrent use of DPDK and
NON-DPDK applications on the same NIC
unlike competition
Benefits:
 Save CapEx of dedicated DPDK NIC
C O S T E F F E C T I V E
Supporting multiple architectures
Benefits:
 Tightly integrated with processor
specific accelerators (Neon, AVX,
etc)
M U LT I A R C H
24© 2018 Mellanox Technologies | Confidential
OpenStack Over InfiniBand – The Route To
Extreme Performance
 Transparent InfiniBand integration into OpenStack
 Since Havana OpenStack release
 RDMA directly from VM
 Requires SR-IOV
 MAC to GUID mapping
 VLAN to pkey mapping
 InfiniBand SDN network
 Ideal fit for High Performance Computing Clouds
InfiniBand Enables The Highest Performance and Efficiency
25© 2018 Mellanox Technologies | Confidential
Ironic Ethernet and InfiniBand Support
 Ironic is OpenStack bare metal provisioning
 Useful for High Performance Compute (HPC) and Big Data
 Mellanox enabled Ironic support for IB and Eth
 Bare metal, multi-tenancy IB and Eth
 Provide zero touch VLAN switch provision
 Enable InfiniBand support for Ironic with Neutron using pkey segmentation and
OpenSM integration (via Neo/UFM)
I’m “Pixie Boots”
the mascot of the "Bear Metal"
Provisioning, a.k.a Ironic
26© 2018 Mellanox Technologies | Confidential
Comprehensive OpenStack Integration
Integrated with Major
OpenStack
Distributions
In-Box
Neturon-ML2
support for
mixed
environment
(VXLAN, PV,
SRIOV)
Ethernet
Neutron :
Hardware
support for
security and
isolation
Accelerating
storage
access by up
to 5X
OpenStack Plugins Create Seamless Integration , Control, & Management
27© 2018 Mellanox Technologies | Confidential
Machine Learning
Network Needs
28© 2018 Mellanox Technologies | Confidential
Neural Networks Complexity Growth
2014 2015 2016 2017
DeepSpeech DeepSpeech-2
DeepSpeech-3
30X
2013 2014 2015 2016
AlexNet GoogleNet
ResNet
Inception-V2
350X
Inception-V4
Image
Recognition
Speech
Recognition
PolyNet
29© 2018 Mellanox Technologies | Confidential
Training Challenges
Training with large data sets and increasing networks can take long time
 In some cases even weeks
In many cases training need to happen frequently
 Model development and tuning
 Real life use cases may require retraining regularly
Accelerate training time by scale out architecture
 Add workers (nodes) to reduce training time
Types of parallelism that are now popular
Data parallelism
Model parallelism
Network is critical element to accelerate Distributed Training!
30© 2018 Mellanox Technologies | Confidential
Model and Data Parallelism
Main Model/Parameter Server/Allreaduce
Local
Model
Mini
Batch
Mini
Batch
Mini
Batch
Mini
Batch
Mini
Batch
Local
Model
Local
Model
Local
Model
Local
Model
Local
Model
Mini
BatchData Data
Model Parallelism Data Parallelism
31© 2018 Mellanox Technologies | Confidential
Accelerating Data Parallelism
Data Parallelism communication pattern
 Gradient updates to parameter servers or among workers.
 Model parameters distribution among workers.
 Frequent – each training step due to the sequential nature of
SGD
 High bandwidth is needed, as models become larger and larger
 Number of parameters is increasing
 Usually characterized with Bursts on the network - workers are
synchronized
RDMA and GPU Direct Accelerates Model Parallelism
32© 2018 Mellanox Technologies | Confidential
What Is RDMA?
 Remote Direct Memory Access (RDMA)
 Advance transport protocol (same layer as TCP and UDP)
 Main features
 Remote memory read/write semantics in addition to send/receive
 Kernel bypass / direct user space access
 Full hardware offload
 Secure, channel based IO
 Application advantage
 Low latency
 High bandwidth
 Low CPU consumption
 RoCE: RDMA over Converged Ethernet
 Available for all Ethernet speeds 10 – 100G
 Verbs: RDMA SW interface (equivalent to sockets)
33© 2018 Mellanox Technologies | Confidential
GPUDirect™ RDMA Technology
34© 2018 Mellanox Technologies | Confidential
All Major Machine Learning Frameworks Support
RDMA
TensorFlow: Several implementations upstream
 Native (verbs) -
https://github.com/tensorflow/tensorflow/tree/master/tensorflow/contrib/
verbs
 MPI, Horavod – Donated by Uber among others
Caffe2: Over MPI or Gloo library
Microsoft Cognitive Toolkit: Native support
NVIDIA NCCL2: Native support in NCCL
Cognitive Toolkit
35© 2018 Mellanox Technologies | Confidential
TensorFlow with Mellanox RDMA Test Report
 System Configuration
 8 x x86 servers
 4 x NVIDIA P100 per server
 Mellanox 100G RDMA network
 NVMe driver per server
RDMA vs. TCP: Up to 50% Better Performance
Advanced RDMA vs. TCP: Up to 173% Better Performance
Reference Deployment Guide
36© 2018 Mellanox Technologies | Confidential
SHARP To Accelerate Parameter Server
 Bottleneck created by Parameter Server  SHARP Reduces Network Load and Latency
SCALE
∆𝑾 𝟏 ∆𝑾 𝟐 ∆𝑾 𝟑 ∆𝑾 𝑵
∑∆𝑾𝒊
∑∆𝑾𝒊
∑∆𝑾𝒊∑∆𝑾𝒊∑∆𝑾𝒊
𝒊=𝟏
𝒊=𝟑
∆𝑾𝒊
𝒊=𝑵−𝟑
𝒊=𝑵
∆𝑾𝒊
37© 2018 Mellanox Technologies | Confidential
Data Ingestion
 Data ingestion is the process of acquiring and preparing the input
 Preprocessing stage before accessing machine learning frameworks
 Examples
 Convert file/image formats
 Combine multiple data sources
 Clean noise / enhance input
 Relevant for training and inference
 Data Ingestion typically includes
 Access to storage (local, distributed, network storage)
 Pre-processing in a big data framework
such as Hadoop or Spark
Accelerate Data Ingest is critical for machine learning performance
38© 2018 Mellanox Technologies | Confidential
Data Pipeline
39© 2018 Mellanox Technologies | Confidential
Mellanox Is Driving High Performance Storage
A majority of these customer are doing NVMe-oF POCs or early development with us Today
40© 2018 Mellanox Technologies | Confidential
Accelerate Big Data - Enabling Real-time Decisions
Benchmark: TeraSort Benchmark: Cassandra Stress
0
200
400
600
800
1000
1200
Intel 10Gb/s Mellanox 10Gb/s Mellanox 40Gb/s
ExecutionTime(inseconds)
Ethernet Network
3X Faster
Benchmark: Fraud Detection
0
100
200
300
400
500
600
700
Existing Solution Aerospike with
Mellanox + Samsung
NVMe
TotalTransactionTime(inms)
CPU + Storage + Network Fraud Detection Algorithm
~2x more time for running
fraud detection algorithm
3X Faster Runtime! ~2X Faster Runtime! 25G BW for Database!
Mellanox is Certified by Leading Big Data Partners
41© 2018 Mellanox Technologies | Confidential
Spark Over RDMA – Accelerate Map/Reduce
Map
Reduce task
MapReduce
Map
Map
Map
Map
Input Map output
File
File
File
File
File
Driver
Reduce task
Reduce task
Reduce task
Reduce task
Fetch blocks
Fetch blocks
Fetch blocks
Fetch blocks
Fetch blocks
 Shuffling is very expensive in terms of CPU, RAM, disk and network IOs
 Spark Over RDMA allow to speed up shuffle operations
 RDMA is provided by “DiSNI” (Open-source Java interface to RDMA user libraries)
 https://github.com/zrlio/disni
42© 2018 Mellanox Technologies | Confidential
Spark over RDMA Performance Results: TeraSort
Testbed:
 HiBench TeraSort
 Workload: 175GB
 HDFS on Hadoop 2.6.0
 No replication
 Spark 2.2.0
 1 Master
 16 Workers
 28 active Spark cores on each node,
420 total
 Node info:
 Intel Xeon E5-2697 v3 @ 2.60GHz
 RoCE 100GbE
 256GB RAM
 HDD is used for Spark local directories
and HDFS
RDMA
Standard
0 10 20 30 40 50 60 70 80
seconds
43© 2018 Mellanox Technologies | Confidential
17%
23%
19%
0
20
40
60
80
100
120
140
160
Customer App #1 Customer App #2 HiBench TeraSort
Runtime(inseconds)
TCP
RDMA
Spark over RDMA: Real Applications Results
Runtime samples Input Size Nodes Cores per node RAM per node Improvement
Customer App #1 5GB 14 24 85GB 17%
Customer App #2 540GB 14 24 85GB 23%
HiBench TeraSort 300GB 15 28 256GB 19%
Lower is better
44© 2018 Mellanox Technologies | Confidential
Containers
45© 2018 Mellanox Technologies | Confidential
Containers Vs. Virtual Machines
Infrastructure Infrastructure
Operating System
Hypervisor
Libs
App
Libs
App
Libs
App
Operating System
Container Engine
Libs
App
Libs
App
Libs
App
OS OS OS
VM VM VM
Container Container Container
Packaging technology and light weight virtualization
46© 2018 Mellanox Technologies | Confidential
Containers Networking – Many Options
Host
Container
C
Container D Container E Container FContainer A Container B
Direct
Host
network
Unix-domain
sockets and
other IPC
Linux bridge
(Docker0)
iptables
(Docker proxy)
Open vSwitch
Port
mapping
NICNIC NIC
47© 2018 Mellanox Technologies | Confidential
Containers Networking With Mellanox
10-100Gb/s Ethernet
Stateless offloads
Scalable and secure DPDK usage from container (*)
RDMA for InfiniBand and RoCE from within the container (*)
SR-IOV (*)
Roadmap
Container Direct networking
vSwitch offload with ASAP2
(*) Initial support available; Integration with Kubernetes later this year
48© 2018 Mellanox Technologies | Confidential
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

Virtualization
VirtualizationVirtualization
VirtualizationBirju Tank
 
Q2.12: Debugging with GDB
Q2.12: Debugging with GDBQ2.12: Debugging with GDB
Q2.12: Debugging with GDBLinaro
 
DPDK & Layer 4 Packet Processing
DPDK & Layer 4 Packet ProcessingDPDK & Layer 4 Packet Processing
DPDK & Layer 4 Packet ProcessingMichelle Holley
 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski buildacloud
 
Vxlan deep dive session rev0.5 final
Vxlan deep dive session rev0.5   finalVxlan deep dive session rev0.5   final
Vxlan deep dive session rev0.5 finalKwonSun Bae
 
SIP & TLS - Security in a peer to peer world
SIP & TLS - Security in a peer to peer worldSIP & TLS - Security in a peer to peer world
SIP & TLS - Security in a peer to peer worldOlle E Johansson
 
MP BGP-EVPN 실전기술-1편(개념잡기)
MP BGP-EVPN 실전기술-1편(개념잡기)MP BGP-EVPN 실전기술-1편(개념잡기)
MP BGP-EVPN 실전기술-1편(개념잡기)JuHwan Lee
 
VMware vSphere Networking deep dive
VMware vSphere Networking deep diveVMware vSphere Networking deep dive
VMware vSphere Networking deep diveSanjeev Kumar
 
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...The Linux Foundation
 
Get Hands-On with NGINX and QUIC+HTTP/3
Get Hands-On with NGINX and QUIC+HTTP/3Get Hands-On with NGINX and QUIC+HTTP/3
Get Hands-On with NGINX and QUIC+HTTP/3NGINX, Inc.
 
Debian general presentation
Debian general presentationDebian general presentation
Debian general presentationDing Zhou
 
Open vSwitch Introduction
Open vSwitch IntroductionOpen vSwitch Introduction
Open vSwitch IntroductionHungWei Chiu
 
Chp5 - Sécurité des Services
Chp5 - Sécurité des ServicesChp5 - Sécurité des Services
Chp5 - Sécurité des ServicesLilia Sfaxi
 
OpenStack High Availability
OpenStack High AvailabilityOpenStack High Availability
OpenStack High AvailabilityJakub Pavlik
 
import rdma: zero-copy networking with RDMA and Python
import rdma: zero-copy networking with RDMA and Pythonimport rdma: zero-copy networking with RDMA and Python
import rdma: zero-copy networking with RDMA and Pythongroveronline
 

Was ist angesagt? (20)

Virtualization
VirtualizationVirtualization
Virtualization
 
Q2.12: Debugging with GDB
Q2.12: Debugging with GDBQ2.12: Debugging with GDB
Q2.12: Debugging with GDB
 
DPDK & Layer 4 Packet Processing
DPDK & Layer 4 Packet ProcessingDPDK & Layer 4 Packet Processing
DPDK & Layer 4 Packet Processing
 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
 
Vxlan deep dive session rev0.5 final
Vxlan deep dive session rev0.5   finalVxlan deep dive session rev0.5   final
Vxlan deep dive session rev0.5 final
 
Logiciels libres
Logiciels libresLogiciels libres
Logiciels libres
 
SIP & TLS - Security in a peer to peer world
SIP & TLS - Security in a peer to peer worldSIP & TLS - Security in a peer to peer world
SIP & TLS - Security in a peer to peer world
 
Routage ospf
Routage ospfRoutage ospf
Routage ospf
 
NVMe over Fabric
NVMe over FabricNVMe over Fabric
NVMe over Fabric
 
QNX Sales Engineering Presentation
QNX Sales Engineering PresentationQNX Sales Engineering Presentation
QNX Sales Engineering Presentation
 
MP BGP-EVPN 실전기술-1편(개념잡기)
MP BGP-EVPN 실전기술-1편(개념잡기)MP BGP-EVPN 실전기술-1편(개념잡기)
MP BGP-EVPN 실전기술-1편(개념잡기)
 
VMware vSphere Networking deep dive
VMware vSphere Networking deep diveVMware vSphere Networking deep dive
VMware vSphere Networking deep dive
 
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...
Rootlinux17: Hypervisors on ARM - Overview and Design Choices by Julien Grall...
 
Ldap intro
Ldap introLdap intro
Ldap intro
 
Get Hands-On with NGINX and QUIC+HTTP/3
Get Hands-On with NGINX and QUIC+HTTP/3Get Hands-On with NGINX and QUIC+HTTP/3
Get Hands-On with NGINX and QUIC+HTTP/3
 
Debian general presentation
Debian general presentationDebian general presentation
Debian general presentation
 
Open vSwitch Introduction
Open vSwitch IntroductionOpen vSwitch Introduction
Open vSwitch Introduction
 
Chp5 - Sécurité des Services
Chp5 - Sécurité des ServicesChp5 - Sécurité des Services
Chp5 - Sécurité des Services
 
OpenStack High Availability
OpenStack High AvailabilityOpenStack High Availability
OpenStack High Availability
 
import rdma: zero-copy networking with RDMA and Python
import rdma: zero-copy networking with RDMA and Pythonimport rdma: zero-copy networking with RDMA and Python
import rdma: zero-copy networking with RDMA and Python
 

Ähnlich wie Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more

[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...
[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...
[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...OpenStack Korea Community
 
Mellanox for OpenStack - OpenStack最新情報セミナー 2014年10月
Mellanox for OpenStack  - OpenStack最新情報セミナー 2014年10月Mellanox for OpenStack  - OpenStack最新情報セミナー 2014年10月
Mellanox for OpenStack - OpenStack最新情報セミナー 2014年10月VirtualTech Japan Inc.
 
6WINDGate™ - Enabling Cloud RAN Virtualization
6WINDGate™ - Enabling Cloud RAN Virtualization6WINDGate™ - Enabling Cloud RAN Virtualization
6WINDGate™ - Enabling Cloud RAN Virtualization6WIND
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpenInfra Days Poland 2019
 
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안NAIM Networks, Inc.
 
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Community
 
Software Stacks to enable SDN and NFV
Software Stacks to enable SDN and NFVSoftware Stacks to enable SDN and NFV
Software Stacks to enable SDN and NFVYoshihiro Nakajima
 
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:Cisco Canada
 
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Community
 
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...Ceph Community
 
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand AdapterAnnouncing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapterinside-BigData.com
 
High Performance Networking Leveraging the DPDK and Growing Community
High Performance Networking Leveraging the DPDK and Growing CommunityHigh Performance Networking Leveraging the DPDK and Growing Community
High Performance Networking Leveraging the DPDK and Growing Community6WIND
 
 Network Innovations Driving Business Transformation
 Network Innovations Driving Business Transformation Network Innovations Driving Business Transformation
 Network Innovations Driving Business TransformationCisco Service Provider
 
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentation
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentationTurbocharge the NFV Data Plane in the SDN Era - a Radisys presentation
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentationRadisys Corporation
 
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...Cloud Native Day Tel Aviv
 

Ähnlich wie Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more (20)

CloudX on OpenStack
CloudX on OpenStackCloudX on OpenStack
CloudX on OpenStack
 
[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...
[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...
[OpenStack Days Korea 2016] Track1 - Mellanox CloudX - Acceleration for Cloud...
 
Mellanox for OpenStack - OpenStack最新情報セミナー 2014年10月
Mellanox for OpenStack  - OpenStack最新情報セミナー 2014年10月Mellanox for OpenStack  - OpenStack最新情報セミナー 2014年10月
Mellanox for OpenStack - OpenStack最新情報セミナー 2014年10月
 
6WINDGate™ - Enabling Cloud RAN Virtualization
6WINDGate™ - Enabling Cloud RAN Virtualization6WINDGate™ - Enabling Cloud RAN Virtualization
6WINDGate™ - Enabling Cloud RAN Virtualization
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi Alkobi
 
Interconnect your future
Interconnect your futureInterconnect your future
Interconnect your future
 
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안
OVNC 2015-Open Ethernet과 SDN을 통한 Mellanox의 차세대 네트워크 혁신 방안
 
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
Ceph Day Seoul - AFCeph: SKT Scale Out Storage Ceph
 
Software Stacks to enable SDN and NFV
Software Stacks to enable SDN and NFVSoftware Stacks to enable SDN and NFV
Software Stacks to enable SDN and NFV
 
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
 
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
 
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...
Ceph Day Chicago - Deploying flash storage for Ceph without compromising perf...
 
Virtualization Acceleration
Virtualization Acceleration Virtualization Acceleration
Virtualization Acceleration
 
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand AdapterAnnouncing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
 
High Performance Networking Leveraging the DPDK and Growing Community
High Performance Networking Leveraging the DPDK and Growing CommunityHigh Performance Networking Leveraging the DPDK and Growing Community
High Performance Networking Leveraging the DPDK and Growing Community
 
Mellanox Approach to NFV & SDN
Mellanox Approach to NFV & SDNMellanox Approach to NFV & SDN
Mellanox Approach to NFV & SDN
 
 Network Innovations Driving Business Transformation
 Network Innovations Driving Business Transformation Network Innovations Driving Business Transformation
 Network Innovations Driving Business Transformation
 
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentation
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentationTurbocharge the NFV Data Plane in the SDN Era - a Radisys presentation
Turbocharge the NFV Data Plane in the SDN Era - a Radisys presentation
 
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
 
Mellanox IBM
Mellanox IBMMellanox IBM
Mellanox IBM
 

Mehr von inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

Mehr von inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Kürzlich hochgeladen

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Kürzlich hochgeladen (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more

  • 1. 1© 2018 Mellanox Technologies | Confidential The Critical Path for HPC, Cloud and Machine Learning Lugano April 9 2018 Advanced Networking
  • 2. 2© 2018 Mellanox Technologies | Confidential Modern Data Centers Orchestration SDSSDN Multi- Tenancy On Demand
  • 3. 3© 2018 Mellanox Technologies | Confidential The Challenge – Software Implementation Performance Programmability
  • 4. 4© 2018 Mellanox Technologies | Confidential The Solution - Hardware Acceleration Software defined everything is a key for modern data centers With today’s software based solutions, functionality is (almost) there To gain flexibility, performance and cost efficiency, Hardware Acceleration is needed High Performance Workloads Can’t Deliver Without HW Acceleration Software Software + Hardware Acceleration
  • 5. 5© 2018 Mellanox Technologies | Confidential Advance Network Technologies For OpenStack
  • 6. 6© 2018 Mellanox Technologies | Confidential Server VM1 VM2 VM3 VM4 Overlay Networks Overlay Network Advantages: Isolation, Simplicity, Scalability Virtual Domain 3 Virtual Domain 2 Virtual Domain 1 Physical View Server VM5 VM6 VM7 VM8 Virtual View NVGRE/VXLAN/Geneve Overlay Networks
  • 7. 7© 2018 Mellanox Technologies | Confidential Turbocharge Overlay Networks Overlay tunnels add network processing  Limits bandwidth  Consumes CPU System efficiency drops 10s of percents For penalty free overlays, at bare- metal performance use NIC with overlay Network HW offloads  ConnectX-4 and ConnectX-5 family Mellanox adapters also supports VxLAN VTEP (encap/decap) 37.5 17.62 36.21 0.7 3.5 0.7 0 0.5 1 1.5 2 2.5 3 3.5 4 Physical VXLAN No Offloads VXLAN HW Offloads 0 5 10 15 20 25 30 35 40 CPU%Per1Gb/s Bendwith(Gb/s) 40G/s ConnectX-3 Pro 8 VM Pairs BW 8 VM Pairs CPU
  • 8. 8© 2018 Mellanox Technologies | Confidential Para-Virtualized SR-IOV Single Root I/O Virtualization (SR-IOV) PCIe device presents multiple instances to the OS/Hypervisor Enables Application Direct Access  Bare metal performance for VM  Reduces CPU overhead Enables many advanced NIC features (e.g. DPDK, RDMA, ASAP2) NIC Hypervisor vSwitch VM VM SR-IOV NIC Hypervisor VM VM eSwitch Physical Function (PF) Virtual Function (VF) Fully Integrated And Upstream With OpenStack
  • 9. 9© 2018 Mellanox Technologies | Confidential Per VF (SR-IOV) Quality of Service (QoS) New Neutron API for  Per VF Rate Limiting  Per VF BW Guarantee  Packet Pacing Same model for ParaVirt and SR- IOV In SR-IOV mode, QoS is enforced by HW  Finer grain  More predictable  Less Jitter  Less CPU utilization Rate ShaperRate Shaper QoS Queue Work Queue Work Queue Work Queue Priority 0 Arbiter QoS Queue Work Queue Work Queue Work Queue QoS Queue Work Queue Work Queue Work Queue QoS Queue Work Queue Work Queue Work Queue Priority 1 Arbiter RR arbiter RR arbiter RR arbiter RR arbiter Strict Priority TC Group 0 DWRR TC Group 1 DWRR TC Group 7 DWRR TC0 TC1 Flow Ctrl Flow Ctrl TC2 TC3 Flow Ctrl Flow Ctrl TC7 Flow Ctrl... HL ... ...Priority 0 Priority 1 Priority 2 Priority 3 Priority 7 Rate Limiter Rate Limiter Enhanced ETSPer VF Rate Limiter Mellanox Advance HW QoS Implementation
  • 10. 10© 2018 Mellanox Technologies | Confidential SR-IOV High Availability / VF LAG  SR-IOV VMs don’t support bonding/HA  Mellanox enable transparent SR-IOV HA on a single NIC  LAG will be implemented on Mellanox NIC so VM will only see a single Virtual Function (VF)  Mode supported  Active Passive (Single port BW)  Active Active (Double port BW)  LACP NIC Host Virtual Function VM VF driver User Kernel Virtual Function Port 1 Port 2 LAG
  • 11. 11© 2018 Mellanox Technologies | Confidential Tradeoffs Between Virtual Switch and SR-IOV Virtual Switch SR-IOV
  • 12. 12© 2018 Mellanox Technologies | Confidential Open Virtual Switch (OVS) Challenges  Virtual switches such as Open vSwitch (OVS) are used as the forwarding plane in the hypervisor  Virtual switches implement extensive support for SDN (e.g. enforce policies) and are widely used by the industry  Supports L2-L3 networking features:  L2 & L3 Forwarding, NAT, ACL, Connection Tracking etc.  Flow based  OVS Challenges:  Awful Packet Performance: <1M w/ 2-4 cores,  Burns CPU like Hell : Even w/ 12 cores, can’t get 1/3rd 100G NIC Speed  Bad User Experience: High and unpredictable latency, packet drops  Solution  Offload OVS data plane into Mellanox NIC using ASAP2 technology
  • 13. 13© 2018 Mellanox Technologies | Confidential ConnectX-5 Packet Processing Offload Capabilities  Flow Tables  Multiple, Programmable tables  Dedicate, isolated tables for hypervisor and/or VMs  Practically unlimited table size  Can support million of rules/flows  Classification  Match on all header fields including encapsulated packets  Flexible fields extraction by “Flexparse”  Actions  Steering  Encap/Decap  VXLAN, NVGRE, Geneve, MPLSoGRE/UDP, NSH  Flex encap/decap  Drop / Allow  Mirror  Flow ID  Header rewrite  Hairpin mode
  • 14. 14© 2018 Mellanox Technologies | Confidential Accelerated Switching And Packet Processing (ASAP2)  ASAP2 take advantage of ConnectX-5 capability to accelerate or offload “in host” network stack  Family of solutions ASAP2 Direct Full vSwitch offload ASAP2 Flex vSwitch acceleration ASAP2 Flex VNF/VM acceleration
  • 15. 15© 2018 Mellanox Technologies | Confidential ASAP2 Direct: Full OVS Offload  Enable SR-IOV data path with OVS control plane  In other words, enable support for most SDN controllers with SR-IOV data plane  Use Open vSwitch to be the management interface and offload OVS data- plane to Mellanox embedded Switch (eSwitch) using ASAP2 Direct OVS-eSwitch Netdev Representor Netdev Representor Netdev Representor Netdev Representor eSwitch PF (wire) Host IP interface Host exception path (user-space) VF VF VF netdev netdev Para-virt Para-virt Hypervisor Representor Ports VM ConnectX-5 eSwitch VM Hypervisor OVS SR-IOV VF SR-IOV VF DataPath PF
  • 16. 16© 2018 Mellanox Technologies | Confidential OVS over DPDK VS. OVS Offload ConnectX-5 provide significant performance boost  Without adding CPU resources 7.6 MPPS 66 MPPS 4 Cores 0 Cores 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 0 10 20 30 40 50 60 70 OVS over DPDK OVS Offload NumberofDedicatedCores MillionPacketPerSecond Message Rate Dedicated Hypervisor Cores Test ASAP2 Direct OVS DPDK Benefit 1 Flow VXLAN 66M PPS 7.6M PPS (VLAN) 8.6X 60K flows VXLAN 19.8M PPS 1.9M PPS 10.4X
  • 17. 17© 2018 Mellanox Technologies | Confidential Remote Direct Memory Access (RDMA) ZERO Copy Remote Data Transfer Low Latency, High Performance Data Transfers InfiniBand - 100Gb/s RoCE* – 100Gb/s Kernel Bypass Protocol Offload Application ApplicationUSER KERNEL HARDWARE Buffer Buffer
  • 18. 18© 2018 Mellanox Technologies | Confidential RDMA In Cloud Enable RDMA applications to run on cloud  Scientific  HPC  Machine Learning and AI  Data bases Accelerate cloud infrastructure  VM migration over RDMA  Message queue over RDMA (e.g. gRPC) Accelerate cloud storage  iSER  NVMf Cognitive Toolkit
  • 19. 19© 2018 Mellanox Technologies | Confidential RDMA Provide Fastest OpenStack Block Storage Access Using OpenStack Built-in components and management (Open-iSCSI, tgt target, Cinder), no additional software is required, RDMA is already inbox and used by our OpenStack customers ! Hypervisor (KVM) OS VM OS VM OS VM Adapter Open-iSCSI w iSER Compute Servers Switching Fabric iSCSI/iSER Target (tgt) Adapter Local Disks RDMA Cache Storage Servers OpenStack (Cinder) Using RDMA to accelerate iSCSI storage 0 1000 2000 3000 4000 5000 6000 7000 1 2 4 8 16 32 64 128 256 Bandwidth[MB/s] I/O Size [KB] iSER 4 VMs Write iSER 8 VMs Write iSER 16 VMs Write iSCSI Write 8 vms iSCSI Write 16 VMs PCIe Limit 6X RDMA enables 6x More Bandwidth, 5x lower I/O latency, and lower CPU%
  • 20. 20© 2018 Mellanox Technologies | Confidential NVMe Over Fabrics Sharing NVMe based storage across multiple servers  Better utilization: capacity, rack space, power  Scalability, management, fault isolation RDMA protocol is part of the standard  InfiniBand or Ethernet (RoCE) OpenStack Integration  Cinder driver * Roadmap
  • 21. 21© 2018 Mellanox Technologies | Confidential Data Plane Development Kit (DPDK)  What is DPDK?  Set of open source libraries and drivers for fast packet processing  What is the main usage and benefits of DPDK?  Receive and send packets within the minimum number of CPU cycles (usually less than 80 cycles)  Develop fast packet capture algorithms  Run third-party fast path stacks  Can be used as an abstraction layer that will enable application porting between CPU architectures  DPDK in the cloud  Accelerate virtual switches (i.e., OVS over DPDK)  Enable Virtual Network Functions (VNFs)
  • 22. 22© 2018 Mellanox Technologies | Confidential DPDK with Mellanox - Industry Leading Performance Mellanoxwith 66% lower latency compared to competition Highest Performance and Message Rate in the Market!!! 139.22 84.46 45.29 23.50 11.97 9.62 8.13 0.00 20.00 40.00 60.00 80.00 100.00 120.00 140.00 160.00 64 128 256 512 1024 1280 1518 Framerate[mpps] Frame Size [B] Lossless ConnectX-5 Ex 100GbE Frame Rate [Mpps] 16 cores 100GbE line rate
  • 23. 23© 2018 Mellanox Technologies | Confidential DPDK with Mellanox – Secure & Cost Effective S E C U R E NIC based hardware memory protection and translation by memory registration and isolation per application Benefits:  Better Secured  Supports Containerized DPDK applications without SR-IOV THROUGH MEMORY PROTECTION In hardware Allows concurrent use of DPDK and NON-DPDK applications on the same NIC unlike competition Benefits:  Save CapEx of dedicated DPDK NIC C O S T E F F E C T I V E Supporting multiple architectures Benefits:  Tightly integrated with processor specific accelerators (Neon, AVX, etc) M U LT I A R C H
  • 24. 24© 2018 Mellanox Technologies | Confidential OpenStack Over InfiniBand – The Route To Extreme Performance  Transparent InfiniBand integration into OpenStack  Since Havana OpenStack release  RDMA directly from VM  Requires SR-IOV  MAC to GUID mapping  VLAN to pkey mapping  InfiniBand SDN network  Ideal fit for High Performance Computing Clouds InfiniBand Enables The Highest Performance and Efficiency
  • 25. 25© 2018 Mellanox Technologies | Confidential Ironic Ethernet and InfiniBand Support  Ironic is OpenStack bare metal provisioning  Useful for High Performance Compute (HPC) and Big Data  Mellanox enabled Ironic support for IB and Eth  Bare metal, multi-tenancy IB and Eth  Provide zero touch VLAN switch provision  Enable InfiniBand support for Ironic with Neutron using pkey segmentation and OpenSM integration (via Neo/UFM) I’m “Pixie Boots” the mascot of the "Bear Metal" Provisioning, a.k.a Ironic
  • 26. 26© 2018 Mellanox Technologies | Confidential Comprehensive OpenStack Integration Integrated with Major OpenStack Distributions In-Box Neturon-ML2 support for mixed environment (VXLAN, PV, SRIOV) Ethernet Neutron : Hardware support for security and isolation Accelerating storage access by up to 5X OpenStack Plugins Create Seamless Integration , Control, & Management
  • 27. 27© 2018 Mellanox Technologies | Confidential Machine Learning Network Needs
  • 28. 28© 2018 Mellanox Technologies | Confidential Neural Networks Complexity Growth 2014 2015 2016 2017 DeepSpeech DeepSpeech-2 DeepSpeech-3 30X 2013 2014 2015 2016 AlexNet GoogleNet ResNet Inception-V2 350X Inception-V4 Image Recognition Speech Recognition PolyNet
  • 29. 29© 2018 Mellanox Technologies | Confidential Training Challenges Training with large data sets and increasing networks can take long time  In some cases even weeks In many cases training need to happen frequently  Model development and tuning  Real life use cases may require retraining regularly Accelerate training time by scale out architecture  Add workers (nodes) to reduce training time Types of parallelism that are now popular Data parallelism Model parallelism Network is critical element to accelerate Distributed Training!
  • 30. 30© 2018 Mellanox Technologies | Confidential Model and Data Parallelism Main Model/Parameter Server/Allreaduce Local Model Mini Batch Mini Batch Mini Batch Mini Batch Mini Batch Local Model Local Model Local Model Local Model Local Model Mini BatchData Data Model Parallelism Data Parallelism
  • 31. 31© 2018 Mellanox Technologies | Confidential Accelerating Data Parallelism Data Parallelism communication pattern  Gradient updates to parameter servers or among workers.  Model parameters distribution among workers.  Frequent – each training step due to the sequential nature of SGD  High bandwidth is needed, as models become larger and larger  Number of parameters is increasing  Usually characterized with Bursts on the network - workers are synchronized RDMA and GPU Direct Accelerates Model Parallelism
  • 32. 32© 2018 Mellanox Technologies | Confidential What Is RDMA?  Remote Direct Memory Access (RDMA)  Advance transport protocol (same layer as TCP and UDP)  Main features  Remote memory read/write semantics in addition to send/receive  Kernel bypass / direct user space access  Full hardware offload  Secure, channel based IO  Application advantage  Low latency  High bandwidth  Low CPU consumption  RoCE: RDMA over Converged Ethernet  Available for all Ethernet speeds 10 – 100G  Verbs: RDMA SW interface (equivalent to sockets)
  • 33. 33© 2018 Mellanox Technologies | Confidential GPUDirect™ RDMA Technology
  • 34. 34© 2018 Mellanox Technologies | Confidential All Major Machine Learning Frameworks Support RDMA TensorFlow: Several implementations upstream  Native (verbs) - https://github.com/tensorflow/tensorflow/tree/master/tensorflow/contrib/ verbs  MPI, Horavod – Donated by Uber among others Caffe2: Over MPI or Gloo library Microsoft Cognitive Toolkit: Native support NVIDIA NCCL2: Native support in NCCL Cognitive Toolkit
  • 35. 35© 2018 Mellanox Technologies | Confidential TensorFlow with Mellanox RDMA Test Report  System Configuration  8 x x86 servers  4 x NVIDIA P100 per server  Mellanox 100G RDMA network  NVMe driver per server RDMA vs. TCP: Up to 50% Better Performance Advanced RDMA vs. TCP: Up to 173% Better Performance Reference Deployment Guide
  • 36. 36© 2018 Mellanox Technologies | Confidential SHARP To Accelerate Parameter Server  Bottleneck created by Parameter Server  SHARP Reduces Network Load and Latency SCALE ∆𝑾 𝟏 ∆𝑾 𝟐 ∆𝑾 𝟑 ∆𝑾 𝑵 ∑∆𝑾𝒊 ∑∆𝑾𝒊 ∑∆𝑾𝒊∑∆𝑾𝒊∑∆𝑾𝒊 𝒊=𝟏 𝒊=𝟑 ∆𝑾𝒊 𝒊=𝑵−𝟑 𝒊=𝑵 ∆𝑾𝒊
  • 37. 37© 2018 Mellanox Technologies | Confidential Data Ingestion  Data ingestion is the process of acquiring and preparing the input  Preprocessing stage before accessing machine learning frameworks  Examples  Convert file/image formats  Combine multiple data sources  Clean noise / enhance input  Relevant for training and inference  Data Ingestion typically includes  Access to storage (local, distributed, network storage)  Pre-processing in a big data framework such as Hadoop or Spark Accelerate Data Ingest is critical for machine learning performance
  • 38. 38© 2018 Mellanox Technologies | Confidential Data Pipeline
  • 39. 39© 2018 Mellanox Technologies | Confidential Mellanox Is Driving High Performance Storage A majority of these customer are doing NVMe-oF POCs or early development with us Today
  • 40. 40© 2018 Mellanox Technologies | Confidential Accelerate Big Data - Enabling Real-time Decisions Benchmark: TeraSort Benchmark: Cassandra Stress 0 200 400 600 800 1000 1200 Intel 10Gb/s Mellanox 10Gb/s Mellanox 40Gb/s ExecutionTime(inseconds) Ethernet Network 3X Faster Benchmark: Fraud Detection 0 100 200 300 400 500 600 700 Existing Solution Aerospike with Mellanox + Samsung NVMe TotalTransactionTime(inms) CPU + Storage + Network Fraud Detection Algorithm ~2x more time for running fraud detection algorithm 3X Faster Runtime! ~2X Faster Runtime! 25G BW for Database! Mellanox is Certified by Leading Big Data Partners
  • 41. 41© 2018 Mellanox Technologies | Confidential Spark Over RDMA – Accelerate Map/Reduce Map Reduce task MapReduce Map Map Map Map Input Map output File File File File File Driver Reduce task Reduce task Reduce task Reduce task Fetch blocks Fetch blocks Fetch blocks Fetch blocks Fetch blocks  Shuffling is very expensive in terms of CPU, RAM, disk and network IOs  Spark Over RDMA allow to speed up shuffle operations  RDMA is provided by “DiSNI” (Open-source Java interface to RDMA user libraries)  https://github.com/zrlio/disni
  • 42. 42© 2018 Mellanox Technologies | Confidential Spark over RDMA Performance Results: TeraSort Testbed:  HiBench TeraSort  Workload: 175GB  HDFS on Hadoop 2.6.0  No replication  Spark 2.2.0  1 Master  16 Workers  28 active Spark cores on each node, 420 total  Node info:  Intel Xeon E5-2697 v3 @ 2.60GHz  RoCE 100GbE  256GB RAM  HDD is used for Spark local directories and HDFS RDMA Standard 0 10 20 30 40 50 60 70 80 seconds
  • 43. 43© 2018 Mellanox Technologies | Confidential 17% 23% 19% 0 20 40 60 80 100 120 140 160 Customer App #1 Customer App #2 HiBench TeraSort Runtime(inseconds) TCP RDMA Spark over RDMA: Real Applications Results Runtime samples Input Size Nodes Cores per node RAM per node Improvement Customer App #1 5GB 14 24 85GB 17% Customer App #2 540GB 14 24 85GB 23% HiBench TeraSort 300GB 15 28 256GB 19% Lower is better
  • 44. 44© 2018 Mellanox Technologies | Confidential Containers
  • 45. 45© 2018 Mellanox Technologies | Confidential Containers Vs. Virtual Machines Infrastructure Infrastructure Operating System Hypervisor Libs App Libs App Libs App Operating System Container Engine Libs App Libs App Libs App OS OS OS VM VM VM Container Container Container Packaging technology and light weight virtualization
  • 46. 46© 2018 Mellanox Technologies | Confidential Containers Networking – Many Options Host Container C Container D Container E Container FContainer A Container B Direct Host network Unix-domain sockets and other IPC Linux bridge (Docker0) iptables (Docker proxy) Open vSwitch Port mapping NICNIC NIC
  • 47. 47© 2018 Mellanox Technologies | Confidential Containers Networking With Mellanox 10-100Gb/s Ethernet Stateless offloads Scalable and secure DPDK usage from container (*) RDMA for InfiniBand and RoCE from within the container (*) SR-IOV (*) Roadmap Container Direct networking vSwitch offload with ASAP2 (*) Initial support available; Integration with Kubernetes later this year
  • 48. 48© 2018 Mellanox Technologies | Confidential Thank You