Accelerate networking innovation through programmable data plane 
Removing switches from datacenters with TRILL/VNT and smartNIC 
Ahmed Amamou, ahmed@gandi.net 
Benoît Ganne, bganne@kalray.eu
•Gandi has been a domain name registrar since 1999 and a cloud provider since 2008
•We provide both 
–IaaS: Infrastructure As A Service 
–PaaS: Platform As A Service 
•We support the open source community:
–We publish open source code: https://github.com/Gandi
–We support open source projects: VLC, Debian, … *
* See http://www.gandi.net/supports/ for the exhaustive list
Who is Gandi? 
2
IaaS: new network challenges
3
•Cisco forecast report*:
–Cloud traffic was about 3.3 zettabytes (10^21 bytes) in 2013
–Cloud traffic will reach 6.6 zettabytes in 2016
–76% of cloud traffic is East-West (within the same datacenter)
→ A high density of links within a datacenter is needed
•Customers need full network access
–Each customer should be isolated
–VM network configuration should not be restricted
→ Overlaying tenant traffic should be considered
* Cisco Global Cloud Index Forecast and Methodology, 2011-2016.
•New protocols have been proposed to solve these problems (TRILL, VXLAN, 802.1ad, STT, …) but:
–Hardware integration is slow
–Protocol extensions are hard to integrate
•We believe the OpenCompute community can help us
–Define an open, vendor-neutral API for programmable data planes
–Bring open hardware that fulfills those needs
Why OpenCompute? 
4
•Switch from a classic datacenter architecture to a full-mesh one
•Upgrade hardware to improve performance
New datacenter architecture 
5
TRILL @Gandi 
6 
•Gandi has used commodity hardware as TRILL RBridges since 2013
•We have not yet found hardware that suits our needs.
•Layer 2 routing protocol
•Uses a control plane and a data plane
•Control plane: based on IS-IS, which computes all routing information
•Data plane: forwards packets using the information provided by the control plane
•Uses MAC-in-MAC encapsulation
TRILL: TRansparent Interconnection of Lots of Links
7 
Original payload 
TRILL Header
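As a rough illustration of the encapsulation, the sketch below packs and unpacks the 6-byte base TRILL header as laid out in RFC 6325 (version, reserved bits, multicast flag, options length, hop count, then the egress and ingress RBridge nicknames). The function names are illustrative assumptions and are not taken from the Gandi implementation; header options and the VNT extension are not modelled.

```python
import struct

TRILL_ETHERTYPE = 0x22F3  # Ethertype assigned to TRILL (RFC 6325)


def pack_trill_header(egress, ingress, hop_count, multicast=False,
                      op_length=0, version=0):
    """Build the 6-byte base TRILL header (no options)."""
    if not (0 <= hop_count < 64 and 0 <= op_length < 32 and 0 <= version < 4):
        raise ValueError("field out of range")
    # First 16 bits: V(2) | R(2) | M(1) | Op-Length(5) | Hop Count(6)
    first = (version << 14) | (int(multicast) << 11) | (op_length << 6) | hop_count
    # Network byte order: flags word, egress nickname, ingress nickname
    return struct.pack("!HHH", first, egress, ingress)


def unpack_trill_header(data):
    """Decode the base TRILL header from the first 6 bytes of `data`."""
    first, egress, ingress = struct.unpack("!HHH", data[:6])
    return {
        "version": first >> 14,
        "multicast": bool((first >> 11) & 1),
        "op_length": (first >> 6) & 0x1F,
        "hop_count": first & 0x3F,
        "egress": egress,
        "ingress": ingress,
    }


# Hypothetical nicknames, chosen only for the example.
header = pack_trill_header(egress=0x1234, ingress=0x5678, hop_count=20,
                           multicast=True)
fields = unpack_trill_header(header)
```

The original payload would follow this header unchanged, which is why the slide shows it as a prefix to the frame.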
TRILL benefits
8
                 Switching (L2)   Routing (L3)   TRILL
Configuration    Minimal          Intensive      Minimal
Plug & play      Yes              No             Yes
Discovery        Automatic        Configured     Automatic
Learning         Automatic        Configured     Automatic
Multipath        No               Yes            Yes
Convergence      Slow             Fast           Fast
Connectivity     Inflexible       Flexible       Flexible
Scale            Limited          Large          Large
Control Plane: Forwarding database 
9
Multitenancy: Virtual Network over TRILL (VNT) 
10 
New cloud architectures have to take multitenancy into consideration
TRILL does not provide mechanisms for handling multitenancy
→ We need to extend it
•Update both control and data planes
–Control plane: prune the multicast tree to limit multicast traffic
–Data plane: forwarding is conditioned by VNI support
VNT vs TRILL
11
VNT encapsulation of the original Ethernet frame (outermost field first):
–Outer destination MAC address
–Outer source MAC address
–Optional outer IEEE 802.1Q tag
–TRILL header: egress RBridge nickname, ingress RBridge nickname (L2 routing information)
–VNT header extension: options description TLV, VNI tag (24 bits) (tenant identification)
–Original packet payload
Publication: 
Amamou, A., Haddadou, K., & Pujolle, G. (2014). 
A TRILL-based multi-tenant data center network. Computer Networks.
VNT: Multicast tree pruning 
12
[Figure: topology of eight RBridges (n1 to n8) with interfaces i1 to i3, hosts labelled A and B attached to Vni1, and the multicast tree before and after pruning; the pruned tree is labelled with n1, n2, n5 and n6.]
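The pruning idea can be sketched as follows: starting from a distribution tree, keep only the branches that lead to an RBridge hosting at least one member of the VNI. This is a minimal sketch under stated assumptions; the tree shape, the member set and the function name `prune_tree` are hypothetical and do not reproduce the topology of the figure or the algorithm of the published VNT paper.

```python
def prune_tree(children, root, members):
    """Return a pruned child map that keeps only branches leading to a member.

    children: dict mapping a node to the list of its children in the tree
    root:     root of the distribution tree
    members:  set of nodes that host at least one endpoint of the VNI
    """
    pruned = {}

    def keep(node):
        # Recurse first so a node is kept when any descendant is a member.
        kept_children = [c for c in children.get(node, []) if keep(c)]
        if kept_children or node in members:
            pruned[node] = kept_children
            return True
        return False

    keep(root)
    return pruned


# Hypothetical tree rooted at n1; only n2 and n6 host members of the VNI.
tree = {"n1": ["n2", "n3"], "n2": ["n5", "n4"], "n3": ["n7"], "n5": ["n6"]}
pruned = prune_tree(tree, "n1", members={"n2", "n6"})
```

In this example n5 is kept although it hosts no member, because it sits on the path to n6, while the n3 and n4 branches are dropped.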
Current VNT implementation on Linux 
13 
Control plane : Quagga daemon 
Data plane: 
Linux Bridge Module
https://github.com/Gandi/
•Throughput is affected by the additional processing
•The processing delay of a single packet is not affected
Data plane: performance 
15 
Throughput 
Delay
•Shift the data plane from the host to a smartNIC
–Increase performance
–Free the x86 host for other uses
•e.g. customer workloads
Improving performance 
16 
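[Figure: two hosts side by side. With a plain NIC, the host runs both the control plane and the data plane; with a smartNIC, the host keeps the control plane and the smartNIC runs the data plane.]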
•Founded in 2008, fabless semiconductor company 
•Kalray has developed the disruptive MPPA® (Multi-Purpose Processing Array) programmable architecture 
–Leading Performance / Energy Ratio Worldwide 
–Time predictability and low latency 
–Heterogeneous applications on the same chip 
–High programmability 
•Working with industry-leading partners and customers 
•55 employees 
•Offices in France and US 
KALRAY deterministic supercomputing on a chip 
17 
[Figure: first MPPA®-256 chips, TSMC 28nm CMOS]
MPPA®-256 Bostan Networking Strengths
18
•Software-defined NIC
–Smart packet classification/dispatching
–256 cores for packet processing
–Standard C/C++ with GCC-4.9
–Advanced debugging and profiling
•Low latency
–Zero-copy Ethernet ↔ PCIe
–< 1μs port-to-port transparent mode
–< 1μs port to system memory
•System integration
–Linux support
–Virtualization support
–Low power
•High throughput / line rate
–80 Gbps full-duplex line rate (2x120 MPPS)
–3400 instructions per packet @64B
–AES, SHA-1, SHA-2, CRC accelerators
–2 x PCIe Gen3 8-lanes
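The packet-rate figure can be sanity-checked with a short calculation. The sketch below assumes standard Ethernet framing, where each frame carries 20 extra bytes on the wire (preamble, start-of-frame delimiter and inter-frame gap), and reads "2x120 MPPS" as two 40 Gbps ports each counted in both directions; that reading is an interpretation, not something the slide states.

```python
# 7 B preamble + 1 B start-of-frame delimiter + 12 B inter-frame gap
ETHERNET_OVERHEAD_BYTES = 20


def max_packets_per_second(link_bps, frame_bytes):
    """Theoretical packet rate of one direction of an Ethernet link."""
    bits_on_wire = (frame_bytes + ETHERNET_OVERHEAD_BYTES) * 8
    return link_bps / bits_on_wire


# One 40 Gbps port carrying minimum-size (64 B) frames.
per_direction = max_packets_per_second(40e9, 64)   # about 59.5 Mpps
per_port_full_duplex = 2 * per_direction           # about 119 Mpps
```

Under these assumptions a 40 Gbps port handles roughly 119 million 64-byte packets per second when both directions are counted, which is in line with the 120 MPPS per port quoted on the slide.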
MPPA®-256 Bostan 
•64-bit processor 
•Up to 800MHz 
•High Performance 
–845 GFLOPS SP / 422 GFLOPS DP 
–1 TOPS 
•High Bandwidth Network On a Chip 
–2 x 12.8 GB/s 
•High Speed Ethernet 
–Up to 2x40 Gbps / 2x120 MPPS @ 64B 
•DDR3 Memory interfaces 
–2 x 64-bit + ECC @2133MT/s / 2 x 17GB/s 
•PCIe Gen3 interface 
–2 x 8-lanes / 2 x 8 GB/s full duplex 
–End Point / Root Complex 
•NoCX extension 
–2 x 40 Gbps + 2 x 80 Gbps ILK 
•Flash controller, GPIOs… 
19
MPPA®-256 Processor Hierarchical Architecture 256 Processing Engine cores + 32 Resource Management cores 
20 
Manycore Processor 
Compute Cluster 
VLIW Core 
Instruction Level Parallelism 
Thread Level Parallelism 
Process Level Parallelism
High Speed Ethernet Packet processing 
•Ethernet Rx dispatcher 
–8 classification tables 
•Classify 
•Extract fields 
•Smart Dispatch 
–Round-robin dispatch
–Flexible core allocation
•Round-robin vs. classification
•Per 10G port
• Ethernet Tx 
–64 Tx FIFOs 
–QoS between the FIFOs 
–Flow Control between clusters and Tx FIFOs 
21 
Patent pending
VNT on a programmable data plane: multicast forwarding example
22
•Ongoing work between Gandi and Kalray
–Explore programmable data plane opportunities
–Study the feasibility and architecture of a VNT smartNIC
•Multicast forwarding puts a high load on each node
[Figure: block diagram of an x86 hypervisor and the Kalray Bostan smartNIC. Labels: MPPA Linux ethernet driver, Linux networking stack, TRILL controller, userspace application, IO ethernet driver, 8x10GbE.]
23
•Dispatch the packet based on the egress RBridge
–In case of multicast, the egress RBridge is set to the tree root
–Each cluster “owns” a subset of the possible egress RBridges (i.e. a FIB subset)
Example packet: <Ethertype=TRILL, Egress=DTROOT, VNI=VNI-1>
if (Packet[Ethertype] == TRILL) {
    send to cluster #HASH(Egress RBridge)
}
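The dispatch rule above can be sketched in a few lines. This is a minimal illustration, assuming 16 clusters and using a plain modulo as a stand-in for the unspecified HASH function; `select_cluster` and `NUM_CLUSTERS` are hypothetical names, and the real dispatcher runs in the smartNIC classification tables rather than in Python.

```python
import struct

TRILL_ETHERTYPE = 0x22F3   # Ethertype assigned to TRILL (RFC 6325)
DOT1Q_ETHERTYPE = 0x8100   # optional outer IEEE 802.1Q tag
NUM_CLUSTERS = 16          # assumption made for this example


def select_cluster(frame, num_clusters=NUM_CLUSTERS):
    """Return the owning cluster for a TRILL frame, or None otherwise."""
    offset = 12  # skip outer destination and source MAC addresses
    (ethertype,) = struct.unpack_from("!H", frame, offset)
    if ethertype == DOT1Q_ETHERTYPE:
        offset += 4  # skip the 802.1Q tag to reach the real Ethertype
        (ethertype,) = struct.unpack_from("!H", frame, offset)
    if ethertype != TRILL_ETHERTYPE:
        return None
    # The egress nickname follows the 2-byte Ethertype and the first
    # 2 bytes (flags and hop count) of the TRILL header.
    (egress,) = struct.unpack_from("!H", frame, offset + 4)
    return egress % num_clusters  # modulo used as a simple hash stand-in


# Hypothetical frame: zeroed MACs, TRILL Ethertype, egress nickname 0x0012.
trill_frame = bytes(12) + struct.pack("!H", TRILL_ETHERTYPE) \
    + struct.pack("!HHH", 0, 0x0012, 0x0034)
cluster = select_cluster(trill_frame)
```

Because the cluster depends only on the egress nickname, every frame for a given RBridge (or multicast tree root) lands on the cluster that holds the matching FIB subset.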
25
•Look up the list of next-hop RBridges for this multicast tree
–RBridge owner clusters can be local or remote
•Look up the LIB for local ports, if any
FIB[Egress RBridge] = {
    Egress RBridge MAC;
    Egress RBridge Interface;
    MCTree = [ RBx, RBy, … ];
    VNI = [ VNI-1, VNI-2, … ];
}
LIB = {
    (Local MACx, Local Portx, VNI-1);
    …
}
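One possible in-memory shape for these two tables is sketched below. The field names mirror the slide, but the classes `FibEntry` and `LibEntry`, the helper `multicast_targets` and all sample values are assumptions made for illustration, not the data structures used on the smartNIC.

```python
from dataclasses import dataclass, field


@dataclass
class FibEntry:
    """Forwarding entry for one egress RBridge (or multicast tree root)."""
    mac: str
    interface: str
    mc_tree: list = field(default_factory=list)  # next-hop RBridge nicknames
    vnis: set = field(default_factory=set)       # VNIs supported by the RBridge


@dataclass(frozen=True)
class LibEntry:
    """Local endpoint: a VM MAC address reachable through a local port."""
    mac: str
    port: str
    vni: int


def multicast_targets(fib, lib, tree_root, vni):
    """Return (next-hop RBridges, local ports) for a multicast frame."""
    next_hops = list(fib[tree_root].mc_tree)
    local_ports = [entry.port for entry in lib if entry.vni == vni]
    return next_hops, local_ports


# Hypothetical tables with made-up addresses and port names.
fib = {"DTROOT": FibEntry("02:00:00:00:00:01", "eth0", ["RBx", "RBy"], {1, 2})}
lib = [LibEntry("02:00:00:00:00:aa", "port0", 1),
       LibEntry("02:00:00:00:00:bb", "port1", 2)]
targets = multicast_targets(fib, lib, "DTROOT", vni=1)
```

Here the frame for VNI 1 would be sent on towards RBx and RBy and also delivered to the one local port attached to that VNI.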
26
•Forward the frame
–Remote
•Forward to the clusters owning the next-hop RBridges
–Local
•Decapsulate the inner frame
•Forward it to the local VM
27
•Check whether the RBridge supports the appropriate VNI
–If yes, forward to the RBridge
–If not, stop here
FIB[Egress RBridge] = {
    Egress RBridge MAC;
    Egress RBridge Interface;
    MCTree = [ RBx, RBy, … ];
    VNI = [ VNI-1, VNI-2, … ];
}
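The VNI check can be read as a filter over the next hops of the multicast tree: a frame is forwarded to a next-hop RBridge only if that RBridge's FIB entry lists the frame's VNI. The sketch below follows that reading using plain dictionaries; the function name and the sample FIB are hypothetical.

```python
def vni_filtered_next_hops(fib, tree_root, vni):
    """Next-hop RBridges of the tree that announce support for `vni`."""
    next_hops = fib[tree_root]["mc_tree"]
    # An RBridge missing from the FIB is treated as not supporting the VNI.
    return [rb for rb in next_hops
            if vni in fib.get(rb, {}).get("vnis", set())]


# Hypothetical FIB: nicknames, trees and VNI sets are made up.
fib = {
    "DTROOT": {"mc_tree": ["RBx", "RBy"], "vnis": {1, 2}},
    "RBx": {"mc_tree": [], "vnis": {1}},
    "RBy": {"mc_tree": [], "vnis": {2}},
}
selected = vni_filtered_next_hops(fib, "DTROOT", vni=1)
```

With this table a VNI 1 frame is forwarded to RBx only, so RBy never receives multicast traffic for a tenant it does not host.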
•Solving SDN and network virtualization challenges requires new protocols
–e.g. VXLAN, NVGRE, TRILL/VNT…
•Efficiency generally means hardware support
…But hardware development cannot keep up with software and slows down innovation
•Gandi and Kalray think a programmable data plane can reconcile efficiency and innovation
…But we need open ecosystems, standards and APIs
Innovation and efficiency 
29
Thank you for your attention! 
Questions? 
Ahmed Amamou, ahmed@gandi.net 
Benoît Ganne, bganne@kalray.eu

Weitere ähnliche Inhalte

Was ist angesagt?

"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...
"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr..."Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...
"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...Edge AI and Vision Alliance
 
Open Compute Project - Tech Update Amsterdam 2018
Open Compute Project - Tech Update Amsterdam 2018Open Compute Project - Tech Update Amsterdam 2018
Open Compute Project - Tech Update Amsterdam 2018Inflectiontech Ltd
 
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...KTN
 
DCD Zettastructure 2017 OCP data center project engineering workshop
DCD Zettastructure 2017 OCP data center project engineering workshopDCD Zettastructure 2017 OCP data center project engineering workshop
DCD Zettastructure 2017 OCP data center project engineering workshopInflectiontech Ltd
 
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...Cisco Russia
 

Was ist angesagt? (10)

An Optics Life
An Optics LifeAn Optics Life
An Optics Life
 
2016 2017 ieee vlsi project titles
2016   2017 ieee vlsi project titles2016   2017 ieee vlsi project titles
2016 2017 ieee vlsi project titles
 
slides
slidesslides
slides
 
"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...
"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr..."Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...
"Dynamically Reconfigurable Processor Technology for Vision Processing," a Pr...
 
Open Compute Project - Tech Update Amsterdam 2018
Open Compute Project - Tech Update Amsterdam 2018Open Compute Project - Tech Update Amsterdam 2018
Open Compute Project - Tech Update Amsterdam 2018
 
Yertl v2 granada
Yertl v2 granadaYertl v2 granada
Yertl v2 granada
 
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
 
DCD Zettastructure 2017 OCP data center project engineering workshop
DCD Zettastructure 2017 OCP data center project engineering workshopDCD Zettastructure 2017 OCP data center project engineering workshop
DCD Zettastructure 2017 OCP data center project engineering workshop
 
Tim gray
Tim grayTim gray
Tim gray
 
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...
Решения конвергентного доступа Cisco. Обновление продуктовой линейки коммутат...
 

Ähnlich wie Ocpeu14

Osnug meetup-tungsten fabric - overview.pptx
Osnug meetup-tungsten fabric - overview.pptxOsnug meetup-tungsten fabric - overview.pptx
Osnug meetup-tungsten fabric - overview.pptxM.Qasim Arham
 
Building DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPNBuilding DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPNCisco Canada
 
Cilium - Fast IPv6 Container Networking with BPF and XDP
Cilium - Fast IPv6 Container Networking with BPF and XDPCilium - Fast IPv6 Container Networking with BPF and XDP
Cilium - Fast IPv6 Container Networking with BPF and XDPThomas Graf
 
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...PROIDEA
 
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...PROIDEA
 
Pristine rina-sdk-icc-2016
Pristine rina-sdk-icc-2016Pristine rina-sdk-icc-2016
Pristine rina-sdk-icc-2016ICT PRISTINE
 
FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)Kirill Tsym
 
FD.IO Vector Packet Processing
FD.IO Vector Packet ProcessingFD.IO Vector Packet Processing
FD.IO Vector Packet ProcessingKernel TLV
 
100G Networking Berlin.pdf
100G Networking Berlin.pdf100G Networking Berlin.pdf
100G Networking Berlin.pdfJunZhao68
 
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentationHiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentationVEDLIoT Project
 
HiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentationHiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentationVEDLIoT Project
 
Platforms for Accelerating the Software Defined and Virtual Infrastructure
Platforms for Accelerating the Software Defined and Virtual InfrastructurePlatforms for Accelerating the Software Defined and Virtual Infrastructure
Platforms for Accelerating the Software Defined and Virtual Infrastructure6WIND
 
IBM System Networking Overview - Jul 2013
IBM System Networking Overview - Jul 2013IBM System Networking Overview - Jul 2013
IBM System Networking Overview - Jul 2013Angel Villar Garea
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGATO project
 
Pristine rina-tnc-2016
Pristine rina-tnc-2016Pristine rina-tnc-2016
Pristine rina-tnc-2016ARCFIRE ICT
 

Ähnlich wie Ocpeu14 (20)

Osnug meetup-tungsten fabric - overview.pptx
Osnug meetup-tungsten fabric - overview.pptxOsnug meetup-tungsten fabric - overview.pptx
Osnug meetup-tungsten fabric - overview.pptx
 
100 M pps on PC.
100 M pps on PC.100 M pps on PC.
100 M pps on PC.
 
Building DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPNBuilding DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPN
 
Cilium - Fast IPv6 Container Networking with BPF and XDP
Cilium - Fast IPv6 Container Networking with BPF and XDPCilium - Fast IPv6 Container Networking with BPF and XDP
Cilium - Fast IPv6 Container Networking with BPF and XDP
 
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
 
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...
PLNOG 17 - Nicolai van der Smagt - Building and connecting the eBay Classifie...
 
Pristine rina-sdk-icc-2016
Pristine rina-sdk-icc-2016Pristine rina-sdk-icc-2016
Pristine rina-sdk-icc-2016
 
FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)
 
FD.IO Vector Packet Processing
FD.IO Vector Packet ProcessingFD.IO Vector Packet Processing
FD.IO Vector Packet Processing
 
FD.io - The Universal Dataplane
FD.io - The Universal DataplaneFD.io - The Universal Dataplane
FD.io - The Universal Dataplane
 
100G Networking Berlin.pdf
100G Networking Berlin.pdf100G Networking Berlin.pdf
100G Networking Berlin.pdf
 
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentationHiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
 
HiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentationHiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentation
 
Mellanox Approach to NFV & SDN
Mellanox Approach to NFV & SDNMellanox Approach to NFV & SDN
Mellanox Approach to NFV & SDN
 
mTCP使ってみた
mTCP使ってみたmTCP使ってみた
mTCP使ってみた
 
Platforms for Accelerating the Software Defined and Virtual Infrastructure
Platforms for Accelerating the Software Defined and Virtual InfrastructurePlatforms for Accelerating the Software Defined and Virtual Infrastructure
Platforms for Accelerating the Software Defined and Virtual Infrastructure
 
IBM System Networking Overview - Jul 2013
IBM System Networking Overview - Jul 2013IBM System Networking Overview - Jul 2013
IBM System Networking Overview - Jul 2013
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
uCluster
uClusteruCluster
uCluster
 
Pristine rina-tnc-2016
Pristine rina-tnc-2016Pristine rina-tnc-2016
Pristine rina-tnc-2016
 

Kürzlich hochgeladen

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Kürzlich hochgeladen (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Ocpeu14

  • 1. Accelerate networking innovation through programmable data plane Removing switches from datacenters with TRILL/VNT and smartNIC Ahmed Amamou, ahmed@gandi.net Benoît Ganne, bganne@kalray.eu
  • 2. •Gandi is a domain name registrars since 1999 and a cloud provider since 2008 •We provide both –IaaS: Infrastructure As A Service –PaaS: Platform As A Service •We support open source community: –Provide open source code : https://github.com/Gandi –Support open source project: VLC, Debian, … * * Check http://www.gandi.net/supports/ for exhaustive list Who is Gandi? 2
  • 3. IaaS new network’s challenges 3 •Cisco Forecast report*: –Cloud traffic was about 3.3 zetta (1021) Bytes in 2013 –Cloud traffic will reach 6.6 zetta Bytes in 2016 –76% of cloud traffic are East-West (within the same datacenter)  A high density of links within a datacenter is needed •Customer need a full network access –Should be isolated – VM network configuration should not be restrictive Overlaying tenant traffic should be considered * Cisco Global Cloud Index Forecast and Methodology, 2011-2016.
  • 4. •New protocols are proposed to solve these problems (TRILL , VXLAN, 802.1 ad STT …) but: – Hardware integration is slow – Protocol extensions are hard to integrate •We believe the OpenCompute community can help us –To define an open, vendor-neutral API for programmable data plane –Bring open hardware fulfilling those needs Why OpenCompute? 4
  • 5. •Switch from classic datacenter architecture to a full-mesh one •Upgrade hardware to improve performances New datacenter architecture 5
  • 6. TRILL @Gandi 6 •Gandi uses commodity hardware as TRILL Rbridges since 2013 •We did not yet found hardware that suits our needs.
  • 7. •Layer 2 Routing Protocol •Uses a control and a data plane •Control plane : based on IS-IS that computes all Routing information •Data plane : forward packets using provided information from control plane •Uses Mac-in-Mac encapsulation TRILL: TRansparent Interconnection of Lot of Links 7 Original payload TRILL Header
  • 8. TRILL benefits 8 Commutation(L2) Routing (L3) TRILL Configuration Minimal Intense Minimal Plug & play Yes No Yes Discovery Automatic Configured Automatic Learning Automatic Configured Automatic Multi path No Yes Yes Convergence Slow Fast Fast Connectivity Inflexible Flexible Flexible Scale Limited Important Important
  • 10. Multitenancy: Virtual Network over TRILL (VNT) •New cloud architectures have to take multitenancy into consideration •TRILL does not provide mechanisms for handling multitenancy → We need to extend it
  • 11. VNT vs TRILL •Update both the control and data planes –Control plane: prune the multicast tree to limit multicast traffic –Data plane: forwarding is conditioned on VNI support [Diagram: VNT encapsulation. Frame fields: Outer Destination MAC Address, Outer Source MAC Address, optional Outer IEEE 802.1Q tag, TRILL Header, VNT Header Extensions, Original Packet Payload (original Ethernet frame). Header detail: Egress RBridge Nickname, Ingress RBridge Nickname, Options description, TLV, VNI Tag (24 bits). Annotations: L2 routing information, tenant identification] •Publication: Amamou, A., Haddadou, K., & Pujolle, G. (2014). A TRILL-based multi-tenant data center network. Computer Networks.
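To make the 24-bit VNI tag concrete, here is a small sketch of encoding and decoding it as a TLV. The slide only states that the tag is 24 bits wide and is carried in a TLV; the type code, the one-byte type and length fields, and the function names below are assumptions made for illustration, not the published VNT wire format.

```python
VNI_BITS = 24                     # from the slide: VNI Tag (24 bits)
HYPOTHETICAL_VNI_TLV_TYPE = 0x01  # assumption: the real type code is not given


def encode_vni_tlv(vni: int) -> bytes:
    """Encode a VNI as type (1 byte), length (1 byte), value (3 bytes, big-endian)."""
    if not 0 <= vni < (1 << VNI_BITS):
        raise ValueError("VNI must fit in 24 bits")
    return bytes([HYPOTHETICAL_VNI_TLV_TYPE, 3]) + vni.to_bytes(3, "big")


def decode_vni_tlv(tlv: bytes) -> int:
    """Decode the VNI from a TLV produced by encode_vni_tlv."""
    tlv_type, length = tlv[0], tlv[1]
    if tlv_type != HYPOTHETICAL_VNI_TLV_TYPE or length != 3:
        raise ValueError("not a VNI TLV")
    return int.from_bytes(tlv[2:5], "big")
```

A 24-bit identifier allows 2^24 (about 16.7 million) tenant networks, compared with the 4094 usable IDs of a 12-bit 802.1Q VLAN tag.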
  • 12. VNT: Multicast tree pruning [Diagram: topology of RBridges n1 to n8 (interfaces i1 to i3) with attached hosts labeled A–Vni1, A–Vni1 and B–Vni1; the full multicast tree spans n1 to n8, and the pruned tree keeps n1, n2, n5 and n6]
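The pruning idea can be sketched as a recursive walk over the distribution tree: a branch is kept only if it leads to at least one RBridge serving the VNI. This is a simplified illustration rather than the algorithm from the cited paper; the tree edges below are invented (the slide's exact edges are not recoverable from the transcript) and `prune_tree` is a hypothetical name.

```python
def prune_tree(children, root, members):
    """Return a pruned children map for one VNI.

    children: dict mapping each node to the list of its children in the tree
    members:  set of RBridges that have at least one local host in the VNI
    A node is kept only if it is a member or has a kept descendant.
    """
    pruned = {}

    def keep(node):
        kept_children = [c for c in children.get(node, []) if keep(c)]
        if node in members or kept_children:
            pruned[node] = kept_children
            return True
        return False

    keep(root)
    return pruned


# Hypothetical tree rooted at n5; only n1 and n6 host VMs in the VNI.
tree = {
    "n5": ["n2", "n8"],
    "n2": ["n1", "n6"],
    "n8": ["n7"],
    "n7": ["n4"],
    "n4": ["n3"],
}
pruned = prune_tree(tree, "n5", members={"n1", "n6"})
```

With this input the n8 branch disappears entirely, so multicast frames for the VNI are never sent toward n8, n7, n4 or n3.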
  • 14. Current VNT implementation on Linux •Control plane: Quagga daemon •Data plane: Linux bridge module •https://github.com/Gandi/
  • 15. Data plane: performance •Throughput is affected by the additional processing operations •Processing time for a single packet is not affected [Charts: Throughput, Delay]
  • 16. Improving performance •Shift the data plane from the host to a smartNIC –Increase performance –Offload the x86 for other uses, e.g. customer workloads [Diagram: with a plain NIC, the host runs both the control plane and the data plane; with a smartNIC, the host runs the control plane and the smartNIC runs the data plane]
  • 17. KALRAY: deterministic supercomputing on a chip •Founded in 2008, fabless semiconductor company •Kalray has developed the disruptive MPPA® (Multi-Purpose Processing Array) programmable architecture –Leading performance/energy ratio worldwide –Time predictability and low latency –Heterogeneous applications on the same chip –High programmability •Working with industry-leading partners and customers •55 employees •Offices in France and the US •First MPPA®-256 chips in TSMC 28 nm CMOS
  • 18. MPPA®-256 Bostan Networking Strengths •Software Defined NIC –Smart packet classification/dispatching –256 cores for packet processing –Standard C/C++ with GCC-4.9 –Advanced debugging and profiling •Low latency –Zero-copy Ethernet ↔ PCIe –< 1 μs port-to-port in transparent mode –< 1 μs port to system memory •System integration –Linux support –Virtualization support –Low power •High throughput / line rate –80 Gbps full-duplex line rate (2x120 MPPS) –3400 instructions per packet @ 64B –AES, SHA-1, SHA-2, CRC accelerators –2 x PCIe Gen3 8-lanes
  • 19. MPPA®-256 Bostan •64-bit processor •Up to 800 MHz •High performance –845 GFLOPS SP / 422 GFLOPS DP –1 TOPS •High-bandwidth Network-on-Chip –2 x 12.8 GB/s •High-speed Ethernet –Up to 2 x 40 Gbps / 2 x 120 MPPS @ 64B •DDR3 memory interfaces –2 x 64-bit + ECC @ 2133 MT/s / 2 x 17 GB/s •PCIe Gen3 interface –2 x 8 lanes / 2 x 8 GB/s full duplex –Endpoint / Root Complex •NoCX extension –2 x 40 Gbps + 2 x 80 Gbps ILK •Flash controller, GPIOs…
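The 120 MPPS figure can be checked with a short line-rate calculation. The sketch below assumes standard Ethernet framing overhead (8 bytes of preamble and start-of-frame delimiter plus a 12-byte inter-frame gap per frame) and reads "2 x 120 MPPS" as roughly 120 million packets per second in each direction across 80 Gbps; the cycle budget simply divides the aggregate clock of 256 cores at 800 MHz by that packet rate and does not try to reproduce the 3400 instructions-per-packet figure quoted on the slides.

```python
PREAMBLE_AND_SFD_BYTES = 8   # standard Ethernet preamble + start-of-frame delimiter
INTERFRAME_GAP_BYTES = 12    # standard minimum inter-frame gap


def line_rate_pps(link_bps: float, frame_bytes: int) -> float:
    """Maximum packets per second on a link for a given Ethernet frame size."""
    wire_bits = (frame_bytes + PREAMBLE_AND_SFD_BYTES + INTERFRAME_GAP_BYTES) * 8
    return link_bps / wire_bits


# 2 x 40 Gbps of Ethernet, minimum-size 64-byte frames, one direction.
pps_80g = line_rate_pps(80e9, 64)

# Aggregate core cycles available per packet at that rate.
cores, clock_hz = 256, 800e6
cycles_per_packet = cores * clock_hz / pps_80g

print(f"{pps_80g / 1e6:.1f} Mpps, {cycles_per_packet:.0f} core cycles per packet")
```

Under these assumptions the result is about 119 Mpps per direction, which is consistent with the rounded 120 MPPS on the slide.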
  • 20. MPPA®-256 Processor Hierarchical Architecture •256 Processing Engine cores + 32 Resource Management cores [Diagram: manycore processor (process-level parallelism), compute cluster (thread-level parallelism), VLIW core (instruction-level parallelism)]
  • 21. High Speed Ethernet Packet processing •Ethernet Rx dispatcher (patent pending) –8 classification tables •Classify •Extract fields •Smart dispatch –Round-robin –Flexible core allocation •Round-robin vs. classification •Per 10G port •Ethernet Tx –64 Tx FIFOs –QoS between the FIFOs –Flow control between clusters and Tx FIFOs
  • 22. VNT on a programmable data plane: multicast forwarding example •Ongoing work between Gandi and Kalray –Explore programmable data plane opportunities –Study the feasibility and architecture of a VNT smartNIC •Multicast forwarding puts a high load on each node [Diagram labels: x86 hypervisor, Kalray Bostan smartNIC, MPPA Linux Ethernet driver, Linux networking stack, TRILL controller, userspace application, IO Ethernet driver, 8x10GbE]
  • 23. VNT on a programmable data plane: multicast forwarding example •Dispatch the packet based on the Egress RBridge –In case of multicast, the Egress RBridge is set to the tree root –Each cluster “owns” a subset of the possible Egress RBridges (i.e. a FIB subset) •Example packet: <Ethertype=TRILL, Egress=DTROOT, VNI=VNI-1> •Dispatch rule: if (Packet[Ethertype] == TRILL) { send to cluster #HASH(Egress RBridge) }
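The dispatch rule can be sketched as a function that maps a received frame to the cluster owning its Egress RBridge. This is an illustration only: the cluster count of 16, the modulo hash, and the assumption that the outer frame carries no 802.1Q tag are choices made for the example, and the function names are hypothetical rather than part of any Kalray API.

```python
TRILL_ETHERTYPE = 0x22F3  # Ethertype assigned to TRILL (RFC 6325)
NUM_CLUSTERS = 16         # assumption for this example


def owner_cluster(egress_nickname: int, num_clusters: int = NUM_CLUSTERS) -> int:
    """Stand-in for HASH(Egress RBridge): map a nickname to its owner cluster."""
    return egress_nickname % num_clusters


def dispatch(frame: bytes):
    """Return the owner cluster for a TRILL frame, or None for other traffic.

    Assumes an untagged outer Ethernet header: destination MAC (6 bytes),
    source MAC (6 bytes), Ethertype (2 bytes), then the TRILL header, whose
    egress nickname sits 2 bytes into it.
    """
    ethertype = int.from_bytes(frame[12:14], "big")
    if ethertype != TRILL_ETHERTYPE:
        return None
    egress_nickname = int.from_bytes(frame[16:18], "big")
    return owner_cluster(egress_nickname)
```

Because the hash depends only on the egress nickname, every frame for a given RBridge (or, for multicast, a given tree root) lands on the same cluster, so each cluster only needs its own subset of the FIB.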
  • 25. VNT on a programmable data plane: multicast forwarding example •Look up the list of next-hop RBridges for this multicast tree –RBridge owner clusters can be local or remote •Look up the LIB for local ports, if any •FIB[Egress RBridge] = { Egress RBridge MAC; Egress RBridge Interface; MCTree = [ RBx, RBy, … ]; VNI = [ VNI-1, VNI-2, … ]; } •LIB = { (Local MACx, Local Portx, VNI-1); … }
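The FIB and LIB structures shown on this slide could be modeled as follows. The field names mirror the slide; the Python types, the sample values and the `lookup` helper are assumptions added for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class FibEntry:
    mac: str                                     # Egress RBridge MAC
    interface: str                               # Egress RBridge interface
    mc_tree: list = field(default_factory=list)  # next-hop RBridges for this tree
    vnis: set = field(default_factory=set)       # VNIs supported by this RBridge


@dataclass(frozen=True)
class LibEntry:
    mac: str   # local VM MAC address
    port: str  # local port
    vni: int   # tenant network the VM belongs to


def lookup(fib, lib, egress, vni):
    """Return (next-hop RBridges of the tree, local ports attached to the VNI)."""
    entry = fib.get(egress)
    next_hops = list(entry.mc_tree) if entry else []
    local_ports = [e.port for e in lib if e.vni == vni]
    return next_hops, local_ports


# Made-up sample data: a tree rooted at "DTROOT" with two next hops.
fib = {"DTROOT": FibEntry("02:00:00:00:00:01", "eth0", ["RBx", "RBy"], {1, 2})}
lib = [LibEntry("52:54:00:00:00:0a", "vport1", 1), LibEntry("52:54:00:00:00:0b", "vport2", 2)]
next_hops, local_ports = lookup(fib, lib, "DTROOT", vni=1)
```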
  • 26. VNT on a programmable data plane: multicast forwarding example •Forward the frame –Remote: forward to the clusters owning the next-hop RBridges –Local: decapsulate the inner frame and forward it to the local VM
  • 27. VNT on a programmable data plane: multicast forwarding example •Check whether the RBridge supports the appropriate VNI –If yes, forward to the RBridge –If not, stop here •FIB[Egress RBridge] = { Egress RBridge MAC; Egress RBridge Interface; MCTree = [ RBx, RBy, … ]; VNI = [ VNI-1, VNI-2, … ]; }
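Putting the steps of this example together, the forwarding decision for one multicast frame might look like the sketch below. It is a simplified model under stated assumptions: the FIB is a plain dictionary keyed by RBridge nickname, the LIB is a list of (MAC, port, VNI) tuples as on the slide, and the function name and action labels are invented for the example.

```python
def forward_multicast(fib, lib, tree_root, vni):
    """Return the list of actions for a multicast frame on a given tree and VNI.

    Remote: forward to each next-hop RBridge of the tree that supports the VNI.
    Local:  deliver the decapsulated inner frame to each local port in the VNI.
    """
    actions = []
    for next_hop in fib.get(tree_root, {}).get("mc_tree", []):
        # VNI check: stop here if the next-hop RBridge does not serve the tenant.
        if vni in fib.get(next_hop, {}).get("vnis", ()):
            actions.append(("forward", next_hop))
    for _mac, port, entry_vni in lib:
        if entry_vni == vni:
            actions.append(("deliver_local", port))
    return actions


# Made-up sample data: RBy does not support VNI 1, so it is skipped.
fib = {
    "DTROOT": {"mc_tree": ["RBx", "RBy"], "vnis": {1, 2}},
    "RBx": {"mc_tree": [], "vnis": {1}},
    "RBy": {"mc_tree": [], "vnis": {2}},
}
lib = [("52:54:00:00:00:0a", "vport1", 1)]
actions = forward_multicast(fib, lib, "DTROOT", vni=1)
```

The VNI check is what keeps a tenant's multicast traffic off links that lead only to RBridges with no member of that tenant network.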
  • 29. Innovation and efficiency •Solving SDN and network virtualization challenges requires new protocols –e.g. VXLAN, NVGRE, TRILL/VNT… •Efficiency generally means hardware support …but hardware development cannot keep up with software, which slows down innovation •Gandi and Kalray think a programmable data plane can reconcile efficiency and innovation …but we need open ecosystems, standards and APIs
  • 30. Thank you for your attention! Questions? Ahmed Amamou, ahmed@gandi.net Benoît Ganne, bganne@kalray.eu