SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Sean Hefty
Openfabrics Interfaces Working Group Co-Chair
Intel
November 2016
OPENFABRICS INTERFACES:
PAST, PRESENT, AND FUTURE
33
Scalable	
  
Implementa-on	
  
Agnos-c	
  
OFIWG: develop … interfaces aligned
with … application needs
So2ware	
  interfaces	
  aligned	
  with	
  
applica-on	
  requirements	
  
• Careful	
  analysis	
  of	
  requirement	
  
Expand	
  open	
  source	
  community	
  
• Inclusive	
  development	
  effort	
  
• App	
  and	
  HW	
  developers	
  
Good	
  impedance	
  match	
  with	
  mul-ple	
  
fabric	
  hardware	
  
• InfiniBand*,	
  iWarp,	
  RoCE,	
  Ethernet,	
  
UDP	
  offload,	
  Intel®,	
  Cray*,	
  IBM*,	
  others	
  
Open	
  Source	
   Applica-on-­‐Centric	
  
libfabric
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
Op-mized	
  SW	
  path	
  to	
  HW	
  
• Minimize	
  cache/memory	
  footprint	
  
• Reduce	
  instrucLon	
  count	
  
• Minimize	
  memory	
  accesses	
  
4
OFI APPLICATION REQUIREMENTS
Give us a high-
level interface!
Give us a low-
level interface!
MPI developers
OFI strives to meet
both requirements
5
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Fabric	
  Services	
  
Application
OFI
Provider
Application
OFI
Provider
Provider	
  opLmizes	
  for	
  
OFI	
  features	
  
Common	
  opLmizaLon	
  
for	
  all	
  apps/providers	
  
App	
  uses	
  OFI	
  features	
  
Application
OFI
Provider
App	
  opLmizes	
  based	
  
on	
  supported	
  features	
  
Provider	
  supports	
  low-­‐level	
  features	
  only	
  
OFI SOFTWARE DEVELOPMENT STRATEGIES
One Size Does Not Fit All
OFI DEVELOPMENT STATUS
6
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Fabric	
  Services	
  
Application
libfabric
Provider
Provider optimizes
for OFI features
Common	
  opLmizaLon	
  
for	
  all	
  apps/providers	
  
Provider supports low-
level features only
Many	
  apps	
   Few	
  apps	
  
Provider’s	
  choice	
  
App	
  opLmizes	
  based	
  
on	
  supported	
  features	
  
App uses OFI
features
OFI-provider
gap
6
OFI LIBFABRIC COMMUNITY
7
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
libfabric	
  
Intel®	
  MPI	
  
Library	
  
MPICH	
  
Netmod/CH4	
  
Open	
  MPI	
  
MTL/BTL	
  
Open	
  MPI	
  
SHMEM	
  
Sandia	
  
SHMEM	
  
GASNet	
  
Clang	
  
UPC	
  
rsocket	
  
ES-­‐API	
  
libfabric	
  Enabled	
  Middleware	
  
Control	
  Services	
   CommunicaLon	
  
Services	
  
CompleLon	
  
Services	
  
Data	
  Transfer	
  Services	
  
Discovery	
  
fi_info	
  
ConnecLon	
  
Management	
  
Address	
  
Vectors	
  
Event	
  
Queues	
  
Event	
  
Counters	
  
Message	
  
Queue	
  
Tag	
  
Matching	
  
RMA	
  
Atomics	
  
Sockets	
  
TCP,	
  UDP	
  
Verbs	
  
Cisco	
  
usNIC	
  
Intel	
  
OPA	
  PSM	
  
Cray	
  
GNI	
  
Mellanox	
  
MXM	
  
IBM	
  Blue	
  
Gene	
  
A3Cube	
  
RONNIE	
  
* * * * *®
experimental	
  supported	
  
*
Because of the OFI-provider gap,
not all apps work with all providers
LIBFABRIC SCALABILITY
8
By Courtesy Argonne* National Laboratory, CC BY 2.0,
https://commons.wikimedia.org/w/index.php?curid=24653857
Developed	
  to	
  evaluate	
  the	
  
Aurora	
  so_ware	
  stack	
  at	
  scale	
  
and	
  assist	
  applicaLons	
  in	
  the	
  
transiLon	
  from	
  Mira	
  to	
  Aurora	
  
NaLve	
  provider	
  implementaLon	
  that	
  directly	
  
uses	
  the	
  Blue	
  Gene/Q	
  hardware	
  and	
  network	
  
interfaces	
  for	
  communicaLon	
  
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
Blue Gene / Q
§  IBM* MPICH / PAMI
• IBM XL C compiler for BG, v12.1
• Optimized for single-threaded latency
• …/comm/xl.legacy.ndebug/bin/mpicc
• v1r2m2
§  MPICH / CH4 / libfabric
• gcc 4.4.7
• global locks, inline, direct, etc.
• Provider not optimized for performance
PAMI	
  
MPICH	
  
PAMID	
  
hardware	
  
BG/Q	
  
Provider	
  
libfabric	
  
MPICH	
  
CH4	
  OFI	
  
Completely	
  subjec.ve	
  
so_ware	
  stack	
  comparison	
  
vs	
  
32	
  nodes	
  on	
  ALCF	
  
Vesta	
  machine	
  
PAMI and libfabric
performance
LIBFABRIC SCALABILITY
9
Blue Gene / Q
10
1	
  
2	
  
4	
  
8	
  
16	
  
1	
   8	
   64	
   512	
   4096	
  
Latency	
  (us)	
  
Bytes	
  
IBM	
   OFI	
  
OSU* MPI
Performance
Tests v5.0
0.1	
  
1	
  
10	
  
100	
  
1000	
  
1	
   8	
   64	
   512	
   4096	
   32768	
  
Bandwidth	
  (MB/s)	
  
Bytes	
  
IBM	
   OFI	
  
100,000	
  
1,000,000	
  
10,000,000	
  
1	
   8	
   64	
   512	
   4096	
  32768	
  
Msgs/s	
  
Bytes	
  
IBM	
   OFI	
  
MPI scale out testing:
- cpi – 1M ranks,
- ISx benchmark – 0.5M ranks
Tests	
  document	
  performance	
  of	
  components	
  on	
  a	
  parLcular	
  test,	
  in	
  specific	
  
systems.	
  Differences	
  in	
  hardware,	
  so_ware,	
  or	
  configuraLon	
  will	
  affect	
  actual	
  
performance.	
  Consult	
  other	
  sources	
  of	
  informaLon	
  to	
  evaluate	
  performance	
  as	
  
you	
  consider	
  your	
  purchase.	
  	
  For	
  more	
  complete	
  informaLon	
  about	
  
performance	
  and	
  benchmark	
  results,	
  visit	
  hkp://www.intel.com/performance.	
  
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  others	
  
LIBFABRIC SCALABILITY
Blue Gene / Q
LIBFABRIC SCALABILITY
11
Evaluate	
  libfabric	
  SHMEM	
  
performance	
  on	
  high-­‐
performance	
  interconnect	
  
Provider	
  implementaLon	
  that	
  uses	
  the	
  
Cray*	
  uGNI	
  hardware	
  and	
  network	
  
interface	
  for	
  communicaLon	
  
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
Computing Sciences
Lawrence Berkeley National Laboratory
SHMEM
CRAY XC40
§  Cray* SHMEM
• Cray* Aries, Dragonfly* topology
• CLE (Cray* Linux*), SLURM*
• DMAPP
• Designed for PGAS
• Optimized for small messages
§  Sandia* OpenSHMEM / libfabric
•  uGNI
•  Designed for MPI and PGAS
•  Optimized for large messages
§  https://www.nersc.gov/users/computational-systems/cori/
configuration
DMAPP	
  
Cray	
  
SHMEM	
  
Aries	
  Interconnect	
  
uGNI	
  
libfabric	
  
Open	
  
SHMEM	
  
OFI	
  
vs	
  
1630	
  nodes	
  on	
  
Cray*	
  XC40	
  (Cori)	
  
LIBFABRIC SCALABILITY
12
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
SHMEM
CRAY XC40
13
Tests	
  document	
  performance	
  of	
  components	
  on	
  a	
  parLcular	
  test,	
  in	
  specific	
  
systems.	
  Differences	
  in	
  hardware,	
  so_ware,	
  or	
  configuraLon	
  will	
  affect	
  actual	
  
performance.	
  Consult	
  other	
  sources	
  of	
  informaLon	
  to	
  evaluate	
  performance	
  as	
  
you	
  consider	
  your	
  purchase.	
  	
  For	
  more	
  complete	
  informaLon	
  about	
  
performance	
  and	
  benchmark	
  results,	
  visit	
  hkp://www.intel.com/performance.	
  
LIBFABRIC SCALABILITY
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  others	
  
Put – up to 61% improvement
Get – within 2%
Blocking Get/Put B/W
SHMEM
CRAY XC40
14
Tests	
  document	
  performance	
  of	
  components	
  on	
  a	
  parLcular	
  test,	
  in	
  specific	
  
systems.	
  Differences	
  in	
  hardware,	
  so_ware,	
  or	
  configuraLon	
  will	
  affect	
  actual	
  
performance.	
  Consult	
  other	
  sources	
  of	
  informaLon	
  to	
  evaluate	
  performance	
  as	
  
you	
  consider	
  your	
  purchase.	
  	
  For	
  more	
  complete	
  informaLon	
  about	
  
performance	
  and	
  benchmark	
  results,	
  visit	
  hkp://www.intel.com/performance.	
  
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  others	
  
XPMEM
Improved scalability
GUPS Scaling
slight improvement
(lower is better)
LIBFABRIC SCALABILITY
NAS ISx (Integer Sort)
weak scaling
SHMEM
CRAY XC40
ADDRESSING THE OFI-PROVIDER GAP
15
Libfabric Framework
libfabric	
  API	
  
Components	
  
templates,	
  lists,	
  rbtree,	
  hash	
  
table,	
  free	
  pool,	
  ring	
  buffer,	
  
stack,	
  …	
  
Base	
  Class	
  Implementa-ons	
  
fabric,	
  domain,	
  EQ,	
  wait	
  sets,	
  
AV,	
  CQ,	
  …	
  SHM	
  primiLves	
  
Provider	
  Services	
  
•  Logging	
  
•  Environment	
  variables	
  
U-lity	
  Provider	
  
Core	
  Provider	
  
Interface	
  ‘extensions’	
  
–	
  for	
  consistency	
  
Assist	
  in	
  provider	
  
development	
  
Enhance	
  core	
  
provider	
  
UTILITY PROVIDER
16
Performance is a
primary objective
MOVING FORWARD
17
Beyond	
  HPC	
  
Enterprise,	
  Cloud,	
  Storage	
  (NVM)	
  
Stronger	
  engagement	
  
with	
  these	
  communiLes	
  
Beyond	
  Linux*	
  
Sockets	
  –	
  TCP/UDP	
   NetworkDirect	
  
Analyze requests
to expand OFI
community
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  
others	
  
TARGET SCHEDULE
18
§  Driven	
  by	
  implementaLon	
  feedback	
  
§  Improve	
  error	
  handling,	
  flow	
  control	
  
§  Beker	
  support	
  for	
  non-­‐tradiLonal	
  fabrics	
  
§  OpLmize	
  compleLon	
  handling	
  
§  Address	
  deferred	
  features	
  
2016	
   Q2	
   Q3	
   Q4	
   2017	
   Q2	
   Q3	
   Q4	
  
RDM	
  over	
  DGRAM	
  ULl	
  
RDM	
  over	
  MSG	
  ULl	
  
Shared	
  Memory	
  
New	
  Core	
  Providers	
  
ABI	
  1.1	
  
ULlity	
  provider	
  is	
  
ongoing	
  
TradiLonal	
  and	
  non-­‐tradiLonal	
  
RDMA	
  providers	
  
SUMMARY
19
§ OFIWG development model
working well
§ Interest in OFI and libfabric is
high
§ Growing community
§ Significant effort being made to
simplify the lives of developers
• Applications and providers
OFI	
  is	
  so	
  
good	
  
LEGAL DISCLAIMER & OPTIMIZATION NOTICE
20
§  No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel disclaims all
express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-
infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. This document contains
information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact
your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. The products and services described may contain
defects or errors known as errata which may cause deviations from published specifications. Current characterized errata are available on
request.
§  Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests,
such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any
change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully
evaluating your contemplated purchases, including the performance of that product when combined with other products.
§  Copyright © 2016, Intel Corporation. All rights reserved. Intel, Pentium, Xeon, Xeon Phi, Core, VTune, Cilk, and the Intel logo are trademarks of
Intel Corporation in the U.S. and other countries.
§  *Other names and brands may be claimed as the property of others
Optimization Notice
Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not
unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other
optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on
microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use
with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel
microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the
specific instruction sets covered by this notice.
Notice revision #20110804
Thank you for your time!
Sean Hefty
sean.hefty@intel.com
www.intel.com/hpcdevcon

Weitere ähnliche Inhalte

Was ist angesagt?

"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...Edge AI and Vision Alliance
 
Develop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyDevelop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyIntel IT Center
 
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...GlobalLogic Ukraine
 
Application Report: Migrating from Discrete to Virtual Servers
Application Report: Migrating from Discrete to Virtual ServersApplication Report: Migrating from Discrete to Virtual Servers
Application Report: Migrating from Discrete to Virtual ServersIT Brand Pulse
 
"An Update on Open Standard APIs for Vision Processing," a Presentation from ...
"An Update on Open Standard APIs for Vision Processing," a Presentation from ..."An Update on Open Standard APIs for Vision Processing," a Presentation from ...
"An Update on Open Standard APIs for Vision Processing," a Presentation from ...Edge AI and Vision Alliance
 
A Java Implementer's Guide to Better Apache Spark Performance
A Java Implementer's Guide to Better Apache Spark PerformanceA Java Implementer's Guide to Better Apache Spark Performance
A Java Implementer's Guide to Better Apache Spark PerformanceTim Ellison
 
Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you Bruno Cornec
 
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”Java Webinar #9: “Raspberry Pi Platform for Java Programmers”
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”GlobalLogic Ukraine
 
DCC Labs Overview
DCC Labs OverviewDCC Labs Overview
DCC Labs OverviewDCC Labs
 
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3Qualcomm Developer Network
 
Eclipse OMR: a modern toolkit for building language runtimes
Eclipse OMR: a modern toolkit for building language runtimesEclipse OMR: a modern toolkit for building language runtimes
Eclipse OMR: a modern toolkit for building language runtimesMark Stoodley
 
Developer's Guide to Knights Landing
Developer's Guide to Knights LandingDeveloper's Guide to Knights Landing
Developer's Guide to Knights LandingAndrey Vladimirov
 
Simulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersSimulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersEric Van Hensbergen
 
IPMI is dead, Long live Redfish
IPMI is dead, Long live RedfishIPMI is dead, Long live Redfish
IPMI is dead, Long live RedfishBruno Cornec
 

Was ist angesagt? (20)

Shree_Duth_Awasthi_Resume
Shree_Duth_Awasthi_ResumeShree_Duth_Awasthi_Resume
Shree_Duth_Awasthi_Resume
 
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
 
Develop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyDevelop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster Ready
 
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...
Embedded Webinar #12 “GloDroid or Boosting True Open Source Android Stack Dev...
 
Application Report: Migrating from Discrete to Virtual Servers
Application Report: Migrating from Discrete to Virtual ServersApplication Report: Migrating from Discrete to Virtual Servers
Application Report: Migrating from Discrete to Virtual Servers
 
"An Update on Open Standard APIs for Vision Processing," a Presentation from ...
"An Update on Open Standard APIs for Vision Processing," a Presentation from ..."An Update on Open Standard APIs for Vision Processing," a Presentation from ...
"An Update on Open Standard APIs for Vision Processing," a Presentation from ...
 
A Java Implementer's Guide to Better Apache Spark Performance
A Java Implementer's Guide to Better Apache Spark PerformanceA Java Implementer's Guide to Better Apache Spark Performance
A Java Implementer's Guide to Better Apache Spark Performance
 
Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you Isn’t it Ironic that a Redfish is software defining you
Isn’t it Ironic that a Redfish is software defining you
 
Collabograte
CollabograteCollabograte
Collabograte
 
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”Java Webinar #9: “Raspberry Pi Platform for Java Programmers”
Java Webinar #9: “Raspberry Pi Platform for Java Programmers”
 
Benchmarking Market Pricing
Benchmarking Market PricingBenchmarking Market Pricing
Benchmarking Market Pricing
 
HOW Series: Knights Landing
HOW Series: Knights LandingHOW Series: Knights Landing
HOW Series: Knights Landing
 
DCC Labs Overview
DCC Labs OverviewDCC Labs Overview
DCC Labs Overview
 
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3
Developing for Industrial IoT with Linux OS on DragonBoard™ 410c: Session 3
 
Eclipse OMR: a modern toolkit for building language runtimes
Eclipse OMR: a modern toolkit for building language runtimesEclipse OMR: a modern toolkit for building language runtimes
Eclipse OMR: a modern toolkit for building language runtimes
 
Developer's Guide to Knights Landing
Developer's Guide to Knights LandingDeveloper's Guide to Knights Landing
Developer's Guide to Knights Landing
 
Simulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersSimulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to Supercomputers
 
Open j9 jdk on RISC-V
Open j9 jdk on RISC-VOpen j9 jdk on RISC-V
Open j9 jdk on RISC-V
 
IPMI is dead, Long live Redfish
IPMI is dead, Long live RedfishIPMI is dead, Long live Redfish
IPMI is dead, Long live Redfish
 
KIRANKUMAR_MV
KIRANKUMAR_MVKIRANKUMAR_MV
KIRANKUMAR_MV
 

Ähnlich wie Intel the-latest-on-ofi

OFI Overview 2019 Webinar
OFI Overview 2019 WebinarOFI Overview 2019 Webinar
OFI Overview 2019 Webinarseanhefty
 
Learn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVLearn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVGhodhbane Mohamed Amine
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Ahead
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road AheadAmazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Ahead
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Aheadinside-BigData.com
 
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...PROIDEA
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpenInfra Days Poland 2019
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open NetworkingCumulus Networks
 
Redpanda and ClickHouse
Redpanda and ClickHouseRedpanda and ClickHouse
Redpanda and ClickHouseAltinity Ltd
 
NFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkNFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkMichelle Holley
 
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...Ian Choi
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stackinside-BigData.com
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015WaveMaker, Inc.
 
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStack
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStackPLNOG 17 - Grzegorz Kornacki - F5 and OpenStack
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStackPROIDEA
 
Kartik_Parmar_Resume_2016
Kartik_Parmar_Resume_2016Kartik_Parmar_Resume_2016
Kartik_Parmar_Resume_2016Kartik Parmar
 
Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015Patrick Chanezon
 
Kafka Summit SF 2017 - Kafka and the Polyglot Programmer
Kafka Summit SF 2017 - Kafka and the Polyglot ProgrammerKafka Summit SF 2017 - Kafka and the Polyglot Programmer
Kafka Summit SF 2017 - Kafka and the Polyglot Programmerconfluent
 
Open Stack Cloud Services
Open Stack Cloud ServicesOpen Stack Cloud Services
Open Stack Cloud ServicesSaurabh Gupta
 

Ähnlich wie Intel the-latest-on-ofi (20)

OFI Overview 2019 Webinar
OFI Overview 2019 WebinarOFI Overview 2019 Webinar
OFI Overview 2019 Webinar
 
Learn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVLearn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFV
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
TFI2014 Session II - Requirements for SDN - Brian Field
TFI2014 Session II - Requirements for SDN - Brian FieldTFI2014 Session II - Requirements for SDN - Brian Field
TFI2014 Session II - Requirements for SDN - Brian Field
 
Avoiding cloud lock-in
Avoiding cloud lock-inAvoiding cloud lock-in
Avoiding cloud lock-in
 
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Ahead
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road AheadAmazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Ahead
Amazon Elastic Fabric Adapter: Anatomy, Capabilities, and the Road Ahead
 
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...
PLNOG14: The benefits of "OPEN" in networking for operators - Joerg Ammon, Br...
 
What's New in RHEL 6 for Linux on System z?
What's New in RHEL 6 for Linux on System z?What's New in RHEL 6 for Linux on System z?
What's New in RHEL 6 for Linux on System z?
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi Alkobi
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open Networking
 
Redpanda and ClickHouse
Redpanda and ClickHouseRedpanda and ClickHouse
Redpanda and ClickHouse
 
NFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkNFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function Framework
 
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...
한국통신학회 워크샵: SDN/NFV for Secure Services - Understanding Open Source SDN Contr...
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stack
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015
 
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStack
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStackPLNOG 17 - Grzegorz Kornacki - F5 and OpenStack
PLNOG 17 - Grzegorz Kornacki - F5 and OpenStack
 
Kartik_Parmar_Resume_2016
Kartik_Parmar_Resume_2016Kartik_Parmar_Resume_2016
Kartik_Parmar_Resume_2016
 
Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015
 
Kafka Summit SF 2017 - Kafka and the Polyglot Programmer
Kafka Summit SF 2017 - Kafka and the Polyglot ProgrammerKafka Summit SF 2017 - Kafka and the Polyglot Programmer
Kafka Summit SF 2017 - Kafka and the Polyglot Programmer
 
Open Stack Cloud Services
Open Stack Cloud ServicesOpen Stack Cloud Services
Open Stack Cloud Services
 

Kürzlich hochgeladen

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 

Kürzlich hochgeladen (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 

Intel the-latest-on-ofi

  • 1.
  • 2. Sean Hefty Openfabrics Interfaces Working Group Co-Chair Intel November 2016 OPENFABRICS INTERFACES: PAST, PRESENT, AND FUTURE
  • 3. 33 Scalable   Implementa-on   Agnos-c   OFIWG: develop … interfaces aligned with … application needs So2ware  interfaces  aligned  with   applica-on  requirements   • Careful  analysis  of  requirement   Expand  open  source  community   • Inclusive  development  effort   • App  and  HW  developers   Good  impedance  match  with  mul-ple   fabric  hardware   • InfiniBand*,  iWarp,  RoCE,  Ethernet,   UDP  offload,  Intel®,  Cray*,  IBM*,  others   Open  Source   Applica-on-­‐Centric   libfabric *  Other  names  and  brands  may  be  claimed  as  the  property  of   others   Op-mized  SW  path  to  HW   • Minimize  cache/memory  footprint   • Reduce  instrucLon  count   • Minimize  memory  accesses  
  • 4. 4 OFI APPLICATION REQUIREMENTS Give us a high- level interface! Give us a low- level interface! MPI developers OFI strives to meet both requirements
  • 5. 5                        Fabric  Services   Application OFI Provider Application OFI Provider Provider  opLmizes  for   OFI  features   Common  opLmizaLon   for  all  apps/providers   App  uses  OFI  features   Application OFI Provider App  opLmizes  based   on  supported  features   Provider  supports  low-­‐level  features  only   OFI SOFTWARE DEVELOPMENT STRATEGIES One Size Does Not Fit All
  • 6. OFI DEVELOPMENT STATUS 6                        Fabric  Services   Application libfabric Provider Provider optimizes for OFI features Common  opLmizaLon   for  all  apps/providers   Provider supports low- level features only Many  apps   Few  apps   Provider’s  choice   App  opLmizes  based   on  supported  features   App uses OFI features OFI-provider gap 6
  • 7. OFI LIBFABRIC COMMUNITY 7 *  Other  names  and  brands  may  be  claimed  as  the  property  of   others   libfabric   Intel®  MPI   Library   MPICH   Netmod/CH4   Open  MPI   MTL/BTL   Open  MPI   SHMEM   Sandia   SHMEM   GASNet   Clang   UPC   rsocket   ES-­‐API   libfabric  Enabled  Middleware   Control  Services   CommunicaLon   Services   CompleLon   Services   Data  Transfer  Services   Discovery   fi_info   ConnecLon   Management   Address   Vectors   Event   Queues   Event   Counters   Message   Queue   Tag   Matching   RMA   Atomics   Sockets   TCP,  UDP   Verbs   Cisco   usNIC   Intel   OPA  PSM   Cray   GNI   Mellanox   MXM   IBM  Blue   Gene   A3Cube   RONNIE   * * * * *® experimental  supported   * Because of the OFI-provider gap, not all apps work with all providers
  • 8. LIBFABRIC SCALABILITY 8 By Courtesy Argonne* National Laboratory, CC BY 2.0, https://commons.wikimedia.org/w/index.php?curid=24653857 Developed  to  evaluate  the   Aurora  so_ware  stack  at  scale   and  assist  applicaLons  in  the   transiLon  from  Mira  to  Aurora   NaLve  provider  implementaLon  that  directly   uses  the  Blue  Gene/Q  hardware  and  network   interfaces  for  communicaLon   *  Other  names  and  brands  may  be  claimed  as  the  property  of   others   Blue Gene / Q
  • 9. §  IBM* MPICH / PAMI • IBM XL C compiler for BG, v12.1 • Optimized for single-threaded latency • …/comm/xl.legacy.ndebug/bin/mpicc • v1r2m2 §  MPICH / CH4 / libfabric • gcc 4.4.7 • global locks, inline, direct, etc. • Provider not optimized for performance PAMI   MPICH   PAMID   hardware   BG/Q   Provider   libfabric   MPICH   CH4  OFI   Completely  subjec.ve   so_ware  stack  comparison   vs   32  nodes  on  ALCF   Vesta  machine   PAMI and libfabric performance LIBFABRIC SCALABILITY 9 Blue Gene / Q
  • 10. 10 1   2   4   8   16   1   8   64   512   4096   Latency  (us)   Bytes   IBM   OFI   OSU* MPI Performance Tests v5.0 0.1   1   10   100   1000   1   8   64   512   4096   32768   Bandwidth  (MB/s)   Bytes   IBM   OFI   100,000   1,000,000   10,000,000   1   8   64   512   4096  32768   Msgs/s   Bytes   IBM   OFI   MPI scale out testing: - cpi – 1M ranks, - ISx benchmark – 0.5M ranks Tests  document  performance  of  components  on  a  parLcular  test,  in  specific   systems.  Differences  in  hardware,  so_ware,  or  configuraLon  will  affect  actual   performance.  Consult  other  sources  of  informaLon  to  evaluate  performance  as   you  consider  your  purchase.    For  more  complete  informaLon  about   performance  and  benchmark  results,  visit  hkp://www.intel.com/performance.   *  Other  names  and  brands  may  be  claimed  as  the  property  of  others   LIBFABRIC SCALABILITY Blue Gene / Q
  • 11. LIBFABRIC SCALABILITY 11 Evaluate  libfabric  SHMEM   performance  on  high-­‐ performance  interconnect   Provider  implementaLon  that  uses  the   Cray*  uGNI  hardware  and  network   interface  for  communicaLon   *  Other  names  and  brands  may  be  claimed  as  the  property  of   others   Computing Sciences Lawrence Berkeley National Laboratory SHMEM CRAY XC40
  • 12. §  Cray* SHMEM • Cray* Aries, Dragonfly* topology • CLE (Cray* Linux*), SLURM* • DMAPP • Designed for PGAS • Optimized for small messages §  Sandia* OpenSHMEM / libfabric •  uGNI •  Designed for MPI and PGAS •  Optimized for large messages §  https://www.nersc.gov/users/computational-systems/cori/ configuration DMAPP   Cray   SHMEM   Aries  Interconnect   uGNI   libfabric   Open   SHMEM   OFI   vs   1630  nodes  on   Cray*  XC40  (Cori)   LIBFABRIC SCALABILITY 12 *  Other  names  and  brands  may  be  claimed  as  the  property  of   others   SHMEM CRAY XC40
  • 13. 13 Tests  document  performance  of  components  on  a  parLcular  test,  in  specific   systems.  Differences  in  hardware,  so_ware,  or  configuraLon  will  affect  actual   performance.  Consult  other  sources  of  informaLon  to  evaluate  performance  as   you  consider  your  purchase.    For  more  complete  informaLon  about   performance  and  benchmark  results,  visit  hkp://www.intel.com/performance.   LIBFABRIC SCALABILITY *  Other  names  and  brands  may  be  claimed  as  the  property  of  others   Put – up to 61% improvement Get – within 2% Blocking Get/Put B/W SHMEM CRAY XC40
  • 14. 14 Tests  document  performance  of  components  on  a  parLcular  test,  in  specific   systems.  Differences  in  hardware,  so_ware,  or  configuraLon  will  affect  actual   performance.  Consult  other  sources  of  informaLon  to  evaluate  performance  as   you  consider  your  purchase.    For  more  complete  informaLon  about   performance  and  benchmark  results,  visit  hkp://www.intel.com/performance.   *  Other  names  and  brands  may  be  claimed  as  the  property  of  others   XPMEM Improved scalability GUPS Scaling slight improvement (lower is better) LIBFABRIC SCALABILITY NAS ISx (Integer Sort) weak scaling SHMEM CRAY XC40
  • 15. ADDRESSING THE OFI-PROVIDER GAP 15 Libfabric Framework libfabric  API   Components   templates,  lists,  rbtree,  hash   table,  free  pool,  ring  buffer,   stack,  …   Base  Class  Implementa-ons   fabric,  domain,  EQ,  wait  sets,   AV,  CQ,  …  SHM  primiLves   Provider  Services   •  Logging   •  Environment  variables   U-lity  Provider   Core  Provider   Interface  ‘extensions’   –  for  consistency   Assist  in  provider   development   Enhance  core   provider  
  • 16. UTILITY PROVIDER 16 Performance is a primary objective
  • 17. MOVING FORWARD 17 Beyond  HPC   Enterprise,  Cloud,  Storage  (NVM)   Stronger  engagement   with  these  communiLes   Beyond  Linux*   Sockets  –  TCP/UDP   NetworkDirect   Analyze requests to expand OFI community *  Other  names  and  brands  may  be  claimed  as  the  property  of   others  
  • 18. TARGET SCHEDULE 18 §  Driven  by  implementaLon  feedback   §  Improve  error  handling,  flow  control   §  Beker  support  for  non-­‐tradiLonal  fabrics   §  OpLmize  compleLon  handling   §  Address  deferred  features   2016   Q2   Q3   Q4   2017   Q2   Q3   Q4   RDM  over  DGRAM  ULl   RDM  over  MSG  ULl   Shared  Memory   New  Core  Providers   ABI  1.1   ULlity  provider  is   ongoing   TradiLonal  and  non-­‐tradiLonal   RDMA  providers  
  • 19. SUMMARY 19 § OFIWG development model working well § Interest in OFI and libfabric is high § Growing community § Significant effort being made to simplify the lives of developers • Applications and providers OFI  is  so   good  
  • 20. LEGAL DISCLAIMER & OPTIMIZATION NOTICE 20 §  No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non- infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. The products and services described may contain defects or errors known as errata which may cause deviations from published specifications. Current characterized errata are available on request. §  Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. §  Copyright © 2016, Intel Corporation. All rights reserved. Intel, Pentium, Xeon, Xeon Phi, Core, VTune, Cilk, and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries. §  *Other names and brands may be claimed as the property of others Optimization Notice Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804
  • 21. Thank you for your time! Sean Hefty sean.hefty@intel.com www.intel.com/hpcdevcon