SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
© 2020 The Khronos Group
Khronos Standard APIs for
Accelerating Vision and Inferencing
Neil Trevett
Khronos President
NVIDIA VP Developer Ecosystems
22nd September 2020
© 2020 The Khronos Group
Khronos Connects Software to Silicon
3D graphics, XR, parallel
programming, vision acceleration
and machine learning
Non-profit, member-driven
standards-defining industry
consortium
Open to any
interested company
All Khronos standards
are royalty-free
Well-defined IP Framework
protects participant’s
intellectual property
Founded in 2000
>150 Members ~ 40% US, 30% Europe, 30% Asia
Open interoperability standards to enable software to effectively
harness the power of 3D and multiprocessor acceleration
2
© 2020 The Khronos Group
Khronos Active Initiatives
3D Graphics
Desktop, Mobile, Web
Embedded and Safety Critical
3D Assets
Authoring
and Delivery
Portable XR
Augmented and
Virtual Reality
Parallel Computation
Vision, Inferencing,
Machine Learning
3
© 2020 The Khronos Group
Khronos Compute Acceleration Standards
Increasing industry
interest in parallel
compute acceleration
to combat the ‘End of
Moore’s Law’
GPU
GPU rendering +
compute
acceleration
Heterogeneous
compute
acceleration
Single source C++ programming
with compute acceleration
Graph-based vision and
inferencing acceleration
Lower-level APIs
Direct Hardware Control
Intermediate
Representation (IR)
supporting parallel
execution and
graphics
Higher-level
Languages and APIs
Streamlined development and
performance portability
GPU
FPGA DSP
Custom Hardware
GPU
CPU
CPU
CPU
AI/Tensor HW
Hardware
4
© 2020 The Khronos Group
Sensor
Data
Training
Data
Trained
Networks
Neural Network
Training
C++ Application
Code
Embedded Vision and Inferencing Acceleration
Compilation Ingestion
FPGA
DSP
Dedicated
Hardware
GPU
Vision / Inferencing
Engine
Compiled
Code
Hardware Acceleration APIs
Diverse Embedded Hardware
(GPUs, DSPs, FPGAs)
Applications link to compiled
inferencing code or call
vision/inferencing API
Networks trained on high-end
desktop and cloud systems
5
© 2020 The Khronos Group
NNEF Neural Network Exchange Format
Training Framework 1
Training Framework 2
Training Framework 3
Inference Engine 1
Inference Engine 2
Inference Engine 3
Every Inferencing Engine needs a custom importer
from every Framework
Before - Training and Inferencing Fragmentation
After - NN Training and Inferencing Interoperability
Training Framework 1
Training Framework 2
Training Framework 3
Inference Engine 1
Inference Engine 2
Inference Engine 3
Common optimization
and processing tools
6
© 2020 The Khronos Group
NNEF and ONNX
NNEF V1.0 released in August 2018
After positive industry feedback on Provisional Specification.
Maintenance update issued in September 2019
Extensions to V1.0 released for expanded functionality
NNEF Working Group Participants
ONNX 1.6 Released in September 2019
Introduced support for Quantization
ONNX Runtime being integrated with GPU inferencing engines
such as NVIDIA TensorRT
ONNX Supporters
Embedded Inferencing Import Training Interchange
Defined Specification Open Source Project
Multi-company Governance at Khronos Initiated by Facebook & Microsoft
Stability for hardware deployment Software stack flexibility
ONNX and NNEF
are Complementary
ONNX moves quickly to track authoring
framework updates
NNEF provides a stable bridge from
training into edge inferencing engines
7
© 2020 The Khronos Group
NNEF Open Source Tools Ecosystem
Files
Caffe and
Caffe2
Import/Export
TensorFlow and
TensorFlow Lite
Import/Export
NNEF open source projects hosted on Khronos NNEF
GitHub repository under Apache 2.0
https://github.com/KhronosGroup/NNEF-Tools
ONNX
Import/Export
Syntax
Parser and
Validator
OpenVX
Ingestion and
Execution
NNEF Model Zoo
Now available on GitHub. Useful for
checking that ingested NNEF produces
acceptable results on target system
Compound operations
captured by exporting
graph Python script
NNEF adopts a rigorous approach to
design lifecycle
Especially important for safety-critical or
mission-critical applications in automotive,
industrial and infrastructure markets
8
© 2020 The Khronos Group
SYCL Single Source C++ Parallel Programming
GPU
FPGA DSP
Custom Hardware
GPU
CPU
CPU
CPU
Standard C++
Application
Code
C++
Libraries
ML
Frameworks
C++ Template
Libraries
C++ Template
Libraries
C++ Template
Libraries
SYCL Compiler
for OpenCL
CPU
Compiler
CPU
SYCL-BLAS, SYCL-DNN,
SYCL-Eigen,
SYCL Parallel STL
C++ templates and lambda
functions separate host &
accelerated device code
Accelerated code
passed into device
OpenCL compilers
Complex ML frameworks
can be directly compiled
and accelerated
SYCL is ideal for accelerating larger
C++-based engines and applications
with performance portability
C++ Kernel Fusion can give
better performance on
complex apps and libs than
hand-coding
AI/Tensor HW
9
© 2020 The Khronos Group
SYCL Implementations
Multiple Backends in Development
SYCL beginning to be supported on multiple
low-level APIs in addition to OpenCL
e.g. ROCm and CUDA
For more information: http://sycl.tech
SYCL enables Khronos to influence
ISO C++ to (eventually) support
heterogeneous compute
SYCL
Source Code
DPC++
Uses LLVM/Clang
Part of oneAPI
ComputeCpp
SYCL 1.2.1 on
multiple hardware
triSYCL
Open source
test bed
hipSYCL
SYCL 1.2.1 on
CUDA & HIP/ROCm
Any CPU
OpenCL +
SPIR-V
Any CPU
OpenCL +
SPIR(-V)
OpenCL+PTX
Intel CPUs
Intel GPUs
Intel FPGAs
Intel CPUs
Intel GPUs
Intel FPGAs
AMD GPUs
(depends on driver stack)
Arm Mali
IMG PowerVR
Renesas R-Car
NVIDIA GPUs
OpenMP
OpenCL +
SPIR/LLVM
XILINX FPGAs
POCL
(open source OpenCL supporting
CPUs and NVIDIA GPUs and more)
Any CPU
Experimental
OpenMP
ROCm
CUDA
AMD GPUs
NVIDIA GPUs
Any CPU
CUDA+PTX
NVIDIA GPUs
SYCL, OpenCL and SPIR-V, as open industry
standards, enable flexible integration and
deployment of multiple acceleration technologies
10
© 2020 The Khronos Group
OpenVX Cross-Vendor Vision and Inferencing
Vision
Node
Vision
Node
Vision
Node
Downstream
Application
Processing
Native
Camera
Control CNN Nodes
NNEF Translator converts NNEF
representation into OpenVX Node Graphs
OpenVX
High-level graph-based abstraction for portable, efficient vision processing
Graph can contain vision processing and NN nodes – enables global optimizations
Optimized OpenVX drivers created, optimized and shipped by processor vendors
Implementable on almost any hardware or processor with performance portability
Run-time graph execution need very little host CPU interaction
Performance comparable to hand-optimized, non-portable code
Real, complex applications on real, complex hardware
Much lower development effort than hand-optimized code
Hardware Implementations
OpenVX Graph
11
© 2020 The Khronos Group
OpenVX 1.3 Released October 2019
Deployment Flexibility through Feature Sets
Conformant Implementations ship one or more complete feature sets
Enables market-focused Implementations
- Baseline Graph Infrastructure (enables other Feature Sets)
- Default Vision Functions
- Enhanced Vision Functions (introduced in OpenVX 1.2)
- Neural Network Inferencing (including tensor objects)
- NNEF Kernel import (including tensor objects)
- Binary Images
- Safety Critical (reduced features for easier safety certification)
https://www.khronos.org/registry/OpenVX/specs/1.3/html/OpenVX_Specification_1_3.html
Functionality Consolidation into Core
Neural Net Extension, NNEF Kernel Import,
Safety Critical etc.
Open Source Conformance Test Suite
https://github.com/KhronosGroup/OpenVX-cts/tree/openvx_1.3
OpenCL Interop
Custom accelerated Nodes
OpenCL Command Queue
Application
cl_mem buffers
Fully asynchronous host-device
operations during data exchange
OpenVX data objects
Runtime
Runtime Map or copy OpenVX data objects
into cl_mem buffers
Copy or export
cl_mem buffers into OpenVX data
objects
OpenVX user-kernels can access command queue
and cl_mem objects to asynchronously schedule
OpenCL kernel execution
OpenVX/OpenCL Interop
12
© 2020 The Khronos Group
Open Source OpenVX & Samples
Open Source OpenVX Tutorial and Code Samples
https://github.com/rgiduthuri/openvx_tutorial
https://github.com/KhronosGroup/openvx-samples
Fully Conformant Open Source OpenVX 1.3
for Raspberry Pi
https://github.com/KhronosGroup/OpenVX-sample-impl/tree/openvx_1.3
Raspberry Pi 3 and 4 Model B with Raspbian OS
Memory access optimization via tiling/chaining
Highly optimized kernels on multimedia instruction set
Automatic parallelization for multicore CPUs and GPUs
Automatic merging of common kernel sequences
13
© 2020 The Khronos Group
OpenCL is Widely Deployed and Used
Accelerated Implementations
Modo
Desktop Creative Apps
CLBlast
SYCL-BLAS
Linear Algebra
Libraries
Parallel
Languages
Math and Physics
Libraries
Vision, Imaging
and Video Libraries
The industry’s most pervasive, cross-vendor, open standard
for low-level heterogeneous parallel programming
Arm Compute Library
SYCL-DNN
Machine Learning
Libraries and Frameworks
TI DL Library (TIDL)
VeriSilicon
Xiaomi
clDNN
Intel
Intel
Synopsis
MetaWare EV
NNAPI
https://en.wikipedia.org/wiki/List_of_OpenCL_applications
Vegas Pro
ForceBalance
Molecular Modelling Libraries
Machine Learning
Compilers
14
© 2020 The Khronos Group
OpenCL – Low-level Parallel Programing
Complements GPU-only APIs
Simpler programming model
Relatively lightweight run-time
More language flexibility, e.g. pointers
Rigorously defined numeric precision
OpenCL
Kernel
Code
OpenCL
Kernel
Code
OpenCL
Kernel
Code
OpenCL C
Kernel
Code
GPU
DSP
CPU
CPU
FPGA
OpenCL
Devices
Host
CPU
NN HW
Runtime OpenCL API to
compile, load and execute
kernels across devices
Programming and Runtime Framework
for Application Acceleration
Offload compute-intensive kernels onto parallel
heterogeneous processors
CPUs, GPUs, DSPs, FPGAs, Tensor Processors
OpenCL C or C++ kernel languages
Platform Layer API
Query, select and initialize compute devices
Runtime API
Build and execute kernels programs on multiple devices
Explicit Application Control
Which programs execute on what device
Where data is stored in memories in the system
When programs are run, and what operations are
dependent on earlier operations
15
© 2020 The Khronos Group
OpenCL 3.0
OpenCL C:
- kernels,
- address spaces,
- special types,
...
Most of C++17:
- inheritance,
- templates,
- type deduction,
...
C++ for OpenCL
Increased Ecosystem Flexibility
All functionality beyond OpenCL 1.2 queryable plus
macros for optional OpenCL C language features
New extensions that become widely adopted will be
integrated into new OpenCL core specifications
OpenCL C++ for OpenCL
Open source C++ for OpenCL front end compiler
combines OpenCL C and C++17 replacing
OpenCL C++ language specification
Unified Specification
All versions of OpenCL in one specification for easier
maintenance, evolution and accessibility
Source on Khronos GitHub for community feedback,
functionality requests and bug fixes
Moving Applications to OpenCL 3.0
OpenCL 1.2 applications – no change
OpenCL 2.X applications - no code changes if all used
functionality is present
Queries recommended for future portability
C++ for OpenCL
Supported by Clang and uses the LLVM
compiler infrastructure
OpenCL C code is valid and fully compatible
Supports most C++17 features
Generates SPIR-V kernels
16
© 2020 The Khronos Group
Google Ports TensorFlow Lite to OpenCL
OpenCL providing ~2x inferencing
speedup over OpenGL ES
acceleration
TensorFlow Lite uses OpenGL ES as a
backup if OpenCL not available …
…but most mobile GPU vendors
provide an OpenCL drivers - even if
not exposed directly to Android
developers
OpenCL is increasingly used as
acceleration target for higher-level
framework and compilers
17
© 2020 The Khronos Group
Primary Machine Learning Compilers
Import Formats
Caffe, Keras,
MXNet, ONNX
TensorFlow Graph,
MXNet, PaddlePaddle,
Keras, ONNX
PyTorch, ONNX
TensorFlow Graph,
PyTorch, ONNX
Front-end / IR NNVM / Relay IR nGraph / Stripe IR Glow Core / Glow IR XLA HLO
Output
OpenCL, LLVM,
CUDA, Metal
OpenCL,
LLVM, CUDA
OpenCL
LLVM
LLVM, TPU IR, XLA IR
TensorFlow Lite / NNAPI
(inc. HW accel)
18
© 2020 The Khronos Group
ML Compiler Steps
1.Import Trained
Network Description
2. Apply graph-level
optimizations e.g. node fusion,
node lowering and memory tiling
3. Decompose to primitive
instructions and emit programs
for accelerated run-times
Consistent Steps
Fast progress but still area of intense research
If compiler optimizations are effective - hardware accelerator APIs can stay ‘simple’ and
won’t need complex metacommands (e.g. combined primitive commands like DirectML)
19
© 2020 The Khronos Group
Google MLIR and IREE Compilers
MLIR
Multi-level Intermediate Representation
Format and library of compiler utilities that sits
between the trained model representation and
low-level compilers/executors that generate
hardware-specific code
IREE
Intermediate Representation
Execution Environment
Lowers and optimizes ML models for real-time
accelerated inferencing on mobile/edge
heterogeneous hardware
Contains scheduling logic to communicate data
dependencies to low-level parallel pipelined
hardware/APIs like Vulkan, and execution logic
to encode dense computation in the form of
hardware/API-specific binaries like SPIR-V
IREE is a research project today. Google is working with Khronos
working groups to explore how SPIR-V code can provide effective
inferencing acceleration on APIs such as Vulkan through SPIR-V
Trained Models
Generate Hardware
Specific Binaries
Optimizes and Lowers
for Acceleration
20
© 2020 The Khronos Group
SPIR-V Language Ecosystem
OpenCL C
C++ for OpenCL
clspv
triSYCL
Intel DPC++
Codeplay
ComputeCpp
LLVM
Clang
SYCL
SPIR-V LLVM
IR Translator
Khronos Open Source
3rd Party Open Source
Language Definitions
Closed Source
Environment Specs
OpenCL Vulkan
OpenCLon12
Inc. Mesa SPIR-V to DXIL
SPIRV-Cross
GLSL
HLSL
Metal
Shading
Language
glslang
GLSL
HLSL DXC
DXIL
SPIR-V Tools
(Dis)Assembler
Validator
Optimize/Remap
Fuzzer
Reducer
OpenCL C
Online
Compilation
SPIR-V enables a rich ecosystem of languages and compilers to
target low-level APIs such as Vulkan and OpenCL, including
deployment flexibility: e.g. running OpenCL C kernels on Vulkan
IREE
21
© 2020 The Khronos Group
Khronos for Global Industry Collaboration
Khronos membership is open
to any company
Influence the design and direction
of key open standards that will
drive your business
Accelerate time-to-market with
early access to specification drafts
Provide industry thought
leadership and gain insights into
industry trends and directions
Benefit from Adopter discounts
www.khronos.org/members/
ntrevett@nvidia.com | @neilt3d
22
© 2020 The Khronos Group
Resources
• Khronos Website and home page for all Khronos Standards
• https://www.khronos.org/
• OpenCL Resources and C++ for OpenCL documentation
• https://www.khronos.org/opencl/resources
• https://github.com/KhronosGroup/Khronosdotorg/blob/master/api/opencl/assets/CXX_for_OpenCL.pdf
• OpenVX Tutorial, Samples and Sample Implementation
• https://github.com/rgiduthuri/openvx_tutorial
• https://github.com/KhronosGroup/openvx-samples
• https://github.com/KhronosGroup/OpenVX-sample-impl/tree/openvx_1.3
• NNEF Tools
• https://github.com/KhronosGroup/NNEF-Tools
• SYCL Resources
• http://sycl.tech
• SPIR-V User Guide
• https://github.com/KhronosGroup/SPIRV-Guide
• MLIR Blog
• https://blog.tensorflow.org/2019/04/mlir-new-intermediate-representation.html
• IREE GitHub Repository
• https://google.github.io/iree/
23

Weitere ähnliche Inhalte

Was ist angesagt?

IoT Meetup HiveMQ and MQTT
IoT Meetup HiveMQ and MQTTIoT Meetup HiveMQ and MQTT
IoT Meetup HiveMQ and MQTTIan Skerrett
 
Open Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAPOpen Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAPLiz Warner
 
Introducing the Vitis Unified Software Platform for Programming FPGAs
Introducing the Vitis Unified Software Platform for Programming FPGAsIntroducing the Vitis Unified Software Platform for Programming FPGAs
Introducing the Vitis Unified Software Platform for Programming FPGAsinside-BigData.com
 
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...Cisco Service Provider
 
Edge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesEdge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesVirtualTech Japan Inc.
 
Telefónica Edge Computing Case Study
Telefónica Edge Computing Case StudyTelefónica Edge Computing Case Study
Telefónica Edge Computing Case StudyDavid Artuñedo
 
Edge Computing: Bringing the Internet Closer to You
Edge Computing: Bringing the Internet Closer to YouEdge Computing: Bringing the Internet Closer to You
Edge Computing: Bringing the Internet Closer to YouMegan O'Keefe
 
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...Edge AI and Vision Alliance
 
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ..."2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...Edge AI and Vision Alliance
 
LFI18-Solving the challenges of commissioning a wireless lighting infrastruc...
LFI18-Solving the challenges of commissioning  a wireless lighting infrastruc...LFI18-Solving the challenges of commissioning  a wireless lighting infrastruc...
LFI18-Solving the challenges of commissioning a wireless lighting infrastruc...Silvair
 
Virtualized Transport for Edge Computing Services
Virtualized Transport for Edge Computing ServicesVirtualized Transport for Edge Computing Services
Virtualized Transport for Edge Computing ServicesSigal Biran-Nagar
 
Orchestrating, operationalizing, monetizing SDN/NFV enabled networks
Orchestrating, operationalizing, monetizing SDN/NFV enabled networksOrchestrating, operationalizing, monetizing SDN/NFV enabled networks
Orchestrating, operationalizing, monetizing SDN/NFV enabled networksEricsson
 
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...Mistral Solutions
 
Hey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima MukkamalaHey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima Mukkamalagogo6
 
Mr. Tam Kin Lui's presentation at QITCOM 2011
Mr. Tam Kin Lui's presentation at QITCOM 2011Mr. Tam Kin Lui's presentation at QITCOM 2011
Mr. Tam Kin Lui's presentation at QITCOM 2011QITCOM
 
Ericsson introduces a hyperscale cloud solution
Ericsson introduces a hyperscale cloud solutionEricsson introduces a hyperscale cloud solution
Ericsson introduces a hyperscale cloud solutionEricsson
 
5 tips for open ran success
5 tips for open ran success5 tips for open ran success
5 tips for open ran successLuke Wang
 

Was ist angesagt? (20)

IoT Meetup HiveMQ and MQTT
IoT Meetup HiveMQ and MQTTIoT Meetup HiveMQ and MQTT
IoT Meetup HiveMQ and MQTT
 
Open Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAPOpen Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAP
 
Introducing the Vitis Unified Software Platform for Programming FPGAs
Introducing the Vitis Unified Software Platform for Programming FPGAsIntroducing the Vitis Unified Software Platform for Programming FPGAs
Introducing the Vitis Unified Software Platform for Programming FPGAs
 
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...
Cisco Service Provider Vision and Strategy: Business Transforming Through Inn...
 
Edge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesEdge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and Kubernetes
 
Telefónica Edge Computing Case Study
Telefónica Edge Computing Case StudyTelefónica Edge Computing Case Study
Telefónica Edge Computing Case Study
 
Edge Computing: Bringing the Internet Closer to You
Edge Computing: Bringing the Internet Closer to YouEdge Computing: Bringing the Internet Closer to You
Edge Computing: Bringing the Internet Closer to You
 
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...
“Challenges and Approaches for Cascaded DNNs: A Case Study of Face Detection ...
 
5G Enablers and Use Cases, an European Pespective
5G Enablers and Use Cases, an European Pespective5G Enablers and Use Cases, an European Pespective
5G Enablers and Use Cases, an European Pespective
 
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ..."2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
"2D and 3D Sensing: Markets, Applications, and Technologies," a Presentation ...
 
5 g
5 g5 g
5 g
 
LFI18-Solving the challenges of commissioning a wireless lighting infrastruc...
LFI18-Solving the challenges of commissioning  a wireless lighting infrastruc...LFI18-Solving the challenges of commissioning  a wireless lighting infrastruc...
LFI18-Solving the challenges of commissioning a wireless lighting infrastruc...
 
Virtualized Transport for Edge Computing Services
Virtualized Transport for Edge Computing ServicesVirtualized Transport for Edge Computing Services
Virtualized Transport for Edge Computing Services
 
Orchestrating, operationalizing, monetizing SDN/NFV enabled networks
Orchestrating, operationalizing, monetizing SDN/NFV enabled networksOrchestrating, operationalizing, monetizing SDN/NFV enabled networks
Orchestrating, operationalizing, monetizing SDN/NFV enabled networks
 
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...
IoT based Industrial Gateway (IoT-SDK) built around Sitara™ AM437x processors...
 
Hey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima MukkamalaHey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima Mukkamala
 
Mr. Tam Kin Lui's presentation at QITCOM 2011
Mr. Tam Kin Lui's presentation at QITCOM 2011Mr. Tam Kin Lui's presentation at QITCOM 2011
Mr. Tam Kin Lui's presentation at QITCOM 2011
 
Ericsson introduces a hyperscale cloud solution
Ericsson introduces a hyperscale cloud solutionEricsson introduces a hyperscale cloud solution
Ericsson introduces a hyperscale cloud solution
 
5 tips for open ran success
5 tips for open ran success5 tips for open ran success
5 tips for open ran success
 
Industrial IoT bootcamp
Industrial IoT bootcampIndustrial IoT bootcamp
Industrial IoT bootcamp
 

Ähnlich wie “Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentation from the Khronos Group

"Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P..."Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P...Edge AI and Vision Alliance
 
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...Edge AI and Vision Alliance
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...Edge AI and Vision Alliance
 
OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021The Khronos Group Inc.
 
Coscup2018 itri android-in-cloud
Coscup2018 itri android-in-cloudCoscup2018 itri android-in-cloud
Coscup2018 itri android-in-cloudTian-Jian Wu
 
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...Edge AI and Vision Alliance
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductTyrone Systems
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...Edge AI and Vision Alliance
 
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...AWS Summits
 
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option..."APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...Edge AI and Vision Alliance
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteLinaro
 
OpenShift 4 installation
OpenShift 4 installationOpenShift 4 installation
OpenShift 4 installationRobert Bohne
 
Software Tools for Building Industry 4.0 Applications
Software Tools for Building Industry 4.0 ApplicationsSoftware Tools for Building Industry 4.0 Applications
Software Tools for Building Industry 4.0 ApplicationsPankesh Patel
 
Containers Anywhere with OpenShift by Red Hat
Containers Anywhere with OpenShift by Red HatContainers Anywhere with OpenShift by Red Hat
Containers Anywhere with OpenShift by Red HatAmazon Web Services
 
Canonical Ubuntu OpenStack Overview Presentation
Canonical Ubuntu OpenStack Overview PresentationCanonical Ubuntu OpenStack Overview Presentation
Canonical Ubuntu OpenStack Overview PresentationThe World Bank
 
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono..."The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...Edge AI and Vision Alliance
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloLinaro
 
Transtec nice webinar v2
Transtec nice webinar v2Transtec nice webinar v2
Transtec nice webinar v2Vincent Pfleger
 
La visualisation 3D distante sans compromis avec NICE DCV
La visualisation 3D distante sans compromis avec NICE DCVLa visualisation 3D distante sans compromis avec NICE DCV
La visualisation 3D distante sans compromis avec NICE DCVCyril Baudillon
 

Ähnlich wie “Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentation from the Khronos Group (20)

"Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P..."Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P...
 
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
 
OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021
 
Coscup2018 itri android-in-cloud
Coscup2018 itri android-in-cloudCoscup2018 itri android-in-cloud
Coscup2018 itri android-in-cloud
 
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel Product
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
 
SYCL 2020 Specification
SYCL 2020 SpecificationSYCL 2020 Specification
SYCL 2020 Specification
 
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
 
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option..."APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
 
OpenShift 4 installation
OpenShift 4 installationOpenShift 4 installation
OpenShift 4 installation
 
Software Tools for Building Industry 4.0 Applications
Software Tools for Building Industry 4.0 ApplicationsSoftware Tools for Building Industry 4.0 Applications
Software Tools for Building Industry 4.0 Applications
 
Containers Anywhere with OpenShift by Red Hat
Containers Anywhere with OpenShift by Red HatContainers Anywhere with OpenShift by Red Hat
Containers Anywhere with OpenShift by Red Hat
 
Canonical Ubuntu OpenStack Overview Presentation
Canonical Ubuntu OpenStack Overview PresentationCanonical Ubuntu OpenStack Overview Presentation
Canonical Ubuntu OpenStack Overview Presentation
 
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono..."The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...
"The Vision API Maze: Options and Trade-offs," a Presentation from the Khrono...
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
Transtec nice webinar v2
Transtec nice webinar v2Transtec nice webinar v2
Transtec nice webinar v2
 
La visualisation 3D distante sans compromis avec NICE DCV
La visualisation 3D distante sans compromis avec NICE DCVLa visualisation 3D distante sans compromis avec NICE DCV
La visualisation 3D distante sans compromis avec NICE DCV
 

Mehr von Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

Mehr von Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Kürzlich hochgeladen

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Kürzlich hochgeladen (20)

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentation from the Khronos Group

  • 1. © 2020 The Khronos Group Khronos Standard APIs for Accelerating Vision and Inferencing Neil Trevett Khronos President NVIDIA VP Developer Ecosystems 22nd September 2020
  • 2. © 2020 The Khronos Group Khronos Connects Software to Silicon 3D graphics, XR, parallel programming, vision acceleration and machine learning Non-profit, member-driven standards-defining industry consortium Open to any interested company All Khronos standards are royalty-free Well-defined IP Framework protects participant’s intellectual property Founded in 2000 >150 Members ~ 40% US, 30% Europe, 30% Asia Open interoperability standards to enable software to effectively harness the power of 3D and multiprocessor acceleration 2
  • 3. © 2020 The Khronos Group Khronos Active Initiatives 3D Graphics Desktop, Mobile, Web Embedded and Safety Critical 3D Assets Authoring and Delivery Portable XR Augmented and Virtual Reality Parallel Computation Vision, Inferencing, Machine Learning 3
  • 4. © 2020 The Khronos Group Khronos Compute Acceleration Standards Increasing industry interest in parallel compute acceleration to combat the ‘End of Moore’s Law’ GPU GPU rendering + compute acceleration Heterogeneous compute acceleration Single source C++ programming with compute acceleration Graph-based vision and inferencing acceleration Lower-level APIs Direct Hardware Control Intermediate Representation (IR) supporting parallel execution and graphics Higher-level Languages and APIs Streamlined development and performance portability GPU FPGA DSP Custom Hardware GPU CPU CPU CPU AI/Tensor HW Hardware 4
  • 5. © 2020 The Khronos Group Sensor Data Training Data Trained Networks Neural Network Training C++ Application Code Embedded Vision and Inferencing Acceleration Compilation Ingestion FPGA DSP Dedicated Hardware GPU Vision / Inferencing Engine Compiled Code Hardware Acceleration APIs Diverse Embedded Hardware (GPUs, DSPs, FPGAs) Applications link to compiled inferencing code or call vision/inferencing API Networks trained on high-end desktop and cloud systems 5
  • 6. © 2020 The Khronos Group NNEF Neural Network Exchange Format Training Framework 1 Training Framework 2 Training Framework 3 Inference Engine 1 Inference Engine 2 Inference Engine 3 Every Inferencing Engine needs a custom importer from every Framework Before - Training and Inferencing Fragmentation After - NN Training and Inferencing Interoperability Training Framework 1 Training Framework 2 Training Framework 3 Inference Engine 1 Inference Engine 2 Inference Engine 3 Common optimization and processing tools 6
  • 7. © 2020 The Khronos Group NNEF and ONNX NNEF V1.0 released in August 2018 After positive industry feedback on Provisional Specification. Maintenance update issued in September 2019 Extensions to V1.0 released for expanded functionality NNEF Working Group Participants ONNX 1.6 Released in September 2019 Introduced support for Quantization ONNX Runtime being integrated with GPU inferencing engines such as NVIDIA TensorRT ONNX Supporters Embedded Inferencing Import Training Interchange Defined Specification Open Source Project Multi-company Governance at Khronos Initiated by Facebook & Microsoft Stability for hardware deployment Software stack flexibility ONNX and NNEF are Complementary ONNX moves quickly to track authoring framework updates NNEF provides a stable bridge from training into edge inferencing engines 7
  • 8. © 2020 The Khronos Group NNEF Open Source Tools Ecosystem Files Caffe and Caffe2 Import/Export TensorFlow and TensorFlow Lite Import/Export NNEF open source projects hosted on Khronos NNEF GitHub repository under Apache 2.0 https://github.com/KhronosGroup/NNEF-Tools ONNX Import/Export Syntax Parser and Validator OpenVX Ingestion and Execution NNEF Model Zoo Now available on GitHub. Useful for checking that ingested NNEF produces acceptable results on target system Compound operations captured by exporting graph Python script NNEF adopts a rigorous approach to design lifecycle Especially important for safety-critical or mission-critical applications in automotive, industrial and infrastructure markets 8
  • 9. © 2020 The Khronos Group SYCL Single Source C++ Parallel Programming GPU FPGA DSP Custom Hardware GPU CPU CPU CPU Standard C++ Application Code C++ Libraries ML Frameworks C++ Template Libraries C++ Template Libraries C++ Template Libraries SYCL Compiler for OpenCL CPU Compiler CPU SYCL-BLAS, SYCL-DNN, SYCL-Eigen, SYCL Parallel STL C++ templates and lambda functions separate host & accelerated device code Accelerated code passed into device OpenCL compilers Complex ML frameworks can be directly compiled and accelerated SYCL is ideal for accelerating larger C++-based engines and applications with performance portability C++ Kernel Fusion can give better performance on complex apps and libs than hand-coding AI/Tensor HW 9
  • 10. © 2020 The Khronos Group SYCL Implementations Multiple Backends in Development SYCL beginning to be supported on multiple low-level APIs in addition to OpenCL e.g. ROCm and CUDA For more information: http://sycl.tech SYCL enables Khronos to influence ISO C++ to (eventually) support heterogeneous compute SYCL Source Code DPC++ Uses LLVM/Clang Part of oneAPI ComputeCpp SYCL 1.2.1 on multiple hardware triSYCL Open source test bed hipSYCL SYCL 1.2.1 on CUDA & HIP/ROCm Any CPU OpenCL + SPIR-V Any CPU OpenCL + SPIR(-V) OpenCL+PTX Intel CPUs Intel GPUs Intel FPGAs Intel CPUs Intel GPUs Intel FPGAs AMD GPUs (depends on driver stack) Arm Mali IMG PowerVR Renesas R-Car NVIDIA GPUs OpenMP OpenCL + SPIR/LLVM XILINX FPGAs POCL (open source OpenCL supporting CPUs and NVIDIA GPUs and more) Any CPU Experimental OpenMP ROCm CUDA AMD GPUs NVIDIA GPUs Any CPU CUDA+PTX NVIDIA GPUs SYCL, OpenCL and SPIR-V, as open industry standards, enable flexible integration and deployment of multiple acceleration technologies 10
  • 11. © 2020 The Khronos Group OpenVX Cross-Vendor Vision and Inferencing Vision Node Vision Node Vision Node Downstream Application Processing Native Camera Control CNN Nodes NNEF Translator converts NNEF representation into OpenVX Node Graphs OpenVX High-level graph-based abstraction for portable, efficient vision processing Graph can contain vision processing and NN nodes – enables global optimizations Optimized OpenVX drivers created, optimized and shipped by processor vendors Implementable on almost any hardware or processor with performance portability Run-time graph execution need very little host CPU interaction Performance comparable to hand-optimized, non-portable code Real, complex applications on real, complex hardware Much lower development effort than hand-optimized code Hardware Implementations OpenVX Graph 11
  • 12. © 2020 The Khronos Group OpenVX 1.3 Released October 2019 Deployment Flexibility through Feature Sets Conformant Implementations ship one or more complete feature sets Enables market-focused Implementations - Baseline Graph Infrastructure (enables other Feature Sets) - Default Vision Functions - Enhanced Vision Functions (introduced in OpenVX 1.2) - Neural Network Inferencing (including tensor objects) - NNEF Kernel import (including tensor objects) - Binary Images - Safety Critical (reduced features for easier safety certification) https://www.khronos.org/registry/OpenVX/specs/1.3/html/OpenVX_Specification_1_3.html Functionality Consolidation into Core Neural Net Extension, NNEF Kernel Import, Safety Critical etc. Open Source Conformance Test Suite https://github.com/KhronosGroup/OpenVX-cts/tree/openvx_1.3 OpenCL Interop Custom accelerated Nodes OpenCL Command Queue Application cl_mem buffers Fully asynchronous host-device operations during data exchange OpenVX data objects Runtime Runtime Map or copy OpenVX data objects into cl_mem buffers Copy or export cl_mem buffers into OpenVX data objects OpenVX user-kernels can access command queue and cl_mem objects to asynchronously schedule OpenCL kernel execution OpenVX/OpenCL Interop 12
  • 13. © 2020 The Khronos Group Open Source OpenVX & Samples Open Source OpenVX Tutorial and Code Samples https://github.com/rgiduthuri/openvx_tutorial https://github.com/KhronosGroup/openvx-samples Fully Conformant Open Source OpenVX 1.3 for Raspberry Pi https://github.com/KhronosGroup/OpenVX-sample-impl/tree/openvx_1.3 Raspberry Pi 3 and 4 Model B with Raspbian OS Memory access optimization via tiling/chaining Highly optimized kernels on multimedia instruction set Automatic parallelization for multicore CPUs and GPUs Automatic merging of common kernel sequences 13
  • 14. © 2020 The Khronos Group OpenCL is Widely Deployed and Used Accelerated Implementations Modo Desktop Creative Apps CLBlast SYCL-BLAS Linear Algebra Libraries Parallel Languages Math and Physics Libraries Vision, Imaging and Video Libraries The industry’s most pervasive, cross-vendor, open standard for low-level heterogeneous parallel programming Arm Compute Library SYCL-DNN Machine Learning Libraries and Frameworks TI DL Library (TIDL) VeriSilicon Xiaomi clDNN Intel Intel Synopsis MetaWare EV NNAPI https://en.wikipedia.org/wiki/List_of_OpenCL_applications Vegas Pro ForceBalance Molecular Modelling Libraries Machine Learning Compilers 14
  • 15. © 2020 The Khronos Group OpenCL – Low-level Parallel Programing Complements GPU-only APIs Simpler programming model Relatively lightweight run-time More language flexibility, e.g. pointers Rigorously defined numeric precision OpenCL Kernel Code OpenCL Kernel Code OpenCL Kernel Code OpenCL C Kernel Code GPU DSP CPU CPU FPGA OpenCL Devices Host CPU NN HW Runtime OpenCL API to compile, load and execute kernels across devices Programming and Runtime Framework for Application Acceleration Offload compute-intensive kernels onto parallel heterogeneous processors CPUs, GPUs, DSPs, FPGAs, Tensor Processors OpenCL C or C++ kernel languages Platform Layer API Query, select and initialize compute devices Runtime API Build and execute kernels programs on multiple devices Explicit Application Control Which programs execute on what device Where data is stored in memories in the system When programs are run, and what operations are dependent on earlier operations 15
  • 16. © 2020 The Khronos Group OpenCL 3.0 OpenCL C: - kernels, - address spaces, - special types, ... Most of C++17: - inheritance, - templates, - type deduction, ... C++ for OpenCL Increased Ecosystem Flexibility All functionality beyond OpenCL 1.2 queryable plus macros for optional OpenCL C language features New extensions that become widely adopted will be integrated into new OpenCL core specifications OpenCL C++ for OpenCL Open source C++ for OpenCL front end compiler combines OpenCL C and C++17 replacing OpenCL C++ language specification Unified Specification All versions of OpenCL in one specification for easier maintenance, evolution and accessibility Source on Khronos GitHub for community feedback, functionality requests and bug fixes Moving Applications to OpenCL 3.0 OpenCL 1.2 applications – no change OpenCL 2.X applications - no code changes if all used functionality is present Queries recommended for future portability C++ for OpenCL Supported by Clang and uses the LLVM compiler infrastructure OpenCL C code is valid and fully compatible Supports most C++17 features Generates SPIR-V kernels 16
  • 17. © 2020 The Khronos Group Google Ports TensorFlow Lite to OpenCL OpenCL providing ~2x inferencing speedup over OpenGL ES acceleration TensorFlow Lite uses OpenGL ES as a backup if OpenCL not available … …but most mobile GPU vendors provide an OpenCL drivers - even if not exposed directly to Android developers OpenCL is increasingly used as acceleration target for higher-level framework and compilers 17
  • 18. © 2020 The Khronos Group Primary Machine Learning Compilers Import Formats Caffe, Keras, MXNet, ONNX TensorFlow Graph, MXNet, PaddlePaddle, Keras, ONNX PyTorch, ONNX TensorFlow Graph, PyTorch, ONNX Front-end / IR NNVM / Relay IR nGraph / Stripe IR Glow Core / Glow IR XLA HLO Output OpenCL, LLVM, CUDA, Metal OpenCL, LLVM, CUDA OpenCL LLVM LLVM, TPU IR, XLA IR TensorFlow Lite / NNAPI (inc. HW accel) 18
  • 19. © 2020 The Khronos Group ML Compiler Steps 1.Import Trained Network Description 2. Apply graph-level optimizations e.g. node fusion, node lowering and memory tiling 3. Decompose to primitive instructions and emit programs for accelerated run-times Consistent Steps Fast progress but still area of intense research If compiler optimizations are effective - hardware accelerator APIs can stay ‘simple’ and won’t need complex metacommands (e.g. combined primitive commands like DirectML) 19
  • 20. © 2020 The Khronos Group Google MLIR and IREE Compilers MLIR Multi-level Intermediate Representation Format and library of compiler utilities that sits between the trained model representation and low-level compilers/executors that generate hardware-specific code IREE Intermediate Representation Execution Environment Lowers and optimizes ML models for real-time accelerated inferencing on mobile/edge heterogeneous hardware Contains scheduling logic to communicate data dependencies to low-level parallel pipelined hardware/APIs like Vulkan, and execution logic to encode dense computation in the form of hardware/API-specific binaries like SPIR-V IREE is a research project today. Google is working with Khronos working groups to explore how SPIR-V code can provide effective inferencing acceleration on APIs such as Vulkan through SPIR-V Trained Models Generate Hardware Specific Binaries Optimizes and Lowers for Acceleration 20
  • 21. © 2020 The Khronos Group SPIR-V Language Ecosystem OpenCL C C++ for OpenCL clspv triSYCL Intel DPC++ Codeplay ComputeCpp LLVM Clang SYCL SPIR-V LLVM IR Translator Khronos Open Source 3rd Party Open Source Language Definitions Closed Source Environment Specs OpenCL Vulkan OpenCLon12 Inc. Mesa SPIR-V to DXIL SPIRV-Cross GLSL HLSL Metal Shading Language glslang GLSL HLSL DXC DXIL SPIR-V Tools (Dis)Assembler Validator Optimize/Remap Fuzzer Reducer OpenCL C Online Compilation SPIR-V enables a rich ecosystem of languages and compilers to target low-level APIs such as Vulkan and OpenCL, including deployment flexibility: e.g. running OpenCL C kernels on Vulkan IREE 21
  • 22. © 2020 The Khronos Group Khronos for Global Industry Collaboration Khronos membership is open to any company Influence the design and direction of key open standards that will drive your business Accelerate time-to-market with early access to specification drafts Provide industry thought leadership and gain insights into industry trends and directions Benefit from Adopter discounts www.khronos.org/members/ ntrevett@nvidia.com | @neilt3d 22
  • 23. © 2020 The Khronos Group Resources • Khronos Website and home page for all Khronos Standards • https://www.khronos.org/ • OpenCL Resources and C++ for OpenCL documentation • https://www.khronos.org/opencl/resources • https://github.com/KhronosGroup/Khronosdotorg/blob/master/api/opencl/assets/CXX_for_OpenCL.pdf • OpenVX Tutorial, Samples and Sample Implementation • https://github.com/rgiduthuri/openvx_tutorial • https://github.com/KhronosGroup/openvx-samples • https://github.com/KhronosGroup/OpenVX-sample-impl/tree/openvx_1.3 • NNEF Tools • https://github.com/KhronosGroup/NNEF-Tools • SYCL Resources • http://sycl.tech • SPIR-V User Guide • https://github.com/KhronosGroup/SPIRV-Guide • MLIR Blog • https://blog.tensorflow.org/2019/04/mlir-new-intermediate-representation.html • IREE GitHub Repository • https://google.github.io/iree/ 23