SlideShare ist ein Scribd-Unternehmen logo
1 von 49
Downloaden Sie, um offline zu lesen
1
Optimization Techniques
with OpenVINO™ to
Enhance Performance on
Your Existing Hardware
Nico Galoppo, Principal Engineer
Ryan Loney, Technical Product Manager
The Challenge
The Challenge
© 2022 Intel 2
How do I Deploy My Great Neural Network?
© 2022 Intel 3
GPU?
© 2022 Intel 4
GPU?
Cloud?
© 2022 Intel 5
GPU? Cloud?
© 2022 Intel 6
Maybe just CPU?
© 2022 Intel 7
“But running on
CPU is slow…”
Almost Every Deep Learning Engineer
Is it?
© 2022 Intel 8
© 2022 Intel 9
The Solution
Visual
Inference
Optimization
and
Neural
Network
+
NLP, Audio...
© 2022 Intel
Built on the
foundation of
BUILD OPTIMIZE DEPLOY
3
2
1
Developer Journey
© 2022 Intel 11
Optimized Performance
CPU iGPU VPU
© 2022 Intel 12
www.openvino.ai
Installation Methods
© 2022 Intel 13
Model Optimizer
Get Your Model Run Model Optimizer
IR
For workloads and configurations visit www.intel.com/PerformanceIndex. Results may vary​.​
© 2022 Intel 14
For workloads and configurations please scan QR code. Results may vary​.​
0 25 50 75 100 125 150 175 200
Intel® Core™ i3-8100
Intel® Core™ i5-8500
Intel® Core™ i7-8700T
Intel® Core™ i7-1185G7 CPU-only
Intel® Core™ i7-1185G7 iGPU-only
Intel® Core™ i7-1185G7 CPU+iGPU
Throughput [FPS] – FP32
resnet_50_TF [224x224] yolo-v3-tiny-tf [416x416] deeplabv3-TF [513x513]
$ benchmark_app -m model_path -d device
15
Performance
For workloads and configurations visit www.intel.com/PerformanceIndex. Results may vary​.​
$ mo –-input_model model.onnx
--data_type FP32
Get Your Model Run Model Optimizer
IR
FP32
FP16
© 2022 Intel 16
Model Optimizer
Neural Network (any format)
Conv2D Add Relu6 Identity DepthwiseConv2DNative Mul Add Relu6 Identity
© 2022 Intel 17
Intermediate Representation (IR)
Convolution Add Clamp GroupConvolution Add Clamp
© 2022 Intel 18
Post-Training Optimization Tool (POT)
IR IR
Dataset Post-Training Optimization Tool
FP32 INT8
© 2022 Intel 19
© 2022 Intel 20
Neural Network Compression
Framework (NNCF)
Quantization-
Aware Training
Mixed-Precision
Quantization
Filter Pruning
© 2022 Intel 21
Neural Network Compression
Framework (NNCF)
Neural Network Compression
Framework (NNCF)
Quantization-
Aware Training
Mixed-Precision
Quantization
Filter Pruning
© 2022 Intel 22
OpenVINO Runtime
from openvino.runtime import Core
img = load_img()
core = Core()
model = core.read_model(model="model.xml", weights="model.bin")
compiled_model = core.compile_model(model=model, device_name="CPU")
output_layer = compiled_model.outputs[0]
result = compiled_model([img])[output_layer]
„dog”
© 2022 Intel 23
compiled_model = core.compile_model(model=model, device_name="CPU")
© 2022 Intel 24
Supported Devices
compiled_model = core.compile_model(model=model, device_name="CPU")
CPU
© 2022 Intel 25
Supported Devices
CPU
GPU
compiled_model = core.compile_model(model=model, device_name="CPU")
© 2022 Intel 26
Supported Devices
CPU
GPU
MYRIAD
compiled_model = core.compile_model(model=model, device_name="CPU")
© 2022 Intel 27
Supported Devices
compiled_model = core.compile_model(model=model, device_name="CPU")
CPU
GPU
MYRIAD
HETERO
CPU
GPU Convolution
Add
Clamp
GroupConvolution
Add
Clamp
Convolution
Add
Convolution
Add
© 2022 Intel
28
Supported Devices
compiled_model = core.compile_model(model=model, device_name="CPU")
CPU
GPU
MYRIAD
HETERO
MULTI
CPU
GPU Convolution
Add
Clamp
GroupConvolution
Add
Clamp
Convolution
Add
Convolution
Add
GPU Convolution
Add
Clamp
GroupConvolution
Add
Clamp
Convolution
Add
Convolution
Add
CPU Convolution
Add
Clamp
GroupConvolution
Add
Clamp
Convolution
Add
Convolution
Add
29
Supported Devices
Supported Devices
RUNTIME
GPU
CPU
APPLICATION
AUTO PLUGIN
CPU PLUGIN GPU PLUGIN
AUTO
AUTO chooses the device
AUTO sets device config
based on hints
AUTO handles the exec
logic on multiple devices
CPU
GPU
MYRIAD
AUTO
HETERO
MULTI
compiled_model = core.compile_model(model=model, device_name="CPU")
30
AUTO Device
INFERENCE ON GPU
INFERENCE ON CPU
INFERENCE
STOPS ON CPU
COMPILE NETWORK
FOR CPU
CPU
Workloads
GPU
Workloads
116.7 5,800 MS
TIME
APPLICATION LOGIC LOADING NN INFERENCE
DEVICE SELECTION APPLICATION INFERENCE ON AUTO
COMPILE / LOAD NETWORK FOR GPU
© 2022 Intel 31
Input Data with Variable Shape?
32
“What is the weather
going to be like today?”
How to handle variable input shapes?
33
Padding Multiple Precompiled
Models
Inefficient and Cumbersome!
Native Dynamic Shapes Support in OpenVINO™
34
Dynamic in input
image size
• Set-and-forget optimization knob – choose latency or throughput.
• Completely portable between the devices
• Throughput hint drives device-specific optimizations
• Even works for AUTO!
Portable Performance Hints
35
Automatic Batching
36
• How do you choose a good batch size?
• Let the OpenVINO™ runtime decide for you!
• No need to batch requests, runtime will do that too!
• Batching can improve throughput with select devices and models.
300+
Pre-Trained + Optimized Models
© 2022 Intel 37
Open Model Zoo
300+
Pre-Trained + Optimized Models
38
Open Model Zoo
300+
Pre-Trained + Optimized Models
© 2022 Intel 39
Open Model Zoo
300+
Pre-Trained + Optimized Models
© 2022 Intel 40
Open Model Zoo
300+
Pre-Trained + Optimized Models
© 2022 Intel 41
Open Model Zoo
300+
Pre-Trained + Optimized Models
© 2022 Intel 42
Open Model Zoo
300+
Pre-Trained + Optimized Models
© 2022 Intel 43
Open Model Zoo
Jupyter Notebooks
Try it yourself!
Learn more and download
at openvino.ai
Complete the Intel® Edge
AI Certification!
• Supercharge your career
• Approx 20 hours of video + quizzes +
coding
Deep Dive Session
Thursday, May 19
Intel AI Developer Expo—Let's Build Something Wonderful
Together
Session: 3:00 – 5:30 pm
Reception: 5:30 - 7:30 pm
Location: Room 209/210
Afternoon snack and post-session Reception will be
provided
Thank you!
Configuration Intel® Core™ i7-1185G7 Intel® Core™ i3-8100 Intel® Core™ i5-8500 Intel® Core™ i7-8700T
Motherboard Intel Corporation internal/
Reference Validation Platform
GIGABYTE* Z390 UD ASUS* PRIME Z370-A GIGABYTE* Z370M DS3H-CF
CPU Intel® Core™ i7-1185G7 @ 3.00GHz Intel® Core™ i3-8100 CPU @ 3.60GHz Intel® Core™ i5-8500 CPU @ 3.00GHz Intel® Core™ i7-8700T CPU @ 2.40GHz
Hyper Threading ON OFF OFF ON
Turbo Setting ON OFF ON ON
Memory 2 x 8 GB DDR4 3200MHz 4 x 8 GB DDR4 2400MHz 2 x 16 GB DDR4 2666MHz 4 x 16 GB DDR4 2400MHz
Operating System Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS
Kernel Version 5.8.0-05-generic 5.3.0-24-generic 5.3.0-24-generic 5.3.0-24-generic
BIOS Vendor Intel Corporation American Megatrends Inc.* American Megatrends Inc.* American Megatrends Inc.*
BIOS Version TGLSFWI1.R00.3425. A00.2010162309 F8 2401 F14c
BIOS Release 16-Oct-20 24-May-19 12-Jul-19 23-Mar-21
BIOS Settings Default Settings Select optimized default settings,
set OS type to “other”, save and exit
Select optimized default
settings, save and exit
Select optimized default settings, set
OS type to “other”, save and exit
Batch size 1 1 1 1
Precision FP32 FP32 FP32 FP32
Number of concurrent
inference requests
4 4 3 4
Test Date 18-Jun-21 18-Jun-21 18-Jun-21 18-Jun-21
Rated maximum TDP/socket in Watt 28 65 65 35
48
© 2022 Intel
Platform Configurations for
Performance Benchmarks
Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.
Performance results are based on testing as of dates shown in configurations and may not reflect all publicly
available updates. See backup for configuration details.
No product or component can be absolutely secure.
Intel technologies may require enabled hardware, software or service activation.
Your costs and results may vary.
Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See
Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in
applications that do not cause or contribute to a violation of an internationally recognized human right.
© Intel Corporation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its
subsidiaries. Other names and brands may be claimed as the property of others.
Notices and Disclaimers
49
© 2022 Intel

Weitere ähnliche Inhalte

Was ist angesagt?

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
NAVER Engineering
 
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
Ahmed Gad
 
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
Johan Andersson
 

Was ist angesagt? (20)

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
 
Graphics Gems from CryENGINE 3 (Siggraph 2013)
Graphics Gems from CryENGINE 3 (Siggraph 2013)Graphics Gems from CryENGINE 3 (Siggraph 2013)
Graphics Gems from CryENGINE 3 (Siggraph 2013)
 
Rendering Tech of Space Marine
Rendering Tech of Space MarineRendering Tech of Space Marine
Rendering Tech of Space Marine
 
Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureSupermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future
 
Developing and optimizing a procedural game: The Elder Scrolls Blades- Unite ...
Developing and optimizing a procedural game: The Elder Scrolls Blades- Unite ...Developing and optimizing a procedural game: The Elder Scrolls Blades- Unite ...
Developing and optimizing a procedural game: The Elder Scrolls Blades- Unite ...
 
Introduction to Transformer Model
Introduction to Transformer ModelIntroduction to Transformer Model
Introduction to Transformer Model
 
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
Introduction to Artificial Neural Networks (ANNs) - Step-by-Step Training & T...
 
Transformers in 2021
Transformers in 2021Transformers in 2021
Transformers in 2021
 
Khronos Munich 2018 - Halcyon and Vulkan
Khronos Munich 2018 - Halcyon and VulkanKhronos Munich 2018 - Halcyon and Vulkan
Khronos Munich 2018 - Halcyon and Vulkan
 
FrameGraph: Extensible Rendering Architecture in Frostbite
FrameGraph: Extensible Rendering Architecture in FrostbiteFrameGraph: Extensible Rendering Architecture in Frostbite
FrameGraph: Extensible Rendering Architecture in Frostbite
 
딥러닝 자연어처리 - RNN에서 BERT까지
딥러닝 자연어처리 - RNN에서 BERT까지딥러닝 자연어처리 - RNN에서 BERT까지
딥러닝 자연어처리 - RNN에서 BERT까지
 
Rendering Technologies from Crysis 3 (GDC 2013)
Rendering Technologies from Crysis 3 (GDC 2013)Rendering Technologies from Crysis 3 (GDC 2013)
Rendering Technologies from Crysis 3 (GDC 2013)
 
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
 
YOLO
YOLOYOLO
YOLO
 
Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666
 
Federated learning
Federated learningFederated learning
Federated learning
 
GPU Programming
GPU ProgrammingGPU Programming
GPU Programming
 
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
Terrain Rendering in Frostbite using Procedural Shader Splatting (Siggraph 2007)
 
Stable Diffusion path
Stable Diffusion pathStable Diffusion path
Stable Diffusion path
 
Secrets of CryENGINE 3 Graphics Technology
Secrets of CryENGINE 3 Graphics TechnologySecrets of CryENGINE 3 Graphics Technology
Secrets of CryENGINE 3 Graphics Technology
 

Ähnlich wie “Optimization Techniques with Intel’s OpenVINO to Enhance Performance on Your Existing Hardware,” a Presentation from Intel

“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
Edge AI and Vision Alliance
 
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
Edge AI and Vision Alliance
 
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
Edge AI and Vision Alliance
 
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОДКак выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
Nick Turunov
 

Ähnlich wie “Optimization Techniques with Intel’s OpenVINO to Enhance Performance on Your Existing Hardware,” a Presentation from Intel (20)

“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
“Smarter Manufacturing with Intel’s Deep Learning-Based Machine Vision,” a Pr...
 
Forts and Fights Scaling Performance on Unreal Engine*
Forts and Fights Scaling Performance on Unreal Engine*Forts and Fights Scaling Performance on Unreal Engine*
Forts and Fights Scaling Performance on Unreal Engine*
 
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
“Acceleration of Deep Learning Using OpenVINO: 3D Seismic Case Study,” a Pres...
 
Software Development Tools for Intel® IoT Platforms
Software Development Tools for Intel® IoT PlatformsSoftware Development Tools for Intel® IoT Platforms
Software Development Tools for Intel® IoT Platforms
 
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
“Intel Video AI Box—Converging AI, Media and Computing in a Compact and Open ...
 
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
 
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОДКак выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
Как выбрать оптимальную серверную архитектуру для создания высокоэффективных ЦОД
 
【視覺進化論】AI智慧視覺運算技術論壇_2_ChungYeh
【視覺進化論】AI智慧視覺運算技術論壇_2_ChungYeh【視覺進化論】AI智慧視覺運算技術論壇_2_ChungYeh
【視覺進化論】AI智慧視覺運算技術論壇_2_ChungYeh
 
Технологии Intel для виртуализации сетей операторов связи
Технологии Intel для виртуализации сетей операторов связиТехнологии Intel для виртуализации сетей операторов связи
Технологии Intel для виртуализации сетей операторов связи
 
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
 
Optimizing Direct X On Multi Core Architectures
Optimizing Direct X On Multi Core ArchitecturesOptimizing Direct X On Multi Core Architectures
Optimizing Direct X On Multi Core Architectures
 
Intel® Select Solutions for the Network
Intel® Select Solutions for the NetworkIntel® Select Solutions for the Network
Intel® Select Solutions for the Network
 
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSciStreamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
 
Media and entertainment workload comparison: HP Z8 vs. Apple Mac Pro
Media and entertainment workload comparison: HP Z8 vs. Apple Mac ProMedia and entertainment workload comparison: HP Z8 vs. Apple Mac Pro
Media and entertainment workload comparison: HP Z8 vs. Apple Mac Pro
 
Achieve Business-Class Performance With Intel vPro, an Intel Evo Design
Achieve Business-Class Performance With Intel vPro, an Intel Evo DesignAchieve Business-Class Performance With Intel vPro, an Intel Evo Design
Achieve Business-Class Performance With Intel vPro, an Intel Evo Design
 
Accelerate Your Game Development on Android*
Accelerate Your Game Development on Android*Accelerate Your Game Development on Android*
Accelerate Your Game Development on Android*
 
Intel Performance Workshop @ CES 2020
Intel Performance Workshop @ CES 2020Intel Performance Workshop @ CES 2020
Intel Performance Workshop @ CES 2020
 
Open Source Firmware - FrOSCon 2019
Open Source Firmware - FrOSCon 2019Open Source Firmware - FrOSCon 2019
Open Source Firmware - FrOSCon 2019
 
Intel 6th Gen vPro
Intel 6th Gen vProIntel 6th Gen vPro
Intel 6th Gen vPro
 
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ..."How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
 

Mehr von Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 

Mehr von Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 

“Optimization Techniques with Intel’s OpenVINO to Enhance Performance on Your Existing Hardware,” a Presentation from Intel

  • 1. 1 Optimization Techniques with OpenVINO™ to Enhance Performance on Your Existing Hardware Nico Galoppo, Principal Engineer Ryan Loney, Technical Product Manager
  • 3. How do I Deploy My Great Neural Network? © 2022 Intel 3
  • 7. Maybe just CPU? © 2022 Intel 7
  • 8. “But running on CPU is slow…” Almost Every Deep Learning Engineer Is it? © 2022 Intel 8
  • 9. © 2022 Intel 9 The Solution
  • 11. BUILD OPTIMIZE DEPLOY 3 2 1 Developer Journey © 2022 Intel 11
  • 12. Optimized Performance CPU iGPU VPU © 2022 Intel 12
  • 14. Model Optimizer Get Your Model Run Model Optimizer IR For workloads and configurations visit www.intel.com/PerformanceIndex. Results may vary​.​ © 2022 Intel 14
  • 15. For workloads and configurations please scan QR code. Results may vary​.​ 0 25 50 75 100 125 150 175 200 Intel® Core™ i3-8100 Intel® Core™ i5-8500 Intel® Core™ i7-8700T Intel® Core™ i7-1185G7 CPU-only Intel® Core™ i7-1185G7 iGPU-only Intel® Core™ i7-1185G7 CPU+iGPU Throughput [FPS] – FP32 resnet_50_TF [224x224] yolo-v3-tiny-tf [416x416] deeplabv3-TF [513x513] $ benchmark_app -m model_path -d device 15 Performance
  • 16. For workloads and configurations visit www.intel.com/PerformanceIndex. Results may vary​.​ $ mo –-input_model model.onnx --data_type FP32 Get Your Model Run Model Optimizer IR FP32 FP16 © 2022 Intel 16 Model Optimizer
  • 17. Neural Network (any format) Conv2D Add Relu6 Identity DepthwiseConv2DNative Mul Add Relu6 Identity © 2022 Intel 17
  • 18. Intermediate Representation (IR) Convolution Add Clamp GroupConvolution Add Clamp © 2022 Intel 18
  • 19. Post-Training Optimization Tool (POT) IR IR Dataset Post-Training Optimization Tool FP32 INT8 © 2022 Intel 19
  • 20. © 2022 Intel 20 Neural Network Compression Framework (NNCF)
  • 21. Quantization- Aware Training Mixed-Precision Quantization Filter Pruning © 2022 Intel 21 Neural Network Compression Framework (NNCF)
  • 22. Neural Network Compression Framework (NNCF) Quantization- Aware Training Mixed-Precision Quantization Filter Pruning © 2022 Intel 22
  • 23. OpenVINO Runtime from openvino.runtime import Core img = load_img() core = Core() model = core.read_model(model="model.xml", weights="model.bin") compiled_model = core.compile_model(model=model, device_name="CPU") output_layer = compiled_model.outputs[0] result = compiled_model([img])[output_layer] „dog” © 2022 Intel 23
  • 24. compiled_model = core.compile_model(model=model, device_name="CPU") © 2022 Intel 24 Supported Devices
  • 25. compiled_model = core.compile_model(model=model, device_name="CPU") CPU © 2022 Intel 25 Supported Devices
  • 26. CPU GPU compiled_model = core.compile_model(model=model, device_name="CPU") © 2022 Intel 26 Supported Devices
  • 27. CPU GPU MYRIAD compiled_model = core.compile_model(model=model, device_name="CPU") © 2022 Intel 27 Supported Devices
  • 28. compiled_model = core.compile_model(model=model, device_name="CPU") CPU GPU MYRIAD HETERO CPU GPU Convolution Add Clamp GroupConvolution Add Clamp Convolution Add Convolution Add © 2022 Intel 28 Supported Devices
  • 29. compiled_model = core.compile_model(model=model, device_name="CPU") CPU GPU MYRIAD HETERO MULTI CPU GPU Convolution Add Clamp GroupConvolution Add Clamp Convolution Add Convolution Add GPU Convolution Add Clamp GroupConvolution Add Clamp Convolution Add Convolution Add CPU Convolution Add Clamp GroupConvolution Add Clamp Convolution Add Convolution Add 29 Supported Devices
  • 30. Supported Devices RUNTIME GPU CPU APPLICATION AUTO PLUGIN CPU PLUGIN GPU PLUGIN AUTO AUTO chooses the device AUTO sets device config based on hints AUTO handles the exec logic on multiple devices CPU GPU MYRIAD AUTO HETERO MULTI compiled_model = core.compile_model(model=model, device_name="CPU") 30
  • 31. AUTO Device INFERENCE ON GPU INFERENCE ON CPU INFERENCE STOPS ON CPU COMPILE NETWORK FOR CPU CPU Workloads GPU Workloads 116.7 5,800 MS TIME APPLICATION LOGIC LOADING NN INFERENCE DEVICE SELECTION APPLICATION INFERENCE ON AUTO COMPILE / LOAD NETWORK FOR GPU © 2022 Intel 31
  • 32. Input Data with Variable Shape? 32 “What is the weather going to be like today?”
  • 33. How to handle variable input shapes? 33 Padding Multiple Precompiled Models Inefficient and Cumbersome!
  • 34. Native Dynamic Shapes Support in OpenVINO™ 34 Dynamic in input image size
  • 35. • Set-and-forget optimization knob – choose latency or throughput. • Completely portable between the devices • Throughput hint drives device-specific optimizations • Even works for AUTO! Portable Performance Hints 35
  • 36. Automatic Batching 36 • How do you choose a good batch size? • Let the OpenVINO™ runtime decide for you! • No need to batch requests, runtime will do that too! • Batching can improve throughput with select devices and models.
  • 37. 300+ Pre-Trained + Optimized Models © 2022 Intel 37 Open Model Zoo
  • 38. 300+ Pre-Trained + Optimized Models 38 Open Model Zoo
  • 39. 300+ Pre-Trained + Optimized Models © 2022 Intel 39 Open Model Zoo
  • 40. 300+ Pre-Trained + Optimized Models © 2022 Intel 40 Open Model Zoo
  • 41. 300+ Pre-Trained + Optimized Models © 2022 Intel 41 Open Model Zoo
  • 42. 300+ Pre-Trained + Optimized Models © 2022 Intel 42 Open Model Zoo
  • 43. 300+ Pre-Trained + Optimized Models © 2022 Intel 43 Open Model Zoo
  • 45. Try it yourself! Learn more and download at openvino.ai Complete the Intel® Edge AI Certification! • Supercharge your career • Approx 20 hours of video + quizzes + coding
  • 46. Deep Dive Session Thursday, May 19 Intel AI Developer Expo—Let's Build Something Wonderful Together Session: 3:00 – 5:30 pm Reception: 5:30 - 7:30 pm Location: Room 209/210 Afternoon snack and post-session Reception will be provided
  • 48. Configuration Intel® Core™ i7-1185G7 Intel® Core™ i3-8100 Intel® Core™ i5-8500 Intel® Core™ i7-8700T Motherboard Intel Corporation internal/ Reference Validation Platform GIGABYTE* Z390 UD ASUS* PRIME Z370-A GIGABYTE* Z370M DS3H-CF CPU Intel® Core™ i7-1185G7 @ 3.00GHz Intel® Core™ i3-8100 CPU @ 3.60GHz Intel® Core™ i5-8500 CPU @ 3.00GHz Intel® Core™ i7-8700T CPU @ 2.40GHz Hyper Threading ON OFF OFF ON Turbo Setting ON OFF ON ON Memory 2 x 8 GB DDR4 3200MHz 4 x 8 GB DDR4 2400MHz 2 x 16 GB DDR4 2666MHz 4 x 16 GB DDR4 2400MHz Operating System Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS Ubuntu* 18.04 LTS Kernel Version 5.8.0-05-generic 5.3.0-24-generic 5.3.0-24-generic 5.3.0-24-generic BIOS Vendor Intel Corporation American Megatrends Inc.* American Megatrends Inc.* American Megatrends Inc.* BIOS Version TGLSFWI1.R00.3425. A00.2010162309 F8 2401 F14c BIOS Release 16-Oct-20 24-May-19 12-Jul-19 23-Mar-21 BIOS Settings Default Settings Select optimized default settings, set OS type to “other”, save and exit Select optimized default settings, save and exit Select optimized default settings, set OS type to “other”, save and exit Batch size 1 1 1 1 Precision FP32 FP32 FP32 FP32 Number of concurrent inference requests 4 4 3 4 Test Date 18-Jun-21 18-Jun-21 18-Jun-21 18-Jun-21 Rated maximum TDP/socket in Watt 28 65 65 35 48 © 2022 Intel Platform Configurations for Performance Benchmarks
  • 49. Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex. Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates. See backup for configuration details. No product or component can be absolutely secure. Intel technologies may require enabled hardware, software or service activation. Your costs and results may vary. Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in applications that do not cause or contribute to a violation of an internationally recognized human right. © Intel Corporation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its subsidiaries. Other names and brands may be claimed as the property of others. Notices and Disclaimers 49 © 2022 Intel