SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
© 2020 MathWorks
Deploying Deep Learning Application
on FPGAs with MATLAB
Jack Erickson
Technical Marketing
September 2020
© 2020 MathWorks
Airborne Image
Analysis
Deep Learning Deployment on Embedded Devices
2
Autonomous Driving Industrial Inspection
Medical Image
Analysis
Wireless Modulation
Classification
Radar Signature
Classification
© 2020 MathWorks
System Requirements Drive Network Design
3
Industrial Inspection
Hardware/Software
Engineers
Systems
Engineer
Deep Learning
Practitioner
Camera specs
Accuracy
Latency
Cost
Power
© 2020 MathWorks
Challenges of Deploying Deep Learning to FPGA Hardware:
Convolution
Each stride is an 11x11x3 matrix multiply-accumulate
→105M floating-point multiply operations!
55
55
96
filters
stride=4
224
224
11x11
96 filters of 11x11x3 of 32-bit parameters →140k bytes
11x11
→1.16M bytes of activations
© 2020 MathWorks
Challenges of Deploying Deep Learning to FPGA Hardware
conv
1
conv
2
conv
3
conv
4
conv
5
fc6 fc7 fc8input Total
140K 1.2M 3.5M 5.2M 1.8M 148M 64M 16M
Parameters
(Bytes)
n/a 230 M
1.1M 728K 252K 252K 168K 16K 16K 4K
Activations
(Bytes)
588K 3.1 M
105M 223M 149M 112M 74M 37M 16M 4MFLOPs n/a 720 M
Off-chip RAM
Block RAM
DSP Slices
© 2020 MathWorks
Deploying Deep Learning to FPGA Hardware Requires
Collaboration
140K 1.2M 3.5M 5.2M 1.8M 148M 64M 16M
Parameters
(Bytes)
conv
1
conv
2
conv
3
conv
4
conv
5
fc6 fc7 fc8
n/a
input Total
230 M
1.1M 728K 252K 252K 168K 16K 16K 4K
Activations
(Bytes)
588K 3.1 M
105M 223M 149M 112M 74M 37M 16M 4MFLOPs n/a 720 M
Resize
Acquire
data
Output /
display
Mem i/f
Optimize
• Network /layers
• Fixed-point quantization
• Processor micro-architecture
© 2020 MathWorks
A Collaborative AI Workflow
Model design and
tuning
Hardware
accelerated training
Interoperability
AI Modeling
Integration with
complex systems
System verification
and validation
System simulation
System Design
Data cleansing and
preparation
Simulation-
generated data
Human insight
Data Preparation
7
Enterprise systems
Embedded devices
Edge, cloud,
desktop
Deployment
I Iteration and Refinement
© 2020 MathWorks
Design and Analyze Your Networks in MATLAB
Classification Learner app to try different classifiers
and find the best fit for your data set
Deep Network Designer app to build, visualize, and
edit deep learning networks
Model design and
tuning
Hardware
accelerated training
Interoperability
AI Modeling
8
© 2020 MathWorks
9
MATLAB Interoperates with Other AI Frameworks
Caffe importer
Keras importer
Model design and
tuning
Hardware
accelerated training
Interoperability
AI Modeling
© 2020 MathWorks
Application
logic
Deploy from MATLAB to a Variety of Hardware Platforms
10
FPGA
CPU
GPU Enterprise systems
Embedded devices
Edge, cloud,
desktop
Deployment
© 2020 MathWorks
FPGA Deployment from MATLAB
© 2020 MathWorks
Application
logic
Get Started Prototyping on FPGA with Deep Learning HDL
ToolboxTM
Hardware support package
Deep learning processor with I/O and
external memory interfaces
• Int8 or single precision
• Supported boards:
• Xilinx: ZCU102 or ZC706
• Intel: Arria10 SoC
• http://mathworks.com/hardware-support.html
FPGA Bitstream
Layer
control
instructions
Weights &
Activations
Fully
Connected
Module
Convolution
Module
Processor Control
Memory Access
Activations
Activations
Activations
© 2020 MathWorks
Defect Detection Example
Application
logic
Pre-processing: Extract
regions and resize
Post-processing:
Annotate and label
Inference: Predict
using trained network
FPGA
© 2020 MathWorks
Run Deep Learning on FPGA from MATLAB
14
© 2020 MathWorks
Application
logic
Prototyping: Design Exploration and Customization
15
Layer
control
instructions
Weights &
Activations
Re-train
© 2020 MathWorks
Design Exploration and Customization
16
© 2020 MathWorks
Optimizing Deep Learning Applications Requires
Collaboration
Systems
Engineer
Deep Learning
Practitioner
Hardware/Software
Engineers
Fully
Connected
Module
Convolution
Module
Processor Control
Memory Access
Activations
Activations
Activations
x
+/-
Σ
32
/
x
+/-
Σ
8
/
x
+/-
Σ
8
/
x
+/-
Σ
8
/
x
+/-
Σ
8
/
© 2020 MathWorks
INT8 Quantization
18
© 2020 MathWorks
Application
logic
Deep Learning HDL Toolbox
19
Layer
control
instructions
Weights &
Activations
Re-train
Modify network
% Create target object
hTarget = dlhdl.Target(…)
% Create workflow object, using the target
hW = dlhdl.Workflow(…);
% Compile the network
hW.compile;
% Program the bitstream and deploy the compiled network and weights
hW.deploy;
% Run prediction
[score, speed] = hW.predict(img, ‘Profile’, ‘on’);>> deepNetworkDesigner
Parameters Speed
140 MB 18 fps
84 MB 45 fps
68 MB 139 fps
Quantize
>> deepNetworkQuantizer
Generate
HDL
Iterate and Converge on Deep Learning FPGA Deployment from MATLAB
© 2020 MathWorks
Application
logic
Deploy from MATLAB to a Variety of Hardware Platforms
20
FPGA
CPU
GPU
Tune for system requirements
Prototype from MATLAB
Configure and generate RTL
Deep Learning HDL Toolbox
© 2020 MathWorks
Resource Slide
Deep Learning Solutions in MATLAB
https://www.mathworks.com/solutions/deep-learning.html
Deep Learning HDL Toolbox
https://www.mathworks.com/products/deep-learning-hdl.html
Onramp: Deep Learning in MATLAB
https://www.mathworks.com/learn/tutorials/deep-learning-onramp.html
MathWorks FPGA Solutions Page
https://www.mathworks.com/solutions/fpga-asic-soc-development.html
21

Weitere ähnliche Inhalte

Was ist angesagt?

Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. WysockiKernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
Anne Nicolas
 
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUsCreating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
Databricks
 
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
Edge AI and Vision Alliance
 
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"..."Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
Edge AI and Vision Alliance
 

Was ist angesagt? (20)

Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. WysockiKernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
Kernel Recipes 2018 - CPU Idle Loop Rework - Rafael J. Wysocki
 
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahGS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
 
cisco csr1000v
cisco csr1000vcisco csr1000v
cisco csr1000v
 
"Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging App...
"Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging App..."Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging App...
"Snapdragon Hybrid Computer Vision/Deep Learning Architecture for Imaging App...
 
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUsCreating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs
 
Luxoft A DXC Technology Company
Luxoft A DXC Technology CompanyLuxoft A DXC Technology Company
Luxoft A DXC Technology Company
 
GPU Acceleration for Containers on Intel Processor Graphics
GPU Acceleration for Containers on Intel Processor GraphicsGPU Acceleration for Containers on Intel Processor Graphics
GPU Acceleration for Containers on Intel Processor Graphics
 
Kubernetes Failure Stories - KubeCon Europe Barcelona
Kubernetes Failure Stories - KubeCon Europe BarcelonaKubernetes Failure Stories - KubeCon Europe Barcelona
Kubernetes Failure Stories - KubeCon Europe Barcelona
 
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayStrata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
 
PCCC22:日本AMD株式会社 テーマ1「第4世代AMD EPYC™ プロセッサー (Genoa) の概要」
PCCC22:日本AMD株式会社 テーマ1「第4世代AMD EPYC™ プロセッサー (Genoa) の概要」PCCC22:日本AMD株式会社 テーマ1「第4世代AMD EPYC™ プロセッサー (Genoa) の概要」
PCCC22:日本AMD株式会社 テーマ1「第4世代AMD EPYC™ プロセッサー (Genoa) の概要」
 
ABCI: AI Bridging Cloud Infrastructure for Scalable AI/Big Data
ABCI: AI Bridging Cloud Infrastructure for Scalable AI/Big DataABCI: AI Bridging Cloud Infrastructure for Scalable AI/Big Data
ABCI: AI Bridging Cloud Infrastructure for Scalable AI/Big Data
 
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
 
Cheat Sheet
Cheat SheetCheat Sheet
Cheat Sheet
 
PAN-OS - Network Security/Prevention Everywhere
PAN-OS - Network Security/Prevention EverywherePAN-OS - Network Security/Prevention Everywhere
PAN-OS - Network Security/Prevention Everywhere
 
Intel® QuickAssist Technology Introduction, Applications, and Lab, Including ...
Intel® QuickAssist Technology Introduction, Applications, and Lab, Including ...Intel® QuickAssist Technology Introduction, Applications, and Lab, Including ...
Intel® QuickAssist Technology Introduction, Applications, and Lab, Including ...
 
Zero Trust for Private 5G and Edge
Zero Trust for Private 5G and EdgeZero Trust for Private 5G and Edge
Zero Trust for Private 5G and Edge
 
Introduction to CUDA
Introduction to CUDAIntroduction to CUDA
Introduction to CUDA
 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
 
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"..."Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
"Emerging Processor Architectures for Deep Learning: Options and Trade-offs,"...
 
End-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNXEnd-to-End Deep Learning Deployment with ONNX
End-to-End Deep Learning Deployment with ONNX
 

Ähnlich wie “Deploying Deep Learning Applications on FPGAs with MATLAB,” a Presentation from MathWorks

"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems..."Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
Edge AI and Vision Alliance
 
Altera’s Role In Accelerating the Internet of Things
Altera’s Role In Accelerating the Internet of ThingsAltera’s Role In Accelerating the Internet of Things
Altera’s Role In Accelerating the Internet of Things
Altera Corporation
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 

Ähnlich wie “Deploying Deep Learning Applications on FPGAs with MATLAB,” a Presentation from MathWorks (20)

Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
 
DATE 2020: Design, Automation and Test in Europe Conference
DATE 2020: Design, Automation and Test in Europe ConferenceDATE 2020: Design, Automation and Test in Europe Conference
DATE 2020: Design, Automation and Test in Europe Conference
 
Connectivity for the data centric era
Connectivity for the data centric eraConnectivity for the data centric era
Connectivity for the data centric era
 
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
Brain in the Cloud: Machine Learning on OpenStack & Kubernetes Done Right - E...
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentationSS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
 
NGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentationNGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentation
 
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems..."Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfv
 
Altera’s Role In Accelerating the Internet of Things
Altera’s Role In Accelerating the Internet of ThingsAltera’s Role In Accelerating the Internet of Things
Altera’s Role In Accelerating the Internet of Things
 
Resume
ResumeResume
Resume
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
 
Intels presentation at blue line industrial computer seminar
Intels presentation at blue line industrial computer seminarIntels presentation at blue line industrial computer seminar
Intels presentation at blue line industrial computer seminar
 
Santhosh Resume
Santhosh ResumeSanthosh Resume
Santhosh Resume
 
POLYTEDA: Power DRC/LVS, June 2017
POLYTEDA: Power DRC/LVS, June 2017POLYTEDA: Power DRC/LVS, June 2017
POLYTEDA: Power DRC/LVS, June 2017
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)
 
Introduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, IntelIntroduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, Intel
 
Altera FPGA Marketing Report
Altera FPGA Marketing ReportAltera FPGA Marketing Report
Altera FPGA Marketing Report
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected EraWhite Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
 

Mehr von Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 

Mehr von Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Kürzlich hochgeladen (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

“Deploying Deep Learning Applications on FPGAs with MATLAB,” a Presentation from MathWorks

  • 1. © 2020 MathWorks Deploying Deep Learning Application on FPGAs with MATLAB Jack Erickson Technical Marketing September 2020
  • 2. © 2020 MathWorks Airborne Image Analysis Deep Learning Deployment on Embedded Devices 2 Autonomous Driving Industrial Inspection Medical Image Analysis Wireless Modulation Classification Radar Signature Classification
  • 3. © 2020 MathWorks System Requirements Drive Network Design 3 Industrial Inspection Hardware/Software Engineers Systems Engineer Deep Learning Practitioner Camera specs Accuracy Latency Cost Power
  • 4. © 2020 MathWorks Challenges of Deploying Deep Learning to FPGA Hardware: Convolution Each stride is an 11x11x3 matrix multiply-accumulate →105M floating-point multiply operations! 55 55 96 filters stride=4 224 224 11x11 96 filters of 11x11x3 of 32-bit parameters →140k bytes 11x11 →1.16M bytes of activations
  • 5. © 2020 MathWorks Challenges of Deploying Deep Learning to FPGA Hardware conv 1 conv 2 conv 3 conv 4 conv 5 fc6 fc7 fc8input Total 140K 1.2M 3.5M 5.2M 1.8M 148M 64M 16M Parameters (Bytes) n/a 230 M 1.1M 728K 252K 252K 168K 16K 16K 4K Activations (Bytes) 588K 3.1 M 105M 223M 149M 112M 74M 37M 16M 4MFLOPs n/a 720 M Off-chip RAM Block RAM DSP Slices
  • 6. © 2020 MathWorks Deploying Deep Learning to FPGA Hardware Requires Collaboration 140K 1.2M 3.5M 5.2M 1.8M 148M 64M 16M Parameters (Bytes) conv 1 conv 2 conv 3 conv 4 conv 5 fc6 fc7 fc8 n/a input Total 230 M 1.1M 728K 252K 252K 168K 16K 16K 4K Activations (Bytes) 588K 3.1 M 105M 223M 149M 112M 74M 37M 16M 4MFLOPs n/a 720 M Resize Acquire data Output / display Mem i/f Optimize • Network /layers • Fixed-point quantization • Processor micro-architecture
  • 7. © 2020 MathWorks A Collaborative AI Workflow Model design and tuning Hardware accelerated training Interoperability AI Modeling Integration with complex systems System verification and validation System simulation System Design Data cleansing and preparation Simulation- generated data Human insight Data Preparation 7 Enterprise systems Embedded devices Edge, cloud, desktop Deployment I Iteration and Refinement
  • 8. © 2020 MathWorks Design and Analyze Your Networks in MATLAB Classification Learner app to try different classifiers and find the best fit for your data set Deep Network Designer app to build, visualize, and edit deep learning networks Model design and tuning Hardware accelerated training Interoperability AI Modeling 8
  • 9. © 2020 MathWorks 9 MATLAB Interoperates with Other AI Frameworks Caffe importer Keras importer Model design and tuning Hardware accelerated training Interoperability AI Modeling
  • 10. © 2020 MathWorks Application logic Deploy from MATLAB to a Variety of Hardware Platforms 10 FPGA CPU GPU Enterprise systems Embedded devices Edge, cloud, desktop Deployment
  • 11. © 2020 MathWorks FPGA Deployment from MATLAB
  • 12. © 2020 MathWorks Application logic Get Started Prototyping on FPGA with Deep Learning HDL ToolboxTM Hardware support package Deep learning processor with I/O and external memory interfaces • Int8 or single precision • Supported boards: • Xilinx: ZCU102 or ZC706 • Intel: Arria10 SoC • http://mathworks.com/hardware-support.html FPGA Bitstream Layer control instructions Weights & Activations Fully Connected Module Convolution Module Processor Control Memory Access Activations Activations Activations
  • 13. © 2020 MathWorks Defect Detection Example Application logic Pre-processing: Extract regions and resize Post-processing: Annotate and label Inference: Predict using trained network FPGA
  • 14. © 2020 MathWorks Run Deep Learning on FPGA from MATLAB 14
  • 15. © 2020 MathWorks Application logic Prototyping: Design Exploration and Customization 15 Layer control instructions Weights & Activations Re-train
  • 16. © 2020 MathWorks Design Exploration and Customization 16
  • 17. © 2020 MathWorks Optimizing Deep Learning Applications Requires Collaboration Systems Engineer Deep Learning Practitioner Hardware/Software Engineers Fully Connected Module Convolution Module Processor Control Memory Access Activations Activations Activations x +/- Σ 32 / x +/- Σ 8 / x +/- Σ 8 / x +/- Σ 8 / x +/- Σ 8 /
  • 18. © 2020 MathWorks INT8 Quantization 18
  • 19. © 2020 MathWorks Application logic Deep Learning HDL Toolbox 19 Layer control instructions Weights & Activations Re-train Modify network % Create target object hTarget = dlhdl.Target(…) % Create workflow object, using the target hW = dlhdl.Workflow(…); % Compile the network hW.compile; % Program the bitstream and deploy the compiled network and weights hW.deploy; % Run prediction [score, speed] = hW.predict(img, ‘Profile’, ‘on’);>> deepNetworkDesigner Parameters Speed 140 MB 18 fps 84 MB 45 fps 68 MB 139 fps Quantize >> deepNetworkQuantizer Generate HDL Iterate and Converge on Deep Learning FPGA Deployment from MATLAB
  • 20. © 2020 MathWorks Application logic Deploy from MATLAB to a Variety of Hardware Platforms 20 FPGA CPU GPU Tune for system requirements Prototype from MATLAB Configure and generate RTL Deep Learning HDL Toolbox
  • 21. © 2020 MathWorks Resource Slide Deep Learning Solutions in MATLAB https://www.mathworks.com/solutions/deep-learning.html Deep Learning HDL Toolbox https://www.mathworks.com/products/deep-learning-hdl.html Onramp: Deep Learning in MATLAB https://www.mathworks.com/learn/tutorials/deep-learning-onramp.html MathWorks FPGA Solutions Page https://www.mathworks.com/solutions/fpga-asic-soc-development.html 21