1
Jeff Gehlhaar, Vice President, Qualcomm Research
May 12, 2015
Deep-learning-based visual perception
in mobile and embedded devices:
Opportunities and challenges
2
Qualcomm Research:
Transforming the future of mobile technology
Research, Prototype, Standardize
3
Vision for cognitive computing
More intuitive devices and things
4
Key elements of “Cognition”
Perception: Hear, See, Classify
Reasoning: Concepts, Relationships, Infer context
Action: Anticipate, Plan, Autonomous
5
On the road to a “Cognitive Platform”
Rich Connectivity
• Integrated modem & AP
• Adaptive RF front end
• LTE broadcast & service focused modem features
• Tightly integrated Wi-Fi/BT
• Leading location / GPS
Heterogeneous Computing
• Fully customized architecture
• Superior performance at low power consumption
• Highly optimized for cutting-edge cognitive capabilities
On-device Intelligence
• On-device machine learning
• Computer vision
• Behavioral analysis
• Sensor processing and classification algorithms
• Natural language processing
On-device capabilities
• Visual Perception
• Speech & Audio Understanding
• Natural Interactions
• Intelligent Connectivity
• Immersive Multimedia
• Intuitive Security
• Always On Awareness
6
On-device visual perception is key
• Democratizing robotics to assist us in daily lives
• Revolutionizing transportation with autonomous cars
• Contextualizing your environment through scene understanding
7
Why fully on-device matters
Process data closest to the source, complement cloud
• Reliability
• Efficient use of network bandwidth
• Low latency
• Security and user privacy
8
Deep learning solves visual perception
• Qualcomm Technologies, Inc. has been applying machine learning to mobile for many years
• Deep learning for visual perception
  • Provides best-in-class solutions
  • Traditionally a cloud-only solution, not available on mobile (until now)
  • Presents many implementation challenges
• Our mobile-focused platform goes beyond deep learning to include RNNs and other strategies
  • Applications: security, handwriting, natural language processing, etc.
[Figure: deep network with convolution (C) layers, pooling, a fully connected layer, and the result]
9
Challenges to enabling deep-learning-based visual perception on mobile
10
Typical computing environment for deep learning
• Performance: teraflops
• Memory bandwidth: 100s of GB/s
• Storage: 10s of GB of RAM
• Power: 100s of watts
Best-in-class server-based visual perception models require about 2B MAC (multiply-accumulate) operations per image
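To put the per-image MAC figure in context, here is a minimal sketch of how MAC counts are usually tallied per layer. The layer shapes and the 30 frames-per-second target are illustrative assumptions, not values from the slides.

```python
def conv_macs(out_h, out_w, in_ch, out_ch, k_h, k_w):
    """MACs for one convolution layer: one k_h*k_w*in_ch dot product per output value."""
    return out_h * out_w * out_ch * in_ch * k_h * k_w


def fc_macs(in_features, out_features):
    """MACs for one fully connected layer: one multiply-accumulate per weight."""
    return in_features * out_features


def required_macs_per_second(macs_per_image, frames_per_second):
    """Sustained MAC rate needed to process a video stream."""
    return macs_per_image * frames_per_second


# Hypothetical layer shapes, chosen only to show the arithmetic.
print(conv_macs(55, 55, 3, 96, 11, 11))   # an 11x11 convolution over RGB input
print(fc_macs(4096, 1000))                # a classifier layer

# Assumed: the ~2B MACs per image from the slide, at an assumed 30 frames per second.
print(required_macs_per_second(2_000_000_000, 30))
```

Convolution layers tend to dominate the MAC count, while fully connected layers tend to dominate the weight count, which is why the two are treated separately in the compression slides.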
11
Supporting deep learning on-device is a major challenge
Constrained mobile environment
• Battery powered
• Power and thermal efficiency
• Storage and memory bandwidth limitations
Visual perception workloads
• Compute intensive
• Large and complicated neural network models
12
Solving the challenge of on-device visual perception
Within the power and thermal constraints of mobile devices
13
Scene understanding video
14
Robot face tracking video
https://www.youtube.com/watch?v=0D9I0SBGAPY
15
Efficient execution on mobile SoCs
Key to deep learning on mobile is an efficient execution environment that considers all aspects of the SoC combined with efficient library implementations
• Careful analysis of deep learning tradeoffs
  • Consider the impact of different network architectures
  • Focus on cache performance, data locality, DRAM utilization efficiency
• Focus on parallelism and heterogeneity
  • Take advantage of heterogeneous computing frameworks (e.g. Qualcomm MARE)
  • Span execution across Qualcomm® Snapdragon™ CPU, DSP, and GPU
• Focus on underlying optimizations
  • Convolutions implemented as highly efficient matrix multiply operations
  • Smart buffer management for GPU and fixed bit-width optimizations for DSP
  • Optimized matrix multiply for Snapdragon processors [1]
    • 6X faster than Eigen
1. Results are based on Snapdragon 805 processor and Eigen 3.2.2
Qualcomm Snapdragon and Qualcomm Multicore Asynchronous Runtime Environment are products of Qualcomm Technologies, Inc.
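The idea of running convolutions as matrix multiplies can be sketched with the common "im2col" layout: each input patch is unrolled into a row, each filter into a column, and one matrix product yields every output. This is a generic pure-Python illustration under simplifying assumptions (single channel, stride 1, no padding); it is not the optimized Snapdragon implementation the slide refers to, and all function names are hypothetical.

```python
def im2col(image, k):
    """Unroll every k x k patch (stride 1, no padding) into one row."""
    height, width = len(image), len(image[0])
    return [
        [image[i + di][j + dj] for di in range(k) for dj in range(k)]
        for i in range(height - k + 1)
        for j in range(width - k + 1)
    ]


def matmul(a, b):
    """Plain matrix product of a (m x n) and b (n x p)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def conv_as_matmul(image, filters):
    """Apply all filters at once: (patches x k*k) times (k*k x num_filters)."""
    k = len(filters[0])
    flat_filters = [[f[di][dj] for di in range(k) for dj in range(k)] for f in filters]
    weight_matrix = [list(col) for col in zip(*flat_filters)]
    return matmul(im2col(image, k), weight_matrix)


def conv_direct(image, filt):
    """Reference sliding-window convolution (no kernel flip) for one filter."""
    k = len(filt)
    height, width = len(image), len(image[0])
    return [
        sum(image[i + di][j + dj] * filt[di][dj] for di in range(k) for dj in range(k))
        for i in range(height - k + 1)
        for j in range(width - k + 1)
    ]


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
filters = [[[1, 0], [0, 1]], [[1, 1], [1, 1]]]
result = conv_as_matmul(image, filters)

# Each column of the matrix product matches the direct convolution for that filter.
for idx, filt in enumerate(filters):
    assert [row[idx] for row in result] == conv_direct(image, filt)
print(result)
```

The unrolled patch matrix duplicates input values, so this layout trades extra memory traffic for the ability to reuse a single tuned matrix-multiply kernel.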
16
Reducing model size through compression
Goal
Reduce both the physical size and the number of MACs required at equivalent precision
• Use available memory bandwidth and computation effectively, which improves power efficiency
• Smaller size permits in-field model upgrades and improvements
[Figure: deep network with convolution (C) layers, pooling, a “Fully Convolution” layer, and the result]
Qualcomm Technologies, Inc. approach
• Initial SVD approach based on a paper by Denton et al. of NYU [1]
• The Qualcomm Technologies, Inc. approach involves replacing single layers with multiple layers
• Approach permits fine-tuning all layers, not just layers above compressed layers
Results
• Up to a 10X reduction in physical model size
• Up to a 35% reduction in the number of MAC operations with minimal loss of precision
1. “Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation”, arXiv:1404.0736 [cs.CV]
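Replacing a single layer with multiple layers can be illustrated with a generic low-rank factorization: if a weight matrix W factors as U times V, one large layer becomes two thin layers with fewer weights and MACs. The sketch below is an assumption-laden illustration of that general idea, not the specific scheme from the slides or the cited paper; the matrices, the 4096-wide layer, and the rank of 256 are hypothetical.

```python
def matvec(matrix, vector):
    """Apply a fully connected layer (no bias): matrix times vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def matmul(a, b):
    """Plain matrix product of a (m x n) and b (n x p)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def dense_cost(out_dim, in_dim):
    """Weights (and MACs per input) of one out_dim x in_dim layer."""
    return out_dim * in_dim


def low_rank_cost(out_dim, in_dim, rank):
    """Weights (and MACs per input) of a rank x in_dim layer followed by an out_dim x rank layer."""
    return rank * (in_dim + out_dim)


# A rank-1 weight matrix, so the factorization is exact in this toy case.
U = [[1], [2], [3]]      # 3 x 1
V = [[1, 0, 2, 1]]       # 1 x 4
W = matmul(U, V)         # 3 x 4

x = [1, 2, 3, 4]
one_layer = matvec(W, x)
two_layers = matvec(U, matvec(V, x))
assert one_layer == two_layers

# Hypothetical sizes: a 4096 x 4096 layer approximated at rank 256.
print(dense_cost(4096, 4096), low_rank_cost(4096, 4096, 256))
```

Real weight matrices are only approximately low rank, so a truncated factorization introduces error; fine-tuning the factored network afterward is one way to recover accuracy.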
17
Size compression and error rate impact
Fully connected layer compression significantly impacts physical network size
[Chart: Original Network vs. FC Layer Compressed vs. FC and Conv Layer Compressed]
• 10X size reduction
• ~1 percentage point loss in top-5 error
18
MAC compression and error rate impact
Convolutional layer compression significantly impacts MAC requirements
Network: AlexNet
[Chart: Original Network vs. FC Layer Compressed vs. FC and Conv Layer Compressed]
Compression
• ~35% MAC reduction
• ~1.3 percentage point loss in top-5 error
Fine tuning
• 2.5 percentage point improvement in top-5 error under max MAC constraints
19
Fixed point and reduced bit widths
Focus on reducing precision for both weights (static values) and activations (dynamic values) versus traditional 32-bit floating-point approaches
• Physically smaller networks
• 2X improvement in memory access efficiency for network weights
16-bit values are used with no net increase in top-5 error
Rows: activation bit widths. Columns: neural network weight bit widths.
Activation    4        8       16      24      32      Float
8             20.0%    1.4%    0.1%    0.1%    0.1%    0.1%
16            20.1%    1.4%    0.0%    0.0%    0.0%    0.0%
24            20.1%    1.4%    0.0%    0.0%    0.0%    0.0%
32            20.1%    1.4%    0.0%    0.0%    0.0%    0.0%
Float         20.1%    1.4%    0.0%    0.0%    0.0%    0.0%
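As a rough illustration of why very low bit widths hurt accuracy while 16 bits is nearly lossless, the sketch below applies a simple symmetric linear quantizer to a few weights and reports the worst-case rounding error. The quantization scheme, the weight values, and the function names are assumptions for illustration; the slides do not specify the fixed-point format actually used.

```python
def quantize(values, bits):
    """Map floats to signed integer codes using one shared scale (symmetric, linear)."""
    q_max = 2 ** (bits - 1) - 1
    max_abs = max(abs(v) for v in values)
    scale = max_abs / q_max if max_abs else 1.0
    codes = [max(-q_max, min(q_max, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


def max_error(values, bits):
    """Largest absolute difference between original and quantized-then-restored values."""
    codes, scale = quantize(values, bits)
    return max(abs(v - r) for v, r in zip(values, dequantize(codes, scale)))


def storage_bytes(num_weights, bits):
    """Bytes needed to store the weights at a given bit width."""
    return num_weights * bits // 8


# Hypothetical weights, chosen only to show the trend.
weights = [0.82, -0.41, 0.05, -0.93, 0.27]
for bits in (4, 8, 16):
    print(bits, max_error(weights, bits))

# Halving the bit width halves the bytes fetched per weight.
print(storage_bytes(1_000_000, 32), storage_bytes(1_000_000, 16))
```

With this quantizer the rounding error shrinks by roughly half for every added bit, which is consistent with the table's pattern of large error at 4 bits and negligible error at 16 bits and above.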
20
Conclusions
What the future holds
21
What comes next?
Expanding the frontier of visual perception
• More complex models
• Video classification
• Scene parsing and object localization and tracking
Platform enhancements
• Evolution of the SoC
Working towards “Cognition”
• Qualcomm Research is experimenting with algorithms for “reasoning” to link perception to action
22
Additional resources
• Qualcomm Technologies, Inc. web sites:
  • Computer Vision: https://www.qualcomm.com/invention/research/projects/computer-vision
  • Cognitive Technologies: https://www.qualcomm.com/invention/cognitive-technologies
  • FastCV™ SDK: https://developer.qualcomm.com/mobile-development/add-advanced-features/computer-vision-fastcv/tools-and-resources
• Embedded Vision Alliance web sites:
  • Heterogeneous computing for CV: http://www.embedded-vision.com/platinum-members/qualcomm/embedded-vision-training/videos/pages/oct-2013-embedded-vision-summit-heterogeneous
  • CV acceleration: http://www.embedded-vision.com/platinum-members/bdti/embedded-vision-training/videos/pages/september-2013-qualcomm-uplinq-conferenc
• Demo in Technology Showcase
  • Scene detect through on-device deep learning
FastCV is a product of Qualcomm Technologies, Inc.
Qualcomm and Snapdragon are trademarks of Qualcomm Incorporated, registered in the United States and other countries.
FastCV is a trademark of Qualcomm Incorporated. All Qualcomm Incorporated trademarks are used with permission.
Other products and brand names may be trademarks or registered trademarks of their respective owners.

 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

"Deep-learning-based Visual Perception in Mobile and Embedded Devices: Opportunities and Challenges," a Presentation from Qualcomm

  • 1. Jeff Gehlhaar, Vice President, Qualcomm Research. May 12, 2015. Deep-learning-based visual perception in mobile and embedded devices: Opportunities and challenges
  • 2. Qualcomm Research: Transforming the future of mobile technology. Research, Prototype, Standardize
  • 3. Vision for cognitive computing: More intuitive devices and things
  • 4. Key elements of “Cognition”: Perception, Reasoning, Action. Diagram labels: Hear, See, Classify, Concepts, Infer context, Relationships, Anticipate, Plan, Autonomous
  • 5. On the road to a “Cognitive Platform”
      On-device capabilities:
      • Rich Connectivity: Integrated modem & AP; Adaptive RF front end; LTE broadcast & service focused modem features; Tightly integrated Wi-Fi/BT; Leading location / GPS
      • Heterogeneous Computing: Fully customized architecture; Superior performance at low power consumption; Highly optimized for cutting-edge cognitive capabilities
      • On-device Intelligence: On-device machine learning; Computer vision; Behavioral analysis; Sensor processing and classification algorithms; Natural language processing
      Diagram labels: Visual Perception, Speech & Audio Understanding, Natural Interactions, Intelligent Connectivity, Immersive Multimedia, Intuitive Security, Always On Awareness
  • 6. On-device visual perception is key: Democratizing robotics to assist us in daily lives; Revolutionizing transportation with autonomous cars; Contextualizing your environment through scene understanding
  • 7. Why fully on-device matters: Process data closest to the source, complement cloud. Reliability; Efficient use of network bandwidth; Low latency; Security and user privacy
  • 8. Deep learning solves visual perception
      • Qualcomm Technologies, Inc. has been applying machine learning to mobile for many years
      • Deep learning for visual perception
      • Provides best-in-class solutions
      • Traditionally a cloud-only solution, not available on mobile (until now)
      • Presents many implementation challenges
      • Our mobile-focused platform goes beyond deep learning to include RNNs and other strategies
      • Applications: Security, handwriting, natural language processing, etc.
      Diagram labels: Deep Network, C (×12), Pooling, Fully Connected, Result
  • 9. Challenges to enabling deep-learning-based visual perception on mobile
  • 10. Typical computing environment for deep learning. Performance: teraflops. Memory bandwidth: 100s of GB/s. Storage: 10s of GB of RAM. Power: 100s of watts. Best-in-class server-based visual perception models require about 2B MAC (multiply-accumulate) operations per image
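To give the 2B MAC figure some scale, the following is a minimal sketch of how MACs are commonly counted per layer. The helper names `conv_macs` and `fc_macs` are hypothetical, and the example shape is the first convolutional layer of AlexNet as commonly published (96 filters of 11×11×3 producing a 55×55 output); that shape is an assumption and does not come from the slides.

```python
def conv_macs(out_h, out_w, out_channels, kernel_h, kernel_w, in_channels):
    """MACs for one convolutional layer (bias ignored).

    Each output value needs one multiply-accumulate per kernel weight.
    """
    return out_h * out_w * out_channels * kernel_h * kernel_w * in_channels


def fc_macs(in_features, out_features):
    """MACs for one fully connected layer (bias ignored)."""
    return in_features * out_features


# Assumed example shape: AlexNet-style first conv layer.
conv1 = conv_macs(55, 55, 96, 11, 11, 3)
print(f"conv layer: {conv1:,} MACs")

# Assumed example shape: a 4096 -> 1000 classifier layer.
fc = fc_macs(4096, 1000)
print(f"fc layer:   {fc:,} MACs")
```

Under this counting a single early convolutional layer already costs on the order of 100 million MACs, while a large fully connected layer costs far fewer MACs despite holding many weights.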
  • 11. Supporting deep learning on-device is a major challenge. Constrained mobile environment: Power and thermal efficiency; Storage and memory bandwidth limitations; Battery powered. Visual perception workloads: Compute intensive; Large and complicated neural network models
  • 12. Solving the challenge of on-device visual perception within the power and thermal constraints of mobile devices
  • 14. Robot face tracking video: https://www.youtube.com/watch?v=0D9I0SBGAPY
  • 15. Efficient execution on mobile SoCs
      Key to deep learning on mobile is an efficient execution environment that considers all aspects of the SoC combined with efficient library implementations
      • Careful analysis of deep learning tradeoffs
      • Consider the impact of different network architectures
      • Focus on cache performance, data locality, DRAM utilization efficiency
      • Focus on parallelism and heterogeneity
      • Take advantage of heterogeneous computing frameworks (e.g. Qualcomm MARE)
      • Span execution across Qualcomm® Snapdragon™ CPU, DSP, and GPU
      • Focus on underlying optimizations
      • Convolutions implemented as highly efficient matrix multiply operations
      • Smart buffer management for GPU and fixed bit-width optimizations for DSP
      • Optimized matrix multiply for Snapdragon processors [1]: 6X faster than Eigen
      [1] Results are based on Snapdragon 805 processor and Eigen 3.2.2
      Qualcomm Snapdragon and Qualcomm Multicore Asynchronous Runtime Environment are products of Qualcomm Technologies, Inc.
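The idea of implementing a convolution as a matrix multiply can be sketched with the widely used im2col transformation. This is a generic NumPy illustration, not Qualcomm's optimized implementation; the function names `im2col`, `conv2d_as_matmul`, and `conv2d_direct` are hypothetical, and the sketch assumes stride 1 and no padding.

```python
import numpy as np


def im2col(image, kh, kw):
    """Unfold every kh x kw patch of a (C, H, W) image into one column."""
    c, h, w = image.shape
    out_h, out_w = h - kh + 1, w - kw + 1  # stride 1, no padding
    cols = np.empty((c * kh * kw, out_h * out_w), dtype=image.dtype)
    col = 0
    for y in range(out_h):
        for x in range(out_w):
            cols[:, col] = image[:, y:y + kh, x:x + kw].ravel()
            col += 1
    return cols, out_h, out_w


def conv2d_as_matmul(image, filters):
    """Convolve with filters of shape (F, C, kh, kw) using one matrix multiply."""
    f, c, kh, kw = filters.shape
    cols, out_h, out_w = im2col(image, kh, kw)
    weights = filters.reshape(f, c * kh * kw)  # one row per filter
    return (weights @ cols).reshape(f, out_h, out_w)


def conv2d_direct(image, filters):
    """Reference sliding-window implementation used only to check the result."""
    f, c, kh, kw = filters.shape
    _, h, w = image.shape
    out = np.zeros((f, h - kh + 1, w - kw + 1), dtype=image.dtype)
    for k in range(f):
        for y in range(out.shape[1]):
            for x in range(out.shape[2]):
                out[k, y, x] = np.sum(image[:, y:y + kh, x:x + kw] * filters[k])
    return out


rng = np.random.default_rng(0)
image = rng.standard_normal((3, 8, 8))
filters = rng.standard_normal((4, 3, 3, 3))
fast = conv2d_as_matmul(image, filters)
print(fast.shape, np.allclose(fast, conv2d_direct(image, filters)))
```

Once the patches are unfolded, all of the arithmetic sits in a single dense matrix product, which is why a tuned matrix multiply routine matters so much for this workload. The cost is extra memory for the unfolded patches.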
  • 16. Reducing model size through compression
      Goal: Reduce both physical size and number of MACs required at equivalent precision
      • Use available memory bandwidth and computations effectively, which improves power efficiency
      • Smaller size permits in-field model upgrades and improvements
      Diagram labels: Deep Network, C (×12), Pooling, Fully Convolution, Result
      Qualcomm Technologies, Inc. approach
      • Initial SVD approach based on a paper by Denton et al. of NYU [1]
      • Qualcomm Technologies, Inc. approach involves replacing single layers with multiple layers
      • Approach permits fine-tuning all layers, not just layers above compressed layers
      Results
      • Up to a 10X reduction in physical model size
      • Up to a 35% reduction in the number of MAC operations with minimal loss of precision
      [1] “Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation”, arXiv:1404.0736 [cs.CV]
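Replacing a single layer with multiple smaller layers can be illustrated with a truncated SVD of a fully connected weight matrix. This is a minimal sketch of the general low-rank idea only; it is not the method from the slides or the cited paper, it omits fine-tuning, and the names `compress_fc_layer` and `param_count` as well as the layer sizes are hypothetical.

```python
import numpy as np


def compress_fc_layer(weight, rank):
    """Split an (out, in) weight matrix into two smaller factors via truncated SVD.

    Returns `first` (rank, in) and `second` (out, rank) such that
    second @ first approximates weight.
    """
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    first = s[:rank, None] * vt[:rank, :]  # applied to the input first
    second = u[:, :rank]                   # applied to the intermediate result
    return first, second


def param_count(*matrices):
    """Total number of stored weights across the given matrices."""
    return sum(m.size for m in matrices)


rng = np.random.default_rng(0)
# Assumed toy layer: 512 inputs, 256 outputs, constructed to be exactly rank 32.
weight = rng.standard_normal((256, 32)) @ rng.standard_normal((32, 512))
first, second = compress_fc_layer(weight, rank=32)

x = rng.standard_normal(512)
y_original = weight @ x
y_compressed = second @ (first @ x)  # two small layers instead of one large one

print(param_count(weight), param_count(first, second))
print(np.allclose(y_original, y_compressed))
```

In this toy case the single layer stores 131072 weights and the two replacement layers store 24576, and the MAC count per input drops by the same ratio. The toy matrix is exactly low rank, so nothing is lost; trained layers are only approximately low rank, so truncation introduces error, which is where fine-tuning comes in.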
  • 17. Size compression and error rate impact. Chart series: Original Network; FC Layer Compressed; FC and Conv Layer Compressed. Fully connected layer compression significantly impacts physical network size: 10X size reduction, ~1 percentage point loss in top-5 error
  • 18. MAC compression and error rate impact (AlexNet). Chart series: Original Network; FC Layer Compressed; FC and Conv Layer Compressed. Convolutional layer compression significantly impacts MAC requirements. Compression: ~35% MAC reduction, ~1.3 percentage point loss in top-5 error. Fine tuning: 2.5 percentage point improvement in top-5 error under max MAC constraints
  • 19. Fixed point and reduced bit widths
      Focus on reduction of precision for both weights (static values) and activations (dynamic values) versus traditional 32-bit floating-point approaches
      • Physically smaller networks
      • 2X improvement in memory access efficiency for network weights
      16-bit values are used with no net increase in top-5 error
      Rows: Activation Bit Widths. Columns: Neural Network Weight Bit Widths.
                 4        8        16       24       32       Float
      8          20.0%    1.4%     0.1%     0.1%     0.1%     0.1%
      16         20.1%    1.4%     0.0%     0.0%     0.0%     0.0%
      24         20.1%    1.4%     0.0%     0.0%     0.0%     0.0%
      32         20.1%    1.4%     0.0%     0.0%     0.0%     0.0%
      Float      20.1%    1.4%     0.0%     0.0%     0.0%     0.0%
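Reducing weight precision can be sketched with simple symmetric linear quantization. The slides do not specify which fixed-point scheme was used, so this scheme is an assumption chosen for illustration, and the names `quantize` and `dequantize` are hypothetical.

```python
import numpy as np


def quantize(values, bits):
    """Symmetric linear quantization to signed `bits`-bit integers.

    Returns the integer codes and the scale needed to map them back.
    """
    values = np.asarray(values, dtype=np.float64)
    qmax = 2 ** (bits - 1) - 1
    max_abs = np.max(np.abs(values))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = np.clip(np.round(values / scale), -qmax, qmax).astype(np.int32)
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate real values."""
    return codes.astype(np.float64) * scale


rng = np.random.default_rng(0)
weights = rng.standard_normal(10_000).astype(np.float32)  # stand-in for trained weights

for bits in (4, 8, 16):
    codes, scale = quantize(weights, bits)
    error = np.max(np.abs(dequantize(codes, scale) - weights))
    print(f"{bits:2d}-bit: max abs error {error:.6f}")

# After the loop, `codes` holds the 16-bit result, which fits in int16.
print(weights.nbytes, codes.astype(np.int16).nbytes)
```

The rounding error is bounded by half the scale, so it shrinks quickly as the bit width grows. Storing the 16-bit codes takes half the bytes of 32-bit floats, which is one way to read the 2X memory access figure for network weights.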
  • 21. What comes next?
      Expanding the frontier of visual perception
      • More complex models
      • Video classification
      • Scene parsing and object localization and tracking
      Platform enhancements
      • Evolution of the SoC
      Working towards “Cognition”
      • Qualcomm Research is experimenting with algorithms for “reasoning” to link perception to action
  • 22. Additional resources
      Qualcomm Technologies, Inc. web sites:
      • Computer Vision: https://www.qualcomm.com/invention/research/projects/computer-vision
      • Cognitive Technologies: https://www.qualcomm.com/invention/cognitive-technologies
      • FastCV™ SDK: https://developer.qualcomm.com/mobile-development/add-advanced-features/computer-vision-fastcv/tools-and-resources
      Embedded Vision Alliance web sites:
      • Heterogeneous computing for CV: http://www.embedded-vision.com/platinum-members/qualcomm/embedded-vision-training/videos/pages/oct-2013-embedded-vision-summit-heterogeneous
      • CV acceleration: http://www.embedded-vision.com/platinum-members/bdti/embedded-vision-training/videos/pages/september-2013-qualcomm-uplinq-conferenc
      Demo in Technology Showcase
      • Scene detect through on-device deep learning
      FastCV is a product of Qualcomm Technologies, Inc. Qualcomm and Snapdragon are trademarks of Qualcomm Incorporated, registered in the United States and other countries. FastCV is a trademark of Qualcomm Incorporated. All Qualcomm Incorporated trademarks are used with permission. Other products and brand names may be trademarks or registered trademarks of their respective owners.