SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Downloaden Sie, um offline zu lesen
Achieving AI @Scale
on Mobile Devices
Qualcomm Technologies, Inc.
@qualcomm_techMarch 2018
2
Mobile is the
largest computing
platform in the
world
Source: Gartner Mar. ‘17
>8.5
Billion
Cumulative smartphone
unit shipments forecast
between 2017–2021
3
Source: Qualcomm Incorporated data. Currently, Qualcomm
semiconductors are products of Qualcomm Technologies, Inc.
or its subsidiaries. MSM is a product of Qualcomm Technologies,
Inc. IHS, Mar. ’17; Strategy Analytics, Mar. ’17
30 Years of driving the
evolution of wireless
#
1Fabless semiconductor
company
#
1 in 3G/4G LTE modem
842MMSM™ chipsets shipped FY’16
4
Modem
3G (CDMA, GSM)
4G/LTE
5G
Connectivity
Bluetooth Smart
Bluetooth Mesh
802.11ac/ax
802.11ad
802.11n
802.15.4
DSRC
Powerline
GNSS/Location
Peripherals (PCIe, USB.)
Processors
CPU
GPU
DSP
Multimedia
Video
Speech
Display processing
Media processing
Camera processing
Audio processing
Computer Vision
AR / VR
Qualcomm Technologies’ success is based on
RF
Power amps
Acoustic filters
RF switches
Other
Machine learning
NoC / MemCtrl
Power management
Security
Touch
Fingerprint
Technology leadership
Connectivity Computing
5
1 chipset per year
Chipset complexity
is growing dramatically
Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. Source: Qualcomm internal data
~10 chipsets per year
Transistors on a chip
More frequent and more
complex design cycles
Today
Transistors on a chip
<10 Million
Comparative scale
Circa 2000
>3Billion
6
A mobile processor today—Snapdragon 835
Highly integrated and complex SoC using 10nm process technology
Qualcomm
Spectra 180 Camera
Qualcomm
Mobile Security
Kryo 280 CPU
Wi-Fi
Snapdragon
X16 LTE modem
Adreno 540
Graphics Processing Unit
(GPU)
Hexagon 682 DSP
HVX
All-Ways
Aware
Qualcomm®
Location Suite
Display
Processing
Unit (DPU)
Video
Processing
Unit (VPU)
Qualcomm®
Aqstic Audio
* Compared to Snapdragon 820
Qualcomm Adreno, Qualcomm Kryo, Qualcomm Hexagon, Qualcomm Spectra, Qualcomm Aqstic, Qualcomm Location Suite, Qualcomm All-Ways Aware, and Qualcomm Mobile Security are products of Qualcomm Technologies, Inc.
Snapdragon X16 LTE
World’s first announced
gigabit-class LTE modem
Qualcomm®
Hexagon™
DSP
Snapdragon Neural Processing
Engine support
Qualcomm®
Mobile Security
First to support full biometric suite
Qualcomm®
Adreno™
Visual Processing
25% faster graphics rendering
60x more display colors*
Qualcomm®
Kryo™
280 CPU
Our most power efficient
architecture to date
Qualcomm Spectra™
Camera ISP
Smooth zoom | Fast-autofocus
True to life colors
7
Mobile scale
changes everything
7
Superior scale
Rapid
replacement cycles
Integrated and
optimized technologies
Industrial IoT
Smart cities
Automotive
Smart homes
Extended reality
Wearables
Healthcare
Mobile computing
Networking
8
Intelligence is moving to the device
Server/Cloud
Training
Execution/Inference
Devices
Execution/Inference
Training (emerging)
9
Privacy
Reliability
Low latency
Efficient use of
network bandwidth
Process data closest to the
source, complement the cloud
On-device
intelligence is
paramount
10
Mobile is becoming
the pervasive AI platform
11
Qualcomm
Technologies
is accelerating
on-device AI
Making efficient on-device machine learning
possible for highly responsive, private, and
intuitive user experiences
1212
Qualcomm
®
Artificial
Intelligence Platform
A high-performance
platform designed to
support myriad intelligent-
on-device-capabilities
that utilize:
• Qualcomm® Snapdragon™
mobile platform’s heterogeneous
compute capabilities within a
highly integrated SoC
• Innovations in machine learning
algorithms and enabling software
• Development frameworks to
minimize the time and effort for
integrating customer networks
with our platform
The platform for efficient on-device
machine learning
Qualcomm Artificial Intelligence Platform and Qualcomm Snapdragon are products of Qualcomm Technologies, Inc.
Audio
intelligence
Intuitive
security
Visual
intelligence
13
Making on-device intelligence pervasive
Focusing on high performance HW/SW and optimized network design
Algorithmic
advancements
Algorithmic research that benefits from
state-of-the-art deep neural networks
Optimization for space and
runtime efficiency
Efficient
hardware
Developing heterogeneous compute to
run demanding neural networks at low
power and within thermal limits
Selecting the right compute
block for the right task
Software
tools
Software accelerated run-time
for deep learning
SDK/development frameworks
14
Algorithmic enhancements for space and runtime efficiency
Improve performance by addressing model complexity
Source: Qualcomm Research. Results from running a semantic segmentation network for automotive.
Neural network optimizations
for embedded
• Improved network architecture
• Focus on memory and storage
◦ Reduce bit widths
◦ Model compression
◦ Leverage sparsity
• Architecture learning
Required operations per image
0
10000
20000
30000
40000
Original QCOM V1 QCOM V2 QCOM V3
GOPS
Network versions
QTI V1 QTI V2 QTI V3
27x
Reduction in
complexity
15
Snapdragon Neural Processing Engine SDK
Software accelerated runtime for the execution of deep neural networks on device
Qualcomm Kryo, Qualcomm Adreno and Qualcomm Hexagon are products of Qualcomm Technologies, Inc. Available at: developer.qualcomm.com
Efficient execution on Snapdragon
• Takes advantage of Snapdragon
heterogeneous computing capabilities
• Runtime and libraries accelerate deep
neural net processing on all engines:
CPU, GPU, and DSP with HVX
Model framework/Network support
• Convolutional neural networks and LSTMs
• Support for Caffe/Caffe2, TensorFlow,
and user/developer defined layers
Optimization/Debugging tools
• Offline network conversion tools
• Debug and analyze network performance
• API and SDK documentation with sample code
• Ease of integration into customer applications
Qualcomm®
Kryo™
CPU
Qualcomm®
Adreno™
GPU
Qualcomm®
Hexagon™
DSP
Sample
code
Offline
conversion tools
Analyze
performance
Ease of
integration
16
Robust software and tools simplify development
Mobile
Auto
Home
Camera
XR
NPE
GoogleNet/
Inception
SSD
Alexnet
ResNet
SqueezeNet
Object classification
Face detection
Scene segmentation
Natural language
understanding
Speaker recognition
Security/Authentication
Resource management
17
Optimized
NPE Model
(.dlc file)
Model to runtime workflow: Training and inference
Training: Machine learning experts build and train their network to solve their particular problem
Inference: NPE enables the network to run on Snapdragon devices
Model testing
Good
enough
match?
No
Backward propagation
Training
data
Test
data
Yes
Deep
learning
frameworks
(e.g., Caffe2,
TensorFlow)
Model
conversion
tools
Application Model
optimization
tools
Optional (quantization, compression, etc.)
Data 1
Data 1
Model building and training
NPE Enabled App
NPE runtime
.dlc file
Model file
Static weights and biases
NPE Model
(.dlc file)
18
Application
NPElibrary
Computeruntimes
High-level software architecture for the Snapdragon NPE
HW
Kernel space
User space
OS support:
• ARM Android
• ARM Linux
• x86 Linux
Kryo CPU Adreno GPU Hexagon DSP
OS
DSP kernel
mode driver
GPU kernel
mode driver
DL containerModel debugProfiling logging
Model loader
NPE API
CPU
QSMLSymphony
User defined layers
Float
Application code
Future
User developed
GPU
Float
OpenCL driver
DSP
Fixed 8
Fixed
16
FastRPC driver
Half
float
19
Model
Conversion Tool
UDL
.dlc file
Model
conversion tool
User Defined Layer (UDL) workflow
Supports prototyping of layers not yet supported by the Snapdragon NPE
Note: there is runtime performance cost to inserting a UDL into a network, related to “context” switching. Execution context is transferred to user in CPU control plane.
NPE model
(.dlc file)
UDL
Model
conversion tool
UDL handler
Model file
(static weights
and biases)
UDL
NPE workflow
with UDL
NPE model
(.dlc file)
Model
conversion tool
Model file
(static weights
and biases)
NPE workflow
User-provided implementation User-defined layer weights and parameters
NPE runtime
UDL implementation
NPE runtime
.dlc file
20
Converting and quantizing a model
Native model
(e.g., Caffe)
Model
conversion
tool
NPE model
(.dlc file)
Model
optimization
tool
NPE model
(.dlc file)
NPE enabled appOnly needed for certain features
(e.g., activation quantization)
Images
Images
Images
Optional (quantization,
compression, etc.)
snpe-<framework>-to-dlc snpe-dlc-optimize
snpe-<framework>-to-dlc
(<framework> = Caffe | Caffe2 | TensorFlow)
• Input is the model in native
framework format
• Output is a converted but not optimized
NPE DLC file
snpe-dlc-optimize
• Converts non-quantized DLC models
into 8-bit quantized DLC models
◦ Additionally implements further optimizations
such as SVD compression
• Quantized model is necessary for
fixed-point NPE runtimes (e.g. DSP)
21
Repeat
API example usage
NPE provides a simple C++ API
with the following functionality
• Load a DLC model and select
the runtime
• Execute the model
• Debug support
◦ Dump the output of all layers
in a model
• Collect performance metrics
◦ Per-layer timing
Green API calls are only
required for UDLs
Application
InitializationRuntime
Get input data and prepare InputTensor
Process OutputTensorMap
NPE
new SNPEBuilder ( DLContainer )
SNPE->execute( InputTensor, OutputTensorMap )
Snapdragon NPE Object (Snapdragon NPE)
SNPEBuilder Object(SBO)
SBO->setRunTimeProcessor( RuntimeMode )
SBO->build( )
InputTensor = SNPEFactory:
:getTensorFactory().createTensor()
See SNPEBuilder.hpp
for all builder APIs
Process
network
See NativeCpp/BatchRun/ example in NPE
For each UDL
read from
DLContainer
IUDL object (IUDL)
UdlFactory( UdlContext )
IUDL::Setup( )
SBO->setUdlBundle( UdlFactoryContextBundle )
IUDL->execute( InputTensor, OutputTensor )
2222
Learn more at: developer.qualcomm.com
AI + Qualcomm Technologies
Massive scale
of Millions
of units
100s
Based on Snapdragon Platforms capable of supporting Snapdragon NPE today.
23
Qualcomm Technologies
and Facebook collaboration
for massive AI scale
2424
Facebook +
Qualcomm Technologies
On-device AI with Snapdragon
Facebook and Qualcomm’s
Caffe2 collaboration
• Demonstrated Caffe2 acceleration
with NPE at F8 2017
• 5x performance upside on GPU
(compared to CPU)
• Announced commercial support of
Caffe2 in July through Qualcomm
Developer Network
• Facebook AML has integrated the
NPE with Caffe2
Future Caffe2/NPE
research and development
• Continue to work closely with
Facebook to optimize key
networks for maximum on-
device performance
• Enhancements to Caffe2
allowing Snapdragon specific
SoC optimizations
• More advanced AI-powered
XR applications
F8 2017
“On-device machine learning
is made possible by the
Qualcomm Snapdragon NPE
which does the heavy lifting
needed to run neural
networks more efficiently on
Snapdragon devices.”
Source: XDA
25
Enhancing the Facebook experience through on-device AI
Augmented reality features
potentially powered by AI
• Style transfer and filters
• Frames and masks
• Photo and live videos, including 360°
• Contextual awareness
(e.g. location/sensor metadata)
On-device acceleration benefits
• Smooth UI with increased frame rate
• Increased battery life
More engaging social media with AI and AR
*Requires network connection and will support up to 20 hours of battery life
26
What’s next
27
AI hardware
What does the
future look like?
Neural processing
architecture research
Distributed computing
architectures
28
AI offers enhanced experiences
and new capabilities for smartphones
Superior photographyTrue personal assistance
Extended battery life
Enhanced security
Natural user interfaces
Enhanced connectivity
A new development paradigm
where things repeatedly improve
29
AI will bring XR closer to the ultimate level of immersion
Creating physical presence in real or imagined worlds
Interactions
Natural UI
Depth estimation
Sounds
Audio filtering
and cleanup
Visuals
Rendering techniques
Battery efficiency
Managing workload
concurrency
3030
AI is revolutionizing
the car of the future
Redefining the in-car experience
• Natural user interfaces
• Personalization
• Driver awareness monitoring
Paving the road to autonomy
• Surround view perception
• Sensor fusion
• Path planning
• Decision making
31
Improved
optimization strategies
Algorithmic
advancements
Specialized
hardware
What’s next?
Reference
Follow us on:
For more information, visit us at:
www.qualcomm.com & www.qualcomm.com/blog
Thank you!
Nothing in these materials is an offer to sell any of the
components or devices referenced herein.
©2018 Qualcomm Technologies, Inc. and/or its affiliated
companies. All Rights Reserved.
Qualcomm is a trademark of Qualcomm Incorporated,
registered in the United States and other countries. Other
products and brand names may be trademarks or registered
trademarks of their respective owners.
References in this presentation to “Qualcomm” may mean Qualcomm
Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries
or business units within the Qualcomm corporate structure, as
applicable. Qualcomm Incorporated includes Qualcomm’s licensing
business, QTL, and the vast majority of its patent portfolio. Qualcomm
Technologies, Inc., a wholly-owned subsidiary of Qualcomm
Incorporated, operates, along with its subsidiaries, substantially all of
Qualcomm’s engineering, research and development functions, and
substantially all of its product and services businesses, including its
semiconductor business, QCT.

Weitere ähnliche Inhalte

Was ist angesagt?

Pushing the boundaries of AI research
Pushing the boundaries of AI researchPushing the boundaries of AI research
Pushing the boundaries of AI researchQualcomm Research
 
5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edgeQualcomm Research
 
AI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptAI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptQualcomm Research
 
latencyin fiber optic networks
latencyin fiber optic networkslatencyin fiber optic networks
latencyin fiber optic networksMapYourTech
 
5G Multi-Access Edge Compute
5G Multi-Access Edge Compute5G Multi-Access Edge Compute
5G Multi-Access Edge ComputeMichelle Holley
 
What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? Qualcomm Research
 
Iccbn Presentation - optical
Iccbn Presentation - opticalIccbn Presentation - optical
Iccbn Presentation - opticalroysoumya
 
Transforming enterprise and industry with 5G private networks
Transforming enterprise and industry with 5G private networksTransforming enterprise and industry with 5G private networks
Transforming enterprise and industry with 5G private networksQualcomm Research
 
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrack
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrackIoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrack
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrackThierry Lestable
 
Span Design considerations in TAT-14 US Backhaul across G.652 Fiber
Span Design considerations in TAT-14 US Backhaul across G.652 FiberSpan Design considerations in TAT-14 US Backhaul across G.652 Fiber
Span Design considerations in TAT-14 US Backhaul across G.652 FiberRoderick Dottin
 
Accelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assuranceAccelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assuranceADVA
 
Designing LoRaWAN for dense IoT deployments webinar
Designing LoRaWAN for dense IoT deployments webinarDesigning LoRaWAN for dense IoT deployments webinar
Designing LoRaWAN for dense IoT deployments webinarActility
 
The essential role of technology standards
The essential role of technology standardsThe essential role of technology standards
The essential role of technology standardsQualcomm Research
 
5G for Reliable Industrial Wireless Networks
5G for Reliable Industrial Wireless Networks5G for Reliable Industrial Wireless Networks
5G for Reliable Industrial Wireless NetworksAUTOWARE
 
Actility and Factory Systemes explain how IoT is transforming industry.
Actility and Factory Systemes explain how IoT is transforming industry.Actility and Factory Systemes explain how IoT is transforming industry.
Actility and Factory Systemes explain how IoT is transforming industry.Actility
 
Transforming to Enable 5G
Transforming to Enable 5GTransforming to Enable 5G
Transforming to Enable 5GLumina Networks
 
Link labs LTE-M NB-IOT Hype Webinar slides
Link labs LTE-M NB-IOT Hype Webinar slidesLink labs LTE-M NB-IOT Hype Webinar slides
Link labs LTE-M NB-IOT Hype Webinar slidesBrian Ray
 

Was ist angesagt? (20)

Pushing the boundaries of AI research
Pushing the boundaries of AI researchPushing the boundaries of AI research
Pushing the boundaries of AI research
 
5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge
 
5G NR-based C-V2X
5G NR-based C-V2X5G NR-based C-V2X
5G NR-based C-V2X
 
AI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptAI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-concept
 
latencyin fiber optic networks
latencyin fiber optic networkslatencyin fiber optic networks
latencyin fiber optic networks
 
5G Multi-Access Edge Compute
5G Multi-Access Edge Compute5G Multi-Access Edge Compute
5G Multi-Access Edge Compute
 
What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave?
 
Dwdm12
Dwdm12Dwdm12
Dwdm12
 
Iccbn Presentation - optical
Iccbn Presentation - opticalIccbn Presentation - optical
Iccbn Presentation - optical
 
Transforming enterprise and industry with 5G private networks
Transforming enterprise and industry with 5G private networksTransforming enterprise and industry with 5G private networks
Transforming enterprise and industry with 5G private networks
 
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrack
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrackIoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrack
IoT-LPWAN LoRa Geoloc - sagemcom - m2m-innovationworld_geotrack
 
Span Design considerations in TAT-14 US Backhaul across G.652 Fiber
Span Design considerations in TAT-14 US Backhaul across G.652 FiberSpan Design considerations in TAT-14 US Backhaul across G.652 Fiber
Span Design considerations in TAT-14 US Backhaul across G.652 Fiber
 
Accelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assuranceAccelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assurance
 
Designing LoRaWAN for dense IoT deployments webinar
Designing LoRaWAN for dense IoT deployments webinarDesigning LoRaWAN for dense IoT deployments webinar
Designing LoRaWAN for dense IoT deployments webinar
 
The essential role of technology standards
The essential role of technology standardsThe essential role of technology standards
The essential role of technology standards
 
5G for Reliable Industrial Wireless Networks
5G for Reliable Industrial Wireless Networks5G for Reliable Industrial Wireless Networks
5G for Reliable Industrial Wireless Networks
 
Wind River Overview
Wind River OverviewWind River Overview
Wind River Overview
 
Actility and Factory Systemes explain how IoT is transforming industry.
Actility and Factory Systemes explain how IoT is transforming industry.Actility and Factory Systemes explain how IoT is transforming industry.
Actility and Factory Systemes explain how IoT is transforming industry.
 
Transforming to Enable 5G
Transforming to Enable 5GTransforming to Enable 5G
Transforming to Enable 5G
 
Link labs LTE-M NB-IOT Hype Webinar slides
Link labs LTE-M NB-IOT Hype Webinar slidesLink labs LTE-M NB-IOT Hype Webinar slides
Link labs LTE-M NB-IOT Hype Webinar slides
 

Ähnlich wie Achieving AI @scale on Mobile Devices

“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...Edge AI and Vision Alliance
 
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...Edge AI and Vision Alliance
 
Intel IoT Edge Computing 在 AI 領域的應用與商機
Intel IoT Edge Computing 在 AI 領域的應用與商機Intel IoT Edge Computing 在 AI 領域的應用與商機
Intel IoT Edge Computing 在 AI 領域的應用與商機Amazon Web Services
 
Android Tools for Qualcomm Snapdragon Processors
Android Tools for Qualcomm Snapdragon Processors Android Tools for Qualcomm Snapdragon Processors
Android Tools for Qualcomm Snapdragon Processors Qualcomm Developer Network
 
Accelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudRebekah Rodriguez
 
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...Edge AI and Vision Alliance
 
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...PROIDEA
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Intel® Software
 
Machine Learning Inference at the Edge
Machine Learning Inference at the EdgeMachine Learning Inference at the Edge
Machine Learning Inference at the EdgeAmazon Web Services
 
Datapalooza: A Music Festival Themed ML & IoT Workshop
Datapalooza: A Music Festival Themed ML & IoT WorkshopDatapalooza: A Music Festival Themed ML & IoT Workshop
Datapalooza: A Music Festival Themed ML & IoT WorkshopAmazon Web Services
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0Ganesan Narayanasamy
 
Machine Learning Inference at the Edge
Machine Learning Inference at the EdgeMachine Learning Inference at the Edge
Machine Learning Inference at the EdgeJulien SIMON
 
G rpc talk with intel (3)
G rpc talk with intel (3)G rpc talk with intel (3)
G rpc talk with intel (3)Intel
 
Resume_Appaji
Resume_AppajiResume_Appaji
Resume_AppajiAppaji K
 
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision SystemHai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision SystemAI Frontiers
 
Eric Theis resume61.1
Eric Theis resume61.1Eric Theis resume61.1
Eric Theis resume61.1Eric Theis
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...James Serra
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next DecadePaula Koziol
 

Ähnlich wie Achieving AI @scale on Mobile Devices (20)

“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
 
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
 
Intel IoT Edge Computing 在 AI 領域的應用與商機
Intel IoT Edge Computing 在 AI 領域的應用與商機Intel IoT Edge Computing 在 AI 領域的應用與商機
Intel IoT Edge Computing 在 AI 領域的應用與商機
 
Android Tools for Qualcomm Snapdragon Processors
Android Tools for Qualcomm Snapdragon Processors Android Tools for Qualcomm Snapdragon Processors
Android Tools for Qualcomm Snapdragon Processors
 
Accelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to Cloud
 
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
 
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...
PLNOG 17 - Andrzej Jeruzal - Dell Networking OS10: sieciowy system operacyjny...
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)
 
Machine Learning Inference at the Edge
Machine Learning Inference at the EdgeMachine Learning Inference at the Edge
Machine Learning Inference at the Edge
 
Datapalooza: A Music Festival Themed ML & IoT Workshop
Datapalooza: A Music Festival Themed ML & IoT WorkshopDatapalooza: A Music Festival Themed ML & IoT Workshop
Datapalooza: A Music Festival Themed ML & IoT Workshop
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0
 
The future of AI is hybrid
The future of AI is hybridThe future of AI is hybrid
The future of AI is hybrid
 
Machine Learning Inference at the Edge
Machine Learning Inference at the EdgeMachine Learning Inference at the Edge
Machine Learning Inference at the Edge
 
China AI Summit talk 2017
China AI Summit talk 2017China AI Summit talk 2017
China AI Summit talk 2017
 
G rpc talk with intel (3)
G rpc talk with intel (3)G rpc talk with intel (3)
G rpc talk with intel (3)
 
Resume_Appaji
Resume_AppajiResume_Appaji
Resume_Appaji
 
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision SystemHai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
 
Eric Theis resume61.1
Eric Theis resume61.1Eric Theis resume61.1
Eric Theis resume61.1
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next Decade
 

Mehr von Qualcomm Research

Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdfQualcomm Research
 
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Research
 
Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Qualcomm Research
 
Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfQualcomm Research
 
Enabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfEnabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfQualcomm Research
 
Presentation - Model Efficiency for Edge AI
Presentation - Model Efficiency for Edge AIPresentation - Model Efficiency for Edge AI
Presentation - Model Efficiency for Edge AIQualcomm Research
 
Bringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingBringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingQualcomm Research
 
How will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfHow will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfQualcomm Research
 
Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Qualcomm Research
 
Realizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GRealizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GQualcomm Research
 
3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolutionQualcomm Research
 
Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Qualcomm Research
 
Enabling on-device learning at scale
Enabling on-device learning at scaleEnabling on-device learning at scale
Enabling on-device learning at scaleQualcomm Research
 
The essential role of AI in the 5G future
The essential role of AI in the 5G futureThe essential role of AI in the 5G future
The essential role of AI in the 5G futureQualcomm Research
 
Role of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingRole of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingQualcomm Research
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyQualcomm Research
 
How to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANHow to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANQualcomm Research
 
Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AIQualcomm Research
 
Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Qualcomm Research
 

Mehr von Qualcomm Research (20)

Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
 
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
 
Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022
 
Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdf
 
Enabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfEnabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdf
 
Presentation - Model Efficiency for Edge AI
Presentation - Model Efficiency for Edge AIPresentation - Model Efficiency for Edge AI
Presentation - Model Efficiency for Edge AI
 
Bringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingBringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensing
 
How will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfHow will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdf
 
Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)
 
Realizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GRealizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5G
 
3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution
 
Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18
 
Enabling on-device learning at scale
Enabling on-device learning at scaleEnabling on-device learning at scale
Enabling on-device learning at scale
 
The essential role of AI in the 5G future
The essential role of AI in the 5G futureThe essential role of AI in the 5G future
The essential role of AI in the 5G future
 
Role of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingRole of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous driving
 
Pioneering 5G broadcast
Pioneering 5G broadcastPioneering 5G broadcast
Pioneering 5G broadcast
 
Intelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiencyIntelligence at scale through AI model efficiency
Intelligence at scale through AI model efficiency
 
How to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANHow to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RAN
 
Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AI
 
Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...
 

Kürzlich hochgeladen

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 

Kürzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 

Achieving AI @scale on Mobile Devices

  • 1. Achieving AI @Scale on Mobile Devices Qualcomm Technologies, Inc. @qualcomm_techMarch 2018
  • 2. 2 Mobile is the largest computing platform in the world Source: Gartner Mar. ‘17 >8.5 Billion Cumulative smartphone unit shipments forecast between 2017–2021
  • 3. 3 Source: Qualcomm Incorporated data. Currently, Qualcomm semiconductors are products of Qualcomm Technologies, Inc. or its subsidiaries. MSM is a product of Qualcomm Technologies, Inc. IHS, Mar. ’17; Strategy Analytics, Mar. ’17 30 Years of driving the evolution of wireless # 1Fabless semiconductor company # 1 in 3G/4G LTE modem 842MMSM™ chipsets shipped FY’16
  • 4. 4 Modem 3G (CDMA, GSM) 4G/LTE 5G Connectivity Bluetooth Smart Bluetooth Mesh 802.11ac/ax 802.11ad 802.11n 802.15.4 DSRC Powerline GNSS/Location Peripherals (PCIe, USB.) Processors CPU GPU DSP Multimedia Video Speech Display processing Media processing Camera processing Audio processing Computer Vision AR / VR Qualcomm Technologies’ success is based on RF Power amps Acoustic filters RF switches Other Machine learning NoC / MemCtrl Power management Security Touch Fingerprint Technology leadership Connectivity Computing
  • 5. 5 1 chipset per year Chipset complexity is growing dramatically Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. Source: Qualcomm internal data ~10 chipsets per year Transistors on a chip More frequent and more complex design cycles Today Transistors on a chip <10 Million Comparative scale Circa 2000 >3Billion
  • 6. 6 A mobile processor today—Snapdragon 835 Highly integrated and complex SoC using 10nm process technology Qualcomm Spectra 180 Camera Qualcomm Mobile Security Kryo 280 CPU Wi-Fi Snapdragon X16 LTE modem Adreno 540 Graphics Processing Unit (GPU) Hexagon 682 DSP HVX All-Ways Aware Qualcomm® Location Suite Display Processing Unit (DPU) Video Processing Unit (VPU) Qualcomm® Aqstic Audio * Compared to Snapdragon 820 Qualcomm Adreno, Qualcomm Kryo, Qualcomm Hexagon, Qualcomm Spectra, Qualcomm Aqstic, Qualcomm Location Suite, Qualcomm All-Ways Aware, and Qualcomm Mobile Security are products of Qualcomm Technologies, Inc. Snapdragon X16 LTE World’s first announced gigabit-class LTE modem Qualcomm® Hexagon™ DSP Snapdragon Neural Processing Engine support Qualcomm® Mobile Security First to support full biometric suite Qualcomm® Adreno™ Visual Processing 25% faster graphics rendering 60x more display colors* Qualcomm® Kryo™ 280 CPU Our most power efficient architecture to date Qualcomm Spectra™ Camera ISP Smooth zoom | Fast-autofocus True to life colors
  • 7. 7 Mobile scale changes everything 7 Superior scale Rapid replacement cycles Integrated and optimized technologies Industrial IoT Smart cities Automotive Smart homes Extended reality Wearables Healthcare Mobile computing Networking
  • 8. 8 Intelligence is moving to the device Server/Cloud Training Execution/Inference Devices Execution/Inference Training (emerging)
  • 9. 9 Privacy Reliability Low latency Efficient use of network bandwidth Process data closest to the source, complement the cloud On-device intelligence is paramount
  • 10. 10 Mobile is becoming the pervasive AI platform
  • 11. 11 Qualcomm Technologies is accelerating on-device AI Making efficient on-device machine learning possible for highly responsive, private, and intuitive user experiences
  • 12. 1212 Qualcomm ® Artificial Intelligence Platform A high-performance platform designed to support myriad intelligent- on-device-capabilities that utilize: • Qualcomm® Snapdragon™ mobile platform’s heterogeneous compute capabilities within a highly integrated SoC • Innovations in machine learning algorithms and enabling software • Development frameworks to minimize the time and effort for integrating customer networks with our platform The platform for efficient on-device machine learning Qualcomm Artificial Intelligence Platform and Qualcomm Snapdragon are products of Qualcomm Technologies, Inc. Audio intelligence Intuitive security Visual intelligence
  • 13. 13 Making on-device intelligence pervasive Focusing on high performance HW/SW and optimized network design Algorithmic advancements Algorithmic research that benefits from state-of-the-art deep neural networks Optimization for space and runtime efficiency Efficient hardware Developing heterogeneous compute to run demanding neural networks at low power and within thermal limits Selecting the right compute block for the right task Software tools Software accelerated run-time for deep learning SDK/development frameworks
  • 14. 14 Algorithmic enhancements for space and runtime efficiency Improve performance by addressing model complexity Source: Qualcomm Research. Results from running a semantic segmentation network for automotive. Neural network optimizations for embedded • Improved network architecture • Focus on memory and storage ◦ Reduce bit widths ◦ Model compression ◦ Leverage sparsity • Architecture learning Required operations per image 0 10000 20000 30000 40000 Original QCOM V1 QCOM V2 QCOM V3 GOPS Network versions QTI V1 QTI V2 QTI V3 27x Reduction in complexity
  • 15. 15 Snapdragon Neural Processing Engine SDK Software accelerated runtime for the execution of deep neural networks on device Qualcomm Kryo, Qualcomm Adreno and Qualcomm Hexagon are products of Qualcomm Technologies, Inc. Available at: developer.qualcomm.com Efficient execution on Snapdragon • Takes advantage of Snapdragon heterogeneous computing capabilities • Runtime and libraries accelerate deep neural net processing on all engines: CPU, GPU, and DSP with HVX Model framework/Network support • Convolutional neural networks and LSTMs • Support for Caffe/Caffe2, TensorFlow, and user/developer defined layers Optimization/Debugging tools • Offline network conversion tools • Debug and analyze network performance • API and SDK documentation with sample code • Ease of integration into customer applications Qualcomm® Kryo™ CPU Qualcomm® Adreno™ GPU Qualcomm® Hexagon™ DSP Sample code Offline conversion tools Analyze performance Ease of integration
  • 16. 16 Robust software and tools simplify development Mobile Auto Home Camera XR NPE GoogleNet/ Inception SSD Alexnet ResNet SqueezeNet Object classification Face detection Scene segmentation Natural language understanding Speaker recognition Security/Authentication Resource management
  • 17. 17 Optimized NPE Model (.dlc file) Model to runtime workflow: Training and inference Training: Machine learning experts build and train their network to solve their particular problem Inference: NPE enables the network to run on Snapdragon devices Model testing Good enough match? No Backward propagation Training data Test data Yes Deep learning frameworks (e.g., Caffe2, TensorFlow) Model conversion tools Application Model optimization tools Optional (quantization, compression, etc.) Data 1 Data 1 Model building and training NPE Enabled App NPE runtime .dlc file Model file Static weights and biases NPE Model (.dlc file)
  • 18. 18 Application NPElibrary Computeruntimes High-level software architecture for the Snapdragon NPE HW Kernel space User space OS support: • ARM Android • ARM Linux • x86 Linux Kryo CPU Adreno GPU Hexagon DSP OS DSP kernel mode driver GPU kernel mode driver DL containerModel debugProfiling logging Model loader NPE API CPU QSMLSymphony User defined layers Float Application code Future User developed GPU Float OpenCL driver DSP Fixed 8 Fixed 16 FastRPC driver Half float
  • 19. 19 Model Conversion Tool UDL .dlc file Model conversion tool User Defined Layer (UDL) workflow Supports prototyping of layers not yet supported by the Snapdragon NPE Note: there is runtime performance cost to inserting a UDL into a network, related to “context” switching. Execution context is transferred to user in CPU control plane. NPE model (.dlc file) UDL Model conversion tool UDL handler Model file (static weights and biases) UDL NPE workflow with UDL NPE model (.dlc file) Model conversion tool Model file (static weights and biases) NPE workflow User-provided implementation User-defined layer weights and parameters NPE runtime UDL implementation NPE runtime .dlc file
  • 20. 20 Converting and quantizing a model Native model (e.g., Caffe) Model conversion tool NPE model (.dlc file) Model optimization tool NPE model (.dlc file) NPE enabled appOnly needed for certain features (e.g., activation quantization) Images Images Images Optional (quantization, compression, etc.) snpe-<framework>-to-dlc snpe-dlc-optimize snpe-<framework>-to-dlc (<framework> = Caffe | Caffe2 | TensorFlow) • Input is the model in native framework format • Output is a converted but not optimized NPE DLC file snpe-dlc-optimize • Converts non-quantized DLC models into 8-bit quantized DLC models ◦ Additionally implements further optimizations such as SVD compression • Quantized model is necessary for fixed-point NPE runtimes (e.g. DSP)
  • 21. 21 Repeat API example usage NPE provides a simple C++ API with the following functionality • Load a DLC model and select the runtime • Execute the model • Debug support ◦ Dump the output of all layers in a model • Collect performance metrics ◦ Per-layer timing Green API calls are only required for UDLs Application InitializationRuntime Get input data and prepare InputTensor Process OutputTensorMap NPE new SNPEBuilder ( DLContainer ) SNPE->execute( InputTensor, OutputTensorMap ) Snapdragon NPE Object (Snapdragon NPE) SNPEBuilder Object(SBO) SBO->setRunTimeProcessor( RuntimeMode ) SBO->build( ) InputTensor = SNPEFactory: :getTensorFactory().createTensor() See SNPEBuilder.hpp for all builder APIs Process network See NativeCpp/BatchRun/ example in NPE For each UDL read from DLContainer IUDL object (IUDL) UdlFactory( UdlContext ) IUDL::Setup( ) SBO->setUdlBundle( UdlFactoryContextBundle ) IUDL->execute( InputTensor, OutputTensor )
  • 22. 2222 Learn more at: developer.qualcomm.com AI + Qualcomm Technologies Massive scale of Millions of units 100s Based on Snapdragon Platforms capable of supporting Snapdragon NPE today.
  • 23. 23 Qualcomm Technologies and Facebook collaboration for massive AI scale
  • 24. 2424 Facebook + Qualcomm Technologies On-device AI with Snapdragon Facebook and Qualcomm’s Caffe2 collaboration • Demonstrated Caffe2 acceleration with NPE at F8 2017 • 5x performance upside on GPU (compared to CPU) • Announced commercial support of Caffe2 in July through Qualcomm Developer Network • Facebook AML has integrated the NPE with Caffe2 Future Caffe2/NPE research and development • Continue to work closely with Facebook to optimize key networks for maximum on- device performance • Enhancements to Caffe2 allowing Snapdragon specific SoC optimizations • More advanced AI-powered XR applications F8 2017 “On-device machine learning is made possible by the Qualcomm Snapdragon NPE which does the heavy lifting needed to run neural networks more efficiently on Snapdragon devices.” Source: XDA
  • 25. 25 Enhancing the Facebook experience through on-device AI Augmented reality features potentially powered by AI • Style transfer and filters • Frames and masks • Photo and live videos, including 360° • Contextual awareness (e.g. location/sensor metadata) On-device acceleration benefits • Smooth UI with increased frame rate • Increased battery life More engaging social media with AI and AR *Requires network connection and will support up to 20 hours of battery life
  • 27. 27 AI hardware What does the future look like? Neural processing architecture research Distributed computing architectures
  • 28. 28 AI offers enhanced experiences and new capabilities for smartphones Superior photographyTrue personal assistance Extended battery life Enhanced security Natural user interfaces Enhanced connectivity A new development paradigm where things repeatedly improve
  • 29. 29 AI will bring XR closer to the ultimate level of immersion Creating physical presence in real or imagined worlds Interactions Natural UI Depth estimation Sounds Audio filtering and cleanup Visuals Rendering techniques Battery efficiency Managing workload concurrency
  • 30. 3030 AI is revolutionizing the car of the future Redefining the in-car experience • Natural user interfaces • Personalization • Driver awareness monitoring Paving the road to autonomy • Surround view perception • Sensor fusion • Path planning • Decision making
  • 32. Follow us on: For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog Thank you! Nothing in these materials is an offer to sell any of the components or devices referenced herein. ©2018 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved. Qualcomm is a trademark of Qualcomm Incorporated, registered in the United States and other countries. Other products and brand names may be trademarks or registered trademarks of their respective owners. References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm’s engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.