SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
© 2021 Qualcomm Technologies
Efficient Video Perception
Through AI
Fatih Porikli
Sr. Director, Technology
Qualcomm Technologies, Inc.
© 2021 Qualcomm Technologies
A picture is worth a thousand words
Out of all the five senses,
vision is arguably the most
important
© 2021 Qualcomm Technologies
A minute of video has more than 1000 pictures
© 2021 Qualcomm Technologies
The scale of video being created
and consumed is massive
4
1M
Minutes of video
crossing the internet per
second
300
Hours of video are
uploaded every
minute to YouTube
82%
Of all consumer
internet traffic is
online video
76
Minutes per day watching
video on digital devices
by US adults
8B
Average daily
video views
on Facebook
Cisco Visual Networking Index: Forecast and Trends, 2017–2022
© 2021 Qualcomm Technologies
Increasingly, video is all around us —
providing entertainment, enhancing collaboration, and transforming industries
5
Video monitoring
Extended reality Smart cities
Smart factories
Autonomous vehicles
Video conferencing
Sports
Smartphone
© 2021 Qualcomm Technologies 6
Understand
Recognizing patterns,
identities, objects,
scenes, context, relations,
compositions, changes,
motions, actions, activities,
events, 3D structures, surfaces,
lightings, text, emotions,
sentiments, sounds, and more
Systems
Any compute
platform, including
SoCs, CPUs,
GPUs, TPUs,
NPUs, and DSPs
Making
Developing
mathematical
representations,
models,
algorithms,
rules, and
frameworks
Video
perception
Making systems understand
video content
What is video perception?
© 2021 Qualcomm Technologies 6
© 2021 Qualcomm Technologies
What makes video perception challenging?
7
Platform
limitations
Task
diversity
Volume of video data
(training/testing)
Diversity in
visual data
Quality of data
acquisition
Availability of
annotated datasets
Implementation
challenges
Data
challenges
Video
perception
challenges
© 2021 Qualcomm Technologies
Power and thermal efficiency are essential for on-device
video perception
8
The challenge of
AI workloads
Constrained mobile & embedded
environments
Very compute
intensive
Large,
complicated neural
network models
Complex
concurrencies
Always-on
Real-time
Must be thermally
efficient for sleek,
ultra-light designs
Storage/memory
bandwidth limitations
Requires long battery
life for all-day use
© 2021 Qualcomm Technologies
Making video perception ubiquitous
9
Robustness
Robust to data variations
Adaptability
Adaptable to different domains
Scalability
Scaling up and down,
from IoT to the data center
Solving additional
key challenges to
take video perception
from the research lab
to broad commercial
deployment
© 2021 Qualcomm Technologies
Efficiently running on-device video perception without
sacrificing accuracy
10
Leverage
Temporal
redundancy
By reusing what
is computed before
• Learning to skip regions
• Recycling features
Make
Early
decisions
By dynamically changing
the network architecture
per input frame
• Early exiting
• Frame exiting
Key concepts
for efficient
video perception
© 2021 Qualcomm Technologies
Learning to skip redundant computations
Video frames are heavily correlated
11
frame t
“Skip-convolutions for efficient video processing” (CVPR 2021)
frame t+10 residual
The residual frame, the difference between two
consecutive frames, contains little information in
most regions
Limit the computation only to the regions
where there are significant changes
© 2021 Qualcomm Technologies
Skip-convolution
A convolutional layer with a skip gate that masks out negligible residuals
12
“Skip-convolutions for efficient video processing”
(CVPR 2021)
A convolution at a frame
can be written as the previous
frame’s convolution plus the
convolution of the residual
Computation is limited only
to the regions where there
are strong residuals
Reinforce residual’s sparsity
by removing negligible residuals
Can replace convolutional layers in any
CNN with skip convolutions
Conv
G
Conv
– +
Skip
gate
Previous frame
output features
Output
features
Input
features
Convolutional layer
Previous frame
input features
Current frame
input features
Output
features
Previous frame
output features
Small
residual
Large
residual
No computation
12
© 2021 Qualcomm Technologies
Learning to skip reduces compute for human pose
estimation
13
“Skip-convolutions for efficient video processing” (CVPR 2021)
Results for human
pose estimation
GMACs without
skip-convolutions
GMACs with
skip-convolutions
10.2
10.2
1.1
10.2
10.2
10.2
0.4
10.2 0.4
10.2 0.4
10.2 0.5
10.2
0.7
10.2 0.8
10.2 0.7
10.2
© 2021 Qualcomm Technologies
Learning to skip is complementary to model compression
14
0.9
0.91
0.92
0.93
0.94
0.95
0.96
0 2 4 6 8 10
Percentage
of
correct
keypoints
GMAC
Pose estimation
HRNet-w32
L0
W-SVD
S-SVD
Ours-Skip
Ours-Skip+S-SVD
“Skip-convolutions for efficient video processing” (CVPR 2021)
Results for human pose
estimation on video human
action dataset
speed-up over
HRNet
2.5x- 8x
© 2021 Qualcomm Technologies
Recycling features saves compute
Instead of computing deep features repetitively,
compute once and recycle
15
Compute deep features once and recycle
— reuse from past frame
Compute shallow features
for all frames
Deep features remain relatively stationary over
time — they have lower spatial resolution
Shallow features are more responsive to smooth
changes, encoding the temporally varying
information
Applicable to any video neural network architectures including segmentation, optical flow,
classification, and more
“Time-sharing networks for efficient semantic video segmentation” (submitted 2021)
© 2021 Qualcomm Technologies
Recycling features saves compute
Instead of computing deep features repetitively,
compute once and recycle
16
Raw video obtained from Cityscapes Benchmark:
https://www.cityscapes-dataset.com/
“Time-sharing networks for efficient semantic video
segmentation” (submitted 2021)
Visual example of
recycling features for a
semantic segmentation
task
© 2021 Qualcomm Technologies
Feature recycling reduces compute and latency
17
74.0
26.0
17.0
77.9
Model efficiency
78%
GMACs On-device latency (ms/frame)
GMAC
reduction 65% latency
reduction
Memory traffic
321.3
373.6
89%
MB read MB write
read
reduction 93% write
reduction
41.3 22.6
“Time-sharing networks for efficient semantic video segmentation” (submitted 2021)
Enhanced Net
HRNet w18 v2
Qualcomm Snapdragon is a product of Qualcomm
Technologies, Inc. and/or its subsidiaries.
Semantic segmentation
example
Input:
2048x1024 RGB video
Output:
2048x1024,
19 object classes
Runs on:
Qualcomm® Snapdragon™ 888 Mobile
Platform
© 2021 Qualcomm Technologies
Early exiting a neural network saves compute
Exploit the fact that not all input examples require
models of the same complexity
18
Ideally, our system should be composed of a
cascade of classifiers throughout the network
“FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021)
Very large, computationally intensive
models are needed to correctly classify
Very small and compact models can achieve very
high accuracies, but they fail for complex examples
Complex
examples
Simple
examples
© 2021 Qualcomm Technologies
Early exiting at the earliest possible NN layer for video
19
Conv
G1
Accept previous frame
layer output
Feature
from t-1
Feature
from t
G1:
Gating based
on temporal
agreement
Continue
Similar
Not
similar
FC Classifier
G2
G2:
Gating based
on frame
complexity
Exit
Continue
Confident
Not
confident
Frame t-1
Exit NN layer
“FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021)
Frame t
Exit NN layer
© 2021 Qualcomm Technologies
Early exiting reduces compute while maintaining accuracy
20
0.91
0.915
0.92
0.925
0.93
0.935
0.94
0.945
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
Top-1
accuracy
MAC (x108)
Classification on image dataset
RANets1
RANets2
RANets3
MSDNet
Ours-MSDNet
DenseNet
Ours-DenseNet
ResNet
Ours-ResNet
Early exiting applies
to most neural
network backbones
“FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021)
© 2021 Qualcomm Technologies
What’s next?
21
Efficient sparse convolutions
Personalization
Multi-task networks
Quantization aware training
Platform optimizations
Unsupervised / semi-
supervised learning
Advance existing conditional compute
techniques
Develop efficient video
neural network solutions
Learning to skip regions
Recycling features
Early exiting
Frame exiting
Future work
in video
perception
© 2021 Qualcomm Technologies
Machine
learning
core
Computer vision
3D Sensing
RF Sensing
Personalization
Biometrics
Data driven models
XR
Camera
Autonomous
vehicles
Fingerprint ASP
IOT
Cloud
Wi-Fi
Mobile
Research Areas Applications
Leading high-impact
research efforts in
perception
Inventing technology
enablers for important
applications
22
Our perception research is much broader than video
© 2021 Qualcomm Technologies
© 2021 Qualcomm Technologies
Takeaways
23
Video perception is crucial for understanding the
world and making devices smarter
We are conducting leading research and
development in video perception
We are making power efficient video perception
possible without sacrificing accuracy
© 2021 Qualcomm Technologies
Resource slide
• Qualcomm AI page:
https://www.qualcomm.com/invention/artificial-
intelligence
• Qualcomm AI Research page:
https://www.qualcomm.com/invention/artificial-
intelligence/ai-research
• Qualcomm® Platform Solution Ecosystem:
https://www.qualcomm.com/support/qan/platform-
solutions-ecosystem
• GitHub open-source projects:
https://github.com/quic/aimet
https://github.com/quic/aimet-model-zoo/
• Qualcomm Mobile AI page:
https://www.qualcomm.com/products/smartphones/mobile-ai
• Qualcomm Mobile AI blog:
https://www.qualcomm.com/news/onq/2020/12/02/exploring-ai-
capabilities-qualcomm-snapdragon-888-mobile-platform
• Qualcomm® Cloud AI 100 blog:
https://www.qualcomm.com/news/onq/2021/03/15/qualcomm-cloud-
ai-100-amd-epyc-7003-series-processor-and-gigabyte-server
• Qualcomm AI Research blog:
https://www.qualcomm.com/news/onq/2020/09/01/pushing-
boundaries-ai-research
Qualcomm AI Research is an initiative of Qualcomm Technologies, Inc. AIMET and AIMET Model Zoo are products of Qualcomm Innovation Center, Inc.
Qualcomm Cloud AI is a product of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm Platform Solutions Ecosystem Program is a program of Qualcomm Technologies, Inc. and/or its subsidiaries.
© 2021 Qualcomm Technologies
Thank you
25
Follow us on:
For more information, visit us at:
www.qualcomm.com & www.qualcomm.com/blog
Nothing in these materials is an offer to sell any of the
components or devices referenced herein.
©2018-2021 Qualcomm Technologies, Inc. and/or its
affiliated companies. All Rights Reserved.
Qualcomm and Snapdragon are trademarks or registered
trademarks of Qualcomm Incorporated. Other products
and brand names may be trademarks or registered
trademarks of their respective owners.
References in this presentation to “Qualcomm” may mean Qualcomm
Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries
or business units within the Qualcomm corporate structure, as
applicable. Qualcomm Incorporated includes our licensing business,
QTL, and the vast majority of our patent portfolio. Qualcomm
Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates,
along with its subsidiaries, substantially all of our engineering,
research and development functions, and substantially all of our
products and services businesses, including our QCT semiconductor
business.

Weitere ähnliche Inhalte

Was ist angesagt?

AI, A New Computing Model
AI, A New Computing ModelAI, A New Computing Model
AI, A New Computing ModelNVIDIA Taiwan
 
Dell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarDell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarBill Wong
 
AMD 2014 Performance Mobile APUs
AMD 2014 Performance Mobile APUsAMD 2014 Performance Mobile APUs
AMD 2014 Performance Mobile APUsAMD
 
AMD 2014 Mobility APU Lineup Announcement
AMD 2014 Mobility APU Lineup AnnouncementAMD 2014 Mobility APU Lineup Announcement
AMD 2014 Mobility APU Lineup AnnouncementAMD
 
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres..."Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...Edge AI and Vision Alliance
 
Enabling ARM® Server Technology for the Datacenter
Enabling ARM® Server Technology for the DatacenterEnabling ARM® Server Technology for the Datacenter
Enabling ARM® Server Technology for the DatacenterAMD
 
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介NVIDIA Japan
 
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic..."Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...Edge AI and Vision Alliance
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA Taiwan
 
2014 AMD Low-Power Mobile APUs
2014 AMD Low-Power Mobile APUs2014 AMD Low-Power Mobile APUs
2014 AMD Low-Power Mobile APUsAMD
 
Manufacturing pov jeff green 2016 v2
Manufacturing pov jeff green 2016 v2Manufacturing pov jeff green 2016 v2
Manufacturing pov jeff green 2016 v2Jeff Green
 
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD
 
Accelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudRebekah Rodriguez
 
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series Switches
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series SwitchesTechWiseTV Workshop: Application Hosting on Catalyst 9000 Series Switches
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series SwitchesRobb Boyd
 
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...CAST, Inc.
 
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...Intel® Software
 
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Developer Network
 
MPU and MCU IP Cores from CAST
MPU and MCU IP Cores from CASTMPU and MCU IP Cores from CAST
MPU and MCU IP Cores from CASTCAST, Inc.
 
CAST BA22 32-bit Processor Design Seminar, 2/1/12
CAST BA22 32-bit Processor Design Seminar, 2/1/12CAST BA22 32-bit Processor Design Seminar, 2/1/12
CAST BA22 32-bit Processor Design Seminar, 2/1/12CAST, Inc.
 

Was ist angesagt? (20)

AI, A New Computing Model
AI, A New Computing ModelAI, A New Computing Model
AI, A New Computing Model
 
Dell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarDell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation Webinar
 
AMD 2014 Performance Mobile APUs
AMD 2014 Performance Mobile APUsAMD 2014 Performance Mobile APUs
AMD 2014 Performance Mobile APUs
 
AMD 2014 Mobility APU Lineup Announcement
AMD 2014 Mobility APU Lineup AnnouncementAMD 2014 Mobility APU Lineup Announcement
AMD 2014 Mobility APU Lineup Announcement
 
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres..."Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
 
Enabling ARM® Server Technology for the Datacenter
Enabling ARM® Server Technology for the DatacenterEnabling ARM® Server Technology for the Datacenter
Enabling ARM® Server Technology for the Datacenter
 
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介
1050: 車載用ADAS/自動運転プラットフォームDRIVE PX及びコックピット・プラットフォームDRIVE CXのご紹介
 
Bring Out the Best in Embedded Computing
Bring Out the Best in Embedded ComputingBring Out the Best in Embedded Computing
Bring Out the Best in Embedded Computing
 
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic..."Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...
"Combining Flexibility and Low-Power in Embedded Vision Subsystems: An Applic...
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
 
2014 AMD Low-Power Mobile APUs
2014 AMD Low-Power Mobile APUs2014 AMD Low-Power Mobile APUs
2014 AMD Low-Power Mobile APUs
 
Manufacturing pov jeff green 2016 v2
Manufacturing pov jeff green 2016 v2Manufacturing pov jeff green 2016 v2
Manufacturing pov jeff green 2016 v2
 
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
 
Accelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to Cloud
 
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series Switches
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series SwitchesTechWiseTV Workshop: Application Hosting on Catalyst 9000 Series Switches
TechWiseTV Workshop: Application Hosting on Catalyst 9000 Series Switches
 
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
 
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...
Intel® Open Image Denoise: Optimized CPU Denoising | SIGGRAPH 2019 Technical ...
 
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
 
MPU and MCU IP Cores from CAST
MPU and MCU IP Cores from CASTMPU and MCU IP Cores from CAST
MPU and MCU IP Cores from CAST
 
CAST BA22 32-bit Processor Design Seminar, 2/1/12
CAST BA22 32-bit Processor Design Seminar, 2/1/12CAST BA22 32-bit Processor Design Seminar, 2/1/12
CAST BA22 32-bit Processor Design Seminar, 2/1/12
 

Ähnlich wie “Efficient Video Perception Through AI,” a Presentation from Qualcomm

Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AIQualcomm Research
 
AWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfAWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfV-Nova
 
Telepresnce Solutions and SVC
Telepresnce Solutions and SVCTelepresnce Solutions and SVC
Telepresnce Solutions and SVCIMTC
 
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...Edge AI and Vision Alliance
 
Whitepaper multipoint video_conferencing_june2012_wr
Whitepaper multipoint video_conferencing_june2012_wrWhitepaper multipoint video_conferencing_june2012_wr
Whitepaper multipoint video_conferencing_june2012_wrJohn Shim
 
BRKEVT-2311_joeking_pbr.pptx
BRKEVT-2311_joeking_pbr.pptxBRKEVT-2311_joeking_pbr.pptx
BRKEVT-2311_joeking_pbr.pptxJoel W. King
 
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...Edge AI and Vision Alliance
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002Enrico Busto
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002Enrico Busto
 
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...Edge AI and Vision Alliance
 
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...Edge AI and Vision Alliance
 
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...Edge AI and Vision Alliance
 
Designing CloudStack Clouds
Designing CloudStack CloudsDesigning CloudStack Clouds
Designing CloudStack CloudsShapeBlue
 
Edge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-timeEdge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-timeShuquan Huang
 
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)Samy Fodil
 
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...Avi Networks
 
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ..."Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...Edge AI and Vision Alliance
 
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...Edge AI and Vision Alliance
 
Hardware implementation of 3 d dct compressed and digitally watermarked video
Hardware implementation of 3 d dct compressed and digitally watermarked videoHardware implementation of 3 d dct compressed and digitally watermarked video
Hardware implementation of 3 d dct compressed and digitally watermarked videoIAEME Publication
 
White Paper - CEVA-XM4 Intelligent Vision Processor
White Paper - CEVA-XM4 Intelligent Vision ProcessorWhite Paper - CEVA-XM4 Intelligent Vision Processor
White Paper - CEVA-XM4 Intelligent Vision ProcessorCEVA, Inc.
 

Ähnlich wie “Efficient Video Perception Through AI,” a Presentation from Qualcomm (20)

Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AI
 
AWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfAWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdf
 
Telepresnce Solutions and SVC
Telepresnce Solutions and SVCTelepresnce Solutions and SVC
Telepresnce Solutions and SVC
 
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
 
Whitepaper multipoint video_conferencing_june2012_wr
Whitepaper multipoint video_conferencing_june2012_wrWhitepaper multipoint video_conferencing_june2012_wr
Whitepaper multipoint video_conferencing_june2012_wr
 
BRKEVT-2311_joeking_pbr.pptx
BRKEVT-2311_joeking_pbr.pptxBRKEVT-2311_joeking_pbr.pptx
BRKEVT-2311_joeking_pbr.pptx
 
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
“The Future of AI is Here Today: Deep Dive into Qualcomm’s On-Device AI Offer...
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...
“Trends in Neural Network Topologies for Vision at the Edge,” a Presentation ...
 
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...
“5G and AI Transforming the Next Generation of Robotics,” a Presentation from...
 
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
 
Designing CloudStack Clouds
Designing CloudStack CloudsDesigning CloudStack Clouds
Designing CloudStack Clouds
 
Edge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-timeEdge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-time
 
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)
Connectivity is here (5 g, swarm,...). now, let's build interplanetary apps! (1)
 
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...
Deliver Modern Applications with an Elastic Load Balancing Fabric Powered by ...
 
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ..."Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
"Portable Performance via the OpenVX Computer Vision Library: Case Studies," ...
 
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
“Open Standards: Powering the Future of Embedded Vision,” a Presentation from...
 
Hardware implementation of 3 d dct compressed and digitally watermarked video
Hardware implementation of 3 d dct compressed and digitally watermarked videoHardware implementation of 3 d dct compressed and digitally watermarked video
Hardware implementation of 3 d dct compressed and digitally watermarked video
 
White Paper - CEVA-XM4 Intelligent Vision Processor
White Paper - CEVA-XM4 Intelligent Vision ProcessorWhite Paper - CEVA-XM4 Intelligent Vision Processor
White Paper - CEVA-XM4 Intelligent Vision Processor
 

Mehr von Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

Mehr von Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Kürzlich hochgeladen

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Kürzlich hochgeladen (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

“Efficient Video Perception Through AI,” a Presentation from Qualcomm

  • 1. © 2021 Qualcomm Technologies Efficient Video Perception Through AI Fatih Porikli Sr. Director, Technology Qualcomm Technologies, Inc.
  • 2. © 2021 Qualcomm Technologies A picture is worth a thousand words Out of all the five senses, vision is arguably the most important
  • 3. © 2021 Qualcomm Technologies A minute of video has more than 1000 pictures
  • 4. © 2021 Qualcomm Technologies The scale of video being created and consumed is massive 4 1M Minutes of video crossing the internet per second 300 Hours of video are uploaded every minute to YouTube 82% Of all consumer internet traffic is online video 76 Minutes per day watching video on digital devices by US adults 8B Average daily video views on Facebook Cisco Visual Networking Index: Forecast and Trends, 2017–2022
  • 5. © 2021 Qualcomm Technologies Increasingly, video is all around us — providing entertainment, enhancing collaboration, and transforming industries 5 Video monitoring Extended reality Smart cities Smart factories Autonomous vehicles Video conferencing Sports Smartphone
  • 6. © 2021 Qualcomm Technologies 6 Understand Recognizing patterns, identities, objects, scenes, context, relations, compositions, changes, motions, actions, activities, events, 3D structures, surfaces, lightings, text, emotions, sentiments, sounds, and more Systems Any compute platform, including SoCs, CPUs, GPUs, TPUs, NPUs, and DSPs Making Developing mathematical representations, models, algorithms, rules, and frameworks Video perception Making systems understand video content What is video perception? © 2021 Qualcomm Technologies 6
  • 7. © 2021 Qualcomm Technologies What makes video perception challenging? 7 Platform limitations Task diversity Volume of video data (training/testing) Diversity in visual data Quality of data acquisition Availability of annotated datasets Implementation challenges Data challenges Video perception challenges
  • 8. © 2021 Qualcomm Technologies Power and thermal efficiency are essential for on-device video perception 8 The challenge of AI workloads Constrained mobile & embedded environments Very compute intensive Large, complicated neural network models Complex concurrencies Always-on Real-time Must be thermally efficient for sleek, ultra-light designs Storage/memory bandwidth limitations Requires long battery life for all-day use
  • 9. © 2021 Qualcomm Technologies Making video perception ubiquitous 9 Robustness Robust to data variations Adaptability Adaptable to different domains Scalability Scaling up and down, from IoT to the data center Solving additional key challenges to take video perception from the research lab to broad commercial deployment
  • 10. © 2021 Qualcomm Technologies Efficiently running on-device video perception without sacrificing accuracy 10 Leverage Temporal redundancy By reusing what is computed before • Learning to skip regions • Recycling features Make Early decisions By dynamically changing the network architecture per input frame • Early exiting • Frame exiting Key concepts for efficient video perception
  • 11. © 2021 Qualcomm Technologies Learning to skip redundant computations Video frames are heavily correlated 11 frame t “Skip-convolutions for efficient video processing” (CVPR 2021) frame t+10 residual The residual frame, the difference between two consecutive frames, contains little information in most regions Limit the computation only to the regions where there are significant changes
  • 12. © 2021 Qualcomm Technologies Skip-convolution A convolutional layer with a skip gate that masks out negligible residuals 12 “Skip-convolutions for efficient video processing” (CVPR 2021) A convolution at a frame can be written as the previous frame’s convolution plus the convolution of the residual Computation is limited only to the regions where there are strong residuals Reinforce residual’s sparsity by removing negligible residuals Can replace convolutional layers in any CNN with skip convolutions Conv G Conv – + Skip gate Previous frame output features Output features Input features Convolutional layer Previous frame input features Current frame input features Output features Previous frame output features Small residual Large residual No computation 12
  • 13. © 2021 Qualcomm Technologies Learning to skip reduces compute for human pose estimation 13 “Skip-convolutions for efficient video processing” (CVPR 2021) Results for human pose estimation GMACs without skip-convolutions GMACs with skip-convolutions 10.2 10.2 1.1 10.2 10.2 10.2 0.4 10.2 0.4 10.2 0.4 10.2 0.5 10.2 0.7 10.2 0.8 10.2 0.7 10.2
  • 14. © 2021 Qualcomm Technologies Learning to skip is complementary to model compression 14 0.9 0.91 0.92 0.93 0.94 0.95 0.96 0 2 4 6 8 10 Percentage of correct keypoints GMAC Pose estimation HRNet-w32 L0 W-SVD S-SVD Ours-Skip Ours-Skip+S-SVD “Skip-convolutions for efficient video processing” (CVPR 2021) Results for human pose estimation on video human action dataset speed-up over HRNet 2.5x- 8x
  • 15. © 2021 Qualcomm Technologies Recycling features saves compute Instead of computing deep features repetitively, compute once and recycle 15 Compute deep features once and recycle — reuse from past frame Compute shallow features for all frames Deep features remain relatively stationary over time — they have lower spatial resolution Shallow features are more responsive to smooth changes, encoding the temporally varying information Applicable to any video neural network architectures including segmentation, optical flow, classification, and more “Time-sharing networks for efficient semantic video segmentation” (submitted 2021)
  • 16. © 2021 Qualcomm Technologies Recycling features saves compute Instead of computing deep features repetitively, compute once and recycle 16 Raw video obtained from Cityscapes Benchmark: https://www.cityscapes-dataset.com/ “Time-sharing networks for efficient semantic video segmentation” (submitted 2021) Visual example of recycling features for a semantic segmentation task
  • 17. © 2021 Qualcomm Technologies Feature recycling reduces compute and latency 17 74.0 26.0 17.0 77.9 Model efficiency 78% GMACs On-device latency (ms/frame) GMAC reduction 65% latency reduction Memory traffic 321.3 373.6 89% MB read MB write read reduction 93% write reduction 41.3 22.6 “Time-sharing networks for efficient semantic video segmentation” (submitted 2021) Enhanced Net HRNet w18 v2 Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. and/or its subsidiaries. Semantic segmentation example Input: 2048x1024 RGB video Output: 2048x1024, 19 object classes Runs on: Qualcomm® Snapdragon™ 888 Mobile Platform
  • 18. © 2021 Qualcomm Technologies Early exiting a neural network saves compute Exploit the fact that not all input examples require models of the same complexity 18 Ideally, our system should be composed of a cascade of classifiers throughout the network “FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021) Very large, computationally intensive models are needed to correctly classify Very small and compact models can achieve very high accuracies, but they fail for complex examples Complex examples Simple examples
  • 19. © 2021 Qualcomm Technologies Early exiting at the earliest possible NN layer for video 19 Conv G1 Accept previous frame layer output Feature from t-1 Feature from t G1: Gating based on temporal agreement Continue Similar Not similar FC Classifier G2 G2: Gating based on frame complexity Exit Continue Confident Not confident Frame t-1 Exit NN layer “FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021) Frame t Exit NN layer
  • 20. © 2021 Qualcomm Technologies Early exiting reduces compute while maintaining accuracy 20 0.91 0.915 0.92 0.925 0.93 0.935 0.94 0.945 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Top-1 accuracy MAC (x108) Classification on image dataset RANets1 RANets2 RANets3 MSDNet Ours-MSDNet DenseNet Ours-DenseNet ResNet Ours-ResNet Early exiting applies to most neural network backbones “FrameExit: Conditional early exiting for efficient video recognition” (CVPR 2021)
  • 21. © 2021 Qualcomm Technologies What’s next? 21 Efficient sparse convolutions Personalization Multi-task networks Quantization aware training Platform optimizations Unsupervised / semi- supervised learning Advance existing conditional compute techniques Develop efficient video neural network solutions Learning to skip regions Recycling features Early exiting Frame exiting Future work in video perception
  • 22. © 2021 Qualcomm Technologies Machine learning core Computer vision 3D Sensing RF Sensing Personalization Biometrics Data driven models XR Camera Autonomous vehicles Fingerprint ASP IOT Cloud Wi-Fi Mobile Research Areas Applications Leading high-impact research efforts in perception Inventing technology enablers for important applications 22 Our perception research is much broader than video © 2021 Qualcomm Technologies
  • 23. © 2021 Qualcomm Technologies Takeaways 23 Video perception is crucial for understanding the world and making devices smarter We are conducting leading research and development in video perception We are making power efficient video perception possible without sacrificing accuracy
  • 24. © 2021 Qualcomm Technologies Resource slide • Qualcomm AI page: https://www.qualcomm.com/invention/artificial- intelligence • Qualcomm AI Research page: https://www.qualcomm.com/invention/artificial- intelligence/ai-research • Qualcomm® Platform Solution Ecosystem: https://www.qualcomm.com/support/qan/platform- solutions-ecosystem • GitHub open-source projects: https://github.com/quic/aimet https://github.com/quic/aimet-model-zoo/ • Qualcomm Mobile AI page: https://www.qualcomm.com/products/smartphones/mobile-ai • Qualcomm Mobile AI blog: https://www.qualcomm.com/news/onq/2020/12/02/exploring-ai- capabilities-qualcomm-snapdragon-888-mobile-platform • Qualcomm® Cloud AI 100 blog: https://www.qualcomm.com/news/onq/2021/03/15/qualcomm-cloud- ai-100-amd-epyc-7003-series-processor-and-gigabyte-server • Qualcomm AI Research blog: https://www.qualcomm.com/news/onq/2020/09/01/pushing- boundaries-ai-research Qualcomm AI Research is an initiative of Qualcomm Technologies, Inc. AIMET and AIMET Model Zoo are products of Qualcomm Innovation Center, Inc. Qualcomm Cloud AI is a product of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm Platform Solutions Ecosystem Program is a program of Qualcomm Technologies, Inc. and/or its subsidiaries.
  • 25. © 2021 Qualcomm Technologies Thank you 25 Follow us on: For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog Nothing in these materials is an offer to sell any of the components or devices referenced herein. ©2018-2021 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved. Qualcomm and Snapdragon are trademarks or registered trademarks of Qualcomm Incorporated. Other products and brand names may be trademarks or registered trademarks of their respective owners. References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.