SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
A New AI Platform Architecture
for the Smart Toys of the
Future
Gabriel Costache
Senior R&D Director
XPERI
40+
offices worldwide
headquarters in San
Jose, CA
$1.5B
+
market cap
public company,
trading under XPER
1,600
+
employees
worldwide
1,500
+
engineers
11,000
+
patent assets
100B+
devices worldwide
empowered by
technologies
delivered via Xperi
brands
• Safe
• Secure
• Private
• Enhances child development
• Uses natural interaction
• Monitors child cognitive load
• Develops with the child
• Long battery life
• Re-usable
Ideal Smart Toy
3
© 2022 XPERI
Smart Toy Examples
4
© 2022 XPERI
Privacy Issues
5
2022 XPERI
• Data privacy
• Safety
• Battery life
• Fast response
• AI technologies for children
• Data bias in AI
• Natural interaction with children
• Multimodal: audio, imaging, sensing
Smart Toy Challenges
6
© 2022 XPERI
DTIF (Disruptive Technology Innovation Fund)
D.A.V.I.D
DAVID will develop a “privacy by design” AI platform, capable of multi-modal, ultra-low power
consumption, “data center” level processing of audio and vision data on-device, without the need to
transmit any personal data to the cloud.
What DAVID will deliver to the smart toy market:
• A platform for a wide range of learning and interactive applications in the toy market
• A smart, trusted proof-of-concept toy using this platform that helps children learn and develop, using XPERI imaging
technology, Perceive® Ergo® chip and SoapBox Labs speech technology capabilities in collaboration with the National
University of Ireland, Galway.
• Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities.
DAVID – Data-center Audio/Video Intelligence
on Device
7
© 2022 XPERI
All-in-one Chip/Platform
Designed for Privacy
Multi-modal Platform Communication
Speech, Expressions, Emotions, Gesture, Context and
others..
• Perception
• Imaging/Vision
• Face Analytics
• Body Analytics
• Hand Analytics
• Video Compression
• Thermal Imaging
• Audio
• Wake Words / VAD
• Speech2Text / ASR
• Voice Analytics / Biometrics
• Sensing
AI Technologies to be Considered
8
© 2022 XPERI
• Interaction
• Visual
• Audio
• Text2Speech
• Sound Generation
• Others
• Language Models / Conversational Models
• Multi Modal Intent
• Cognitive and Behaviour Analysis
• Personalization
• Interactive Games
Perceive® Ergo® AI Processor
9
Source: A Reuther et al. MIT Lincoln Laboratory Supercomputing Center-arXiv:2009.00993
Ergo*
*Note: Ergo uses a proprietary representation. Ergo is not INT8.
© 2022 XPERI
DAVID Platform Design
10
© 2022 XPERI
• Interfaces:
- I2S (Tx, Rx), I2C (Tx, Rx) – (HUB and Ergo)
- MIPI and Parallel (Ergo)
- SPI & QSPI (HUB & Ergo)
- GPIO (HUB and Ergo)
- FTDI (JTAG, UART) (HUB)
- WiFi/BT (HUB)
- USB OTG (HUB)
• Computation Units:
- 3 x Ergo (55 TOPS/Watt + Arc CPUDSP)
- HUB STM32 MCU (Arm M7)
- ESP32 (2x Xtensa LX6)
• Memory:
- 16MB QSPI Flash (Ergo)
- 128MB QSPI Flash + 32MB SRAM (HUB)
- 448 KB ROM + 520 KB SRAM (ESP32)
- SDCard (HUB)
DAVID Platform Specifications
11
© 2022 XPERI
DAVID Toy PoC
12
© 2022 XPERI
microphones
camera
Thermal
LCDs
PIR
Speaker
Contacts
Wireless
charging
Boards, battery
& sensors
Current Ergo Vision Application
13
© 2022 XPERI
Face, Body & Hand
Detection
Facial Analytics FR CNN
Face Alignment
ERGO
x, y, w, h, confidence,
trackID
Facial Landmarks
Face Orientation
Face Expression
Face Embedding FR
x1,y1,
x2,y2
….
Tx, Ty, Rot, Scale
x, y, w, h
Body Analytics
Body Landmarks/Skeleton
Hand Analytics
Hand Gestures
Video Encoder
Encoded stream
1 2 3
4
5
6
Example Ergo Application
• Frame rate 30 fps
• Resolution 320x320
• Power ~100 mW
Fully neural video encoder (Ergo) and decoder (generic)
• Trained end-to-end
• Custom stream – data privacy
• Extra security can be added
• Y only currently but can be easily extended to color
• Enabler of other image enhancement technologies: colorization, super resolution
• Can enable smart monitoring
Video Encoding
14
© 2022 XPERI
ERGO
Video Encoder
Camera
MIPI/Parallel Stream Packing
Hub
Streaming App
Video Decoder
ONNX, TFLite, NNAPI
Mobile App
Decoded Frame
Hub
• Current Ergo board 3 application Text2Speech -> spectrogram generation +
vocoder
• Focus on comprehension, less on naturalness
• Next focus on: voice adaptation, voice cloning
• Extend to sound/music generation
Speech/Audio Neural Synthesis
15
© 2022 XPERI
powers magical and joyful
experiences for kids using speech technology
that is engaging, fun, and frictionless.
PLAY
DAVID Partners
NUIG C3I - Center for Computational,
Cognitive & Connected Imaging
© 2022 XPERI 16
• Smart Toy requirements:
• Privacy
• Battery life
• Multimodal interaction
• Platform requirements:
• Dedicated NN unit with very high OPs/W
• Communication unit
• Multiple sensor support
• Generic processing unit
• DAVID platform and toy PoC
• Available Q3/Q4 2022 for selected partners
Conclusions
17
© 2022 XPERI
Resources
• Xperi – www.Xperi.com
• Perceive, Ergo – www.perceive.io
• SoapBox Labs – www.soapboxlabs.com
• C3I, National University of Ireland, Galway - www.nuigalway.ie/c3i
• Disruptive Technologies Innovation Fund – DTIF
• STMicroelectronics STM32 MCU
• Espressif Systems ESP32
Resources
© 2022 XPERI 19

Weitere ähnliche Inhalte

Ähnlich wie “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05Gordon Kraft
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntelAPAC
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntelAPAC
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistData Con LA
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011Michael Heydt
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2Gordon Kraft
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideIsaac Rodriguez
 
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la ActualidadLaurence HR
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesDevGAMM Conference
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIntel IT Center
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Frank Carey
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioBill Wong
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung binusgamelab
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTJim McKeeth
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingDesign World
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development CompanyRuchir Kakkad
 

Ähnlich wie “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi (20)

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves Bergquist
 
google glass
google glassgoogle glass
google glass
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program Guide
 
Hololens
HololensHololens
Hololens
 
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la Actualidad
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobiles
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoT
 
Telepresence Cisco
Telepresence CiscoTelepresence Cisco
Telepresence Cisco
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensing
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development Company
 

Mehr von Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

Mehr von Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Kürzlich hochgeladen

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 

Kürzlich hochgeladen (20)

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

“A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

  • 1. A New AI Platform Architecture for the Smart Toys of the Future Gabriel Costache Senior R&D Director XPERI
  • 2. 40+ offices worldwide headquarters in San Jose, CA $1.5B + market cap public company, trading under XPER 1,600 + employees worldwide 1,500 + engineers 11,000 + patent assets 100B+ devices worldwide empowered by technologies delivered via Xperi brands
  • 3. • Safe • Secure • Private • Enhances child development • Uses natural interaction • Monitors child cognitive load • Develops with the child • Long battery life • Re-usable Ideal Smart Toy 3 © 2022 XPERI
  • 6. • Data privacy • Safety • Battery life • Fast response • AI technologies for children • Data bias in AI • Natural interaction with children • Multimodal: audio, imaging, sensing Smart Toy Challenges 6 © 2022 XPERI DTIF (Disruptive Technology Innovation Fund) D.A.V.I.D
  • 7. DAVID will develop a “privacy by design” AI platform, capable of multi-modal, ultra-low power consumption, “data center” level processing of audio and vision data on-device, without the need to transmit any personal data to the cloud. What DAVID will deliver to the smart toy market: • A platform for a wide range of learning and interactive applications in the toy market • A smart, trusted proof-of-concept toy using this platform that helps children learn and develop, using XPERI imaging technology, Perceive® Ergo® chip and SoapBox Labs speech technology capabilities in collaboration with the National University of Ireland, Galway. • Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities. DAVID – Data-center Audio/Video Intelligence on Device 7 © 2022 XPERI All-in-one Chip/Platform Designed for Privacy Multi-modal Platform Communication Speech, Expressions, Emotions, Gesture, Context and others..
  • 8. • Perception • Imaging/Vision • Face Analytics • Body Analytics • Hand Analytics • Video Compression • Thermal Imaging • Audio • Wake Words / VAD • Speech2Text / ASR • Voice Analytics / Biometrics • Sensing AI Technologies to be Considered 8 © 2022 XPERI • Interaction • Visual • Audio • Text2Speech • Sound Generation • Others • Language Models / Conversational Models • Multi Modal Intent • Cognitive and Behaviour Analysis • Personalization • Interactive Games
  • 9. Perceive® Ergo® AI Processor 9 Source: A Reuther et al. MIT Lincoln Laboratory Supercomputing Center-arXiv:2009.00993 Ergo* *Note: Ergo uses a proprietary representation. Ergo is not INT8. © 2022 XPERI
  • 11. • Interfaces: - I2S (Tx, Rx), I2C (Tx, Rx) – (HUB and Ergo) - MIPI and Parallel (Ergo) - SPI & QSPI (HUB & Ergo) - GPIO (HUB and Ergo) - FTDI (JTAG, UART) (HUB) - WiFi/BT (HUB) - USB OTG (HUB) • Computation Units: - 3 x Ergo (55 TOPS/Watt + Arc CPUDSP) - HUB STM32 MCU (Arm M7) - ESP32 (2x Xtensa LX6) • Memory: - 16MB QSPI Flash (Ergo) - 128MB QSPI Flash + 32MB SRAM (HUB) - 448 KB ROM + 520 KB SRAM (ESP32) - SDCard (HUB) DAVID Platform Specifications 11 © 2022 XPERI
  • 12. DAVID Toy PoC 12 © 2022 XPERI microphones camera Thermal LCDs PIR Speaker Contacts Wireless charging Boards, battery & sensors
  • 13. Current Ergo Vision Application 13 © 2022 XPERI Face, Body & Hand Detection Facial Analytics FR CNN Face Alignment ERGO x, y, w, h, confidence, trackID Facial Landmarks Face Orientation Face Expression Face Embedding FR x1,y1, x2,y2 …. Tx, Ty, Rot, Scale x, y, w, h Body Analytics Body Landmarks/Skeleton Hand Analytics Hand Gestures Video Encoder Encoded stream 1 2 3 4 5 6 Example Ergo Application • Frame rate 30 fps • Resolution 320x320 • Power ~100 mW
  • 14. Fully neural video encoder (Ergo) and decoder (generic) • Trained end-to-end • Custom stream – data privacy • Extra security can be added • Y only currently but can be easily extended to color • Enabler of other image enhancement technologies: colorization, super resolution • Can enable smart monitoring Video Encoding 14 © 2022 XPERI ERGO Video Encoder Camera MIPI/Parallel Stream Packing Hub Streaming App Video Decoder ONNX, TFLite, NNAPI Mobile App Decoded Frame Hub
  • 15. • Current Ergo board 3 application Text2Speech -> spectrogram generation + vocoder • Focus on comprehension, less on naturalness • Next focus on: voice adaptation, voice cloning • Extend to sound/music generation Speech/Audio Neural Synthesis 15 © 2022 XPERI
  • 16. powers magical and joyful experiences for kids using speech technology that is engaging, fun, and frictionless. PLAY DAVID Partners NUIG C3I - Center for Computational, Cognitive & Connected Imaging © 2022 XPERI 16
  • 17. • Smart Toy requirements: • Privacy • Battery life • Multimodal interaction • Platform requirements: • Dedicated NN unit with very high OPs/W • Communication unit • Multiple sensor support • Generic processing unit • DAVID platform and toy PoC • Available Q3/Q4 2022 for selected partners Conclusions 17 © 2022 XPERI
  • 19. • Xperi – www.Xperi.com • Perceive, Ergo – www.perceive.io • SoapBox Labs – www.soapboxlabs.com • C3I, National University of Ireland, Galway - www.nuigalway.ie/c3i • Disruptive Technologies Innovation Fund – DTIF • STMicroelectronics STM32 MCU • Espressif Systems ESP32 Resources © 2022 XPERI 19