SlideShare ist ein Scribd-Unternehmen logo
1 von 33
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Adam Larter
Principal Solutions Architect, Developer
Specialist, Amazon Web Services
Alastair Cousins
Senior Solutions Architect, Amazon Web Services
Harnessing Artificial Intelligence
in Your Applications
Amazon Rekognition, Amazon Polly, and Amazon Lex
Level 300
Intelligent Multimodal Interfaces
What is Amazon Polly?
• A service that converts text into lifelike speech
• Offers 47 lifelike voices across 24 languages
• Low latency responses enable developers to build
real-time systems
• Developers can store, replay, and distribute
generated speech
Amazon Polly: Quality
Natural-sounding speech
A subjective measure of how close TTS output is to human speech.
Accurate text processing
Ability of the system to interpret common text formats such
as abbreviations, numerical sequences, homographs etc.
Today in Sydney Australia, it's 26°C
It’s nice to know, we’re going to Nice
Highly intelligible
A measure of how comprehensible speech is.
Peter Piper picked a peck of pickled peppers
Amazon Polly: SSML
Speech Synthesis Markup Language
Is a W3C recommendation, an XML-based markup language for speech
synthesis applications
<speak>
My name is Adam Larter. It is spelled
<prosody rate='x-slow'>
<say-as interpret-as="characters">Larter</say-as>
</prosody>
</speak>
Example Use Case
Adding speech synthesis to any app
Polly Voice Synthesis Demo
Amazon Polly
Amazon API
Gateway
Lambda
function
Amazon
S3
Mobile App
IoT Device
Calling through API Gateway
allows us to implement caching
and use throttling and API
Keys via Usage Plans
Images – Another Untapped Interface
Amazon Rekognition
Deep learning-based image recognition service
Search, verify, and organise millions of images
Object and Scene
Detection
Facial
Analysis
Face
Comparison
Facial
Recognition
Amazon Rekognition
Deep learning-based image recognition service
Search, verify, and organise millions of images
Object and Scene
Detection
Facial
Analysis
Face
Comparison
Facial
Recognition
Detecting Faces in a Crowd
IoT
Camera
Amazon
Rekognition
Lambda
function
Amazon API
Gateway
DetectFaces()
Image
with
Faces
"Emotions": [
{"Confidence": 99.1335220336914,
"Type": "HAPPY" },
{"Confidence": 3.3275485038757324,
"Type": "CALM"},
{"Confidence": 0.31517744064331055,
"Type": "SAD"}
],
"Eyeglasses": {"Confidence": 99.8050537109375,
"Value": false},
"EyesOpen": {Confidence": 99.99979400634766,
"Value": true},
Understanding Bounding Boxes
Turn Ratios into X/Y
co-ordinates:
multiply by the image
width/height
"BoundingBox": {
"Height": 0.3449999988079071,
"Left": 0.09666666388511658,
"Top": 0.27166667580604553,
"Width": 0.23000000417232513
},
Tip: Capture Additional Context
Introduce a coefficient to
capture additional image
context by inflating
the bounding box
Cropping Faces
Scaling to Many Faces
Amazon
Rekognition
Lambda
function Amazon
ElasticSearch
Amazon
SNS
Lambda
function
Amazon
S3
User’s Face
Image
Fan Out of Lambda Functions via SNS.
1 Notification per Face detected
Metadata from DetectFaces() +
S3 Object Ref to Face Image
Metadata +
Location +
Timestamp
User’s Face
Image
Example Use Case
Authentication using face image
Sign In Using Face
• Cognito User Pools (CUP) as System of Record for users
• Create a Developer-Authenticated Identity Provider (IdP)
to perform AuthN using Amazon Rekognition
• Federate CUP and Developer IdP through
Cognito Identity Federation
• CUP user names are unique – make use of the
ExternalId parameter in indexFaces()
CUP and Developer
Authenticated Identities
will be linked after this call
Linking Identities in Cognito Federation
Amazon
Cognito
User Pool
Username and password sent to
Cognito User Pools Identity Provider
Link Face to Cognito User
Mobile
App
Amazon
Cognito
User Pool
Cognito Identity Token returned
Link Face to Cognito User
Mobile
App
Amazon Cognito
Identity Pool
Cognito Identity Token
Link Face to Cognito User
Mobile
App
User’s Face
Image
Amazon API
Gateway
Lambda
functionUser’s Face
Image
+
Cognito User Pool username
stored in the Rekognition
collection as the ExternalId for
the user’s face vector
Amazon
Rekognition
username
as ExternalId
Store in
Collection
Identities linked by call to
getOpenIdTokenForDeveloperIdentity()
Amazon Cognito
Identity Pool
Mobile
App
User’s Face
Image
Amazon API
Gateway
Lambda
functionUser’s Face
Image
ExternalId used as the unique user identifier in call to
CognitoIdentity::getOpenIdTokenForDeveloperIdentity
Amazon
Rekognition
Sign In Using Face FaceId +
ExternalId
AccessKeyId / SecretAccessKey / SessionToken
Sign In Using Face – Implementation
Linking face to Cognito User:
• Sign in first using Cognito User Pools via Cognito SDK
• Take user’s picture & send image with JWT
• Rekognition::indexFaces()
to store user’s face vector in collection and use
Cognito User Pools username as the External Id
• CognitoIdentity::getOpenIdTokenForDeveloperIdentity
to create a Cognito Token and link the identities together
Sign In Using Face – Implementation
Sign in using face:
• Rekognition::searchFacesByImage()
to get External Id
• Cognito::getOpenIdTokenForDeveloperIdentity()
with retrieved External Id to generate the Token and Identity
Id the client app needs
• Client app then follows standard Cognito process using
CognitoCachingCredentialsProvider()
Amazon Lex
AWS
Lambda
Polly Amazon
CloudWatch
Monitoring
Text
Speech
Text
Amazon
DynamoDB
AWS IoT
Amazon API
Gateway
Conversational Interfaces
Applications
Walkthrough
Lex Bot Creation Process
Example Use Case
The Smart Assistant
Smart Assistant - Key Features
• Triggers using any type of input, not just speech
− This demo uses a camera, and on-device face detection with
OpenCV – http://opencv.org
• Hot word detection to get device’s attention
− Snowboy - https://snowboy.kitt.ai/
• Silence detection during live speech capture
− SoX - http://sox.sourceforge.net/
• NLU provided by Amazon Lex
− Speech input SDK not yet available
− Don’t let that stop you calling the API directly!
Smart Assistant
Wait for Hot Word
(Snowboy)
Wait for Face to
appear in camera view
Listen for audio
command
START
Smart Assistant
Wait for Face to
appear in camera view
Capture image from
webcam
(fswebcam)
Recognise Face
(Amazon Rekognition)
Resize to improve
process effiiency
(Imagemagick)
Detect face on device
(OpenCV)
Known User State
Replay Audio
Is the face
in the
collection?
YES
NO
Run User Speech
Dialogue Interaction
and NLU
Smart Assistant
Process intent
(API Gateway/Lambda)
Listen for speech input
with silence detection
(SoX)
Play audio response &
loop back to listen for
speech input
Construct Lex payload
and submit to API
(HTTPS Request)
Parse response
headers
YES
Run User Speech
Dialogue Interaction
and NLU
Is the
interaction
Ready for
Fulfillment
?
NO
Listen for speech input
with silence detection
(SoX)
Thank you!

Weitere ähnliche Inhalte

Ähnlich wie Harnessing Artificial Intelligence in your Applications - Level 300

Xuedong Huang - Deep Learning and Intelligent Applications
Xuedong Huang - Deep Learning and Intelligent ApplicationsXuedong Huang - Deep Learning and Intelligent Applications
Xuedong Huang - Deep Learning and Intelligent ApplicationsMachine Learning Prague
 
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...Amazon Web Services
 
MCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionMCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionAmazon Web Services
 
Proposal -co_win_india_valardigital-converted
Proposal  -co_win_india_valardigital-convertedProposal  -co_win_india_valardigital-converted
Proposal -co_win_india_valardigital-convertedUpendraSharma53
 
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)Amazon Web Services Korea
 
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...Amazon Web Services
 
Deep learning-based image recognition: Intro to Amazon Rekognition:
Deep learning-based image recognition: Intro to Amazon Rekognition: Deep learning-based image recognition: Intro to Amazon Rekognition:
Deep learning-based image recognition: Intro to Amazon Rekognition: Amazon Web Services
 
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...Amazon Web Services
 
Aws cloud computing conference
Aws cloud computing conferenceAws cloud computing conference
Aws cloud computing conferenceAnjani Phuyal
 
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...Build, train, and deploy machine learning models at scale - AWS Summit Cape T...
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...Amazon Web Services
 
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...Amazon Web Services
 
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMaker
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMakerIntelligence of Things: IoT, AWS DeepLens and Amazon SageMaker
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMakerAmazon Web Services
 
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017Amazon Web Services
 
AI and Innovations on AWS
AI and Innovations on AWSAI and Innovations on AWS
AI and Innovations on AWSAdrian Hornsby
 
Biometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWSBiometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWSRoberto Falconi
 
Scaling Machine Learning from zero to millions of users (May 2019)
Scaling Machine Learning from zero to millions of users (May 2019)Scaling Machine Learning from zero to millions of users (May 2019)
Scaling Machine Learning from zero to millions of users (May 2019)Julien SIMON
 

Ähnlich wie Harnessing Artificial Intelligence in your Applications - Level 300 (20)

Xuedong Huang - Deep Learning and Intelligent Applications
Xuedong Huang - Deep Learning and Intelligent ApplicationsXuedong Huang - Deep Learning and Intelligent Applications
Xuedong Huang - Deep Learning and Intelligent Applications
 
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...
AWS Machine Learning Week SF: Build an Image-Based Automatic Alert System wit...
 
MCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionMCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon Rekognition
 
Proposal -co_win_india_valardigital-converted
Proposal  -co_win_india_valardigital-convertedProposal  -co_win_india_valardigital-converted
Proposal -co_win_india_valardigital-converted
 
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)
AWS Media Day- AWS 인공 지능 서비스를 활용한 미디어 서비스 개발화 (김기완 솔루션즈 아키텍트)
 
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...
Intro to Amazon Lightsail and Launching Your First Application on Amazon Ligh...
 
Deep learning-based image recognition: Intro to Amazon Rekognition:
Deep learning-based image recognition: Intro to Amazon Rekognition: Deep learning-based image recognition: Intro to Amazon Rekognition:
Deep learning-based image recognition: Intro to Amazon Rekognition:
 
Are API Services Taking Over All the Interesting Data Science Problems?
Are API Services Taking Over All the Interesting Data Science Problems?Are API Services Taking Over All the Interesting Data Science Problems?
Are API Services Taking Over All the Interesting Data Science Problems?
 
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...
AWS DeepLens Workshop: Building Computer Vision Applications - BDA201 - Atlan...
 
Aws cloud computing conference
Aws cloud computing conferenceAws cloud computing conference
Aws cloud computing conference
 
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...Build, train, and deploy machine learning models at scale - AWS Summit Cape T...
Build, train, and deploy machine learning models at scale - AWS Summit Cape T...
 
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...
New Artificial Intelligence and IoT Services (Lex, Polly, Rekognition, Greeng...
 
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMaker
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMakerIntelligence of Things: IoT, AWS DeepLens and Amazon SageMaker
Intelligence of Things: IoT, AWS DeepLens and Amazon SageMaker
 
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
 
AI and Innovations on AWS
AI and Innovations on AWSAI and Innovations on AWS
AI and Innovations on AWS
 
Biometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWSBiometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWS
 
Scaling Machine Learning from zero to millions of users (May 2019)
Scaling Machine Learning from zero to millions of users (May 2019)Scaling Machine Learning from zero to millions of users (May 2019)
Scaling Machine Learning from zero to millions of users (May 2019)
 
AI on a PI
AI on a PIAI on a PI
AI on a PI
 
Moving forward with AI
Moving forward with AIMoving forward with AI
Moving forward with AI
 
Building mobile apps on aws
Building mobile apps on awsBuilding mobile apps on aws
Building mobile apps on aws
 

Mehr von Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Kürzlich hochgeladen

Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Kürzlich hochgeladen (20)

Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

Harnessing Artificial Intelligence in your Applications - Level 300

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Adam Larter Principal Solutions Architect, Developer Specialist, Amazon Web Services Alastair Cousins Senior Solutions Architect, Amazon Web Services Harnessing Artificial Intelligence in Your Applications Amazon Rekognition, Amazon Polly, and Amazon Lex Level 300
  • 3. What is Amazon Polly? • A service that converts text into lifelike speech • Offers 47 lifelike voices across 24 languages • Low latency responses enable developers to build real-time systems • Developers can store, replay, and distribute generated speech
  • 4. Amazon Polly: Quality Natural-sounding speech A subjective measure of how close TTS output is to human speech. Accurate text processing Ability of the system to interpret common text formats such as abbreviations, numerical sequences, homographs etc. Today in Sydney Australia, it's 26°C It’s nice to know, we’re going to Nice Highly intelligible A measure of how comprehensible speech is. Peter Piper picked a peck of pickled peppers
  • 5. Amazon Polly: SSML Speech Synthesis Markup Language Is a W3C recommendation, an XML-based markup language for speech synthesis applications <speak> My name is Adam Larter. It is spelled <prosody rate='x-slow'> <say-as interpret-as="characters">Larter</say-as> </prosody> </speak>
  • 6. Example Use Case Adding speech synthesis to any app
  • 7. Polly Voice Synthesis Demo Amazon Polly Amazon API Gateway Lambda function Amazon S3 Mobile App IoT Device Calling through API Gateway allows us to implement caching and use throttling and API Keys via Usage Plans
  • 8. Images – Another Untapped Interface
  • 9. Amazon Rekognition Deep learning-based image recognition service Search, verify, and organise millions of images Object and Scene Detection Facial Analysis Face Comparison Facial Recognition
  • 10. Amazon Rekognition Deep learning-based image recognition service Search, verify, and organise millions of images Object and Scene Detection Facial Analysis Face Comparison Facial Recognition
  • 11.
  • 12. Detecting Faces in a Crowd IoT Camera Amazon Rekognition Lambda function Amazon API Gateway DetectFaces() Image with Faces "Emotions": [ {"Confidence": 99.1335220336914, "Type": "HAPPY" }, {"Confidence": 3.3275485038757324, "Type": "CALM"}, {"Confidence": 0.31517744064331055, "Type": "SAD"} ], "Eyeglasses": {"Confidence": 99.8050537109375, "Value": false}, "EyesOpen": {Confidence": 99.99979400634766, "Value": true},
  • 13. Understanding Bounding Boxes Turn Ratios into X/Y co-ordinates: multiply by the image width/height "BoundingBox": { "Height": 0.3449999988079071, "Left": 0.09666666388511658, "Top": 0.27166667580604553, "Width": 0.23000000417232513 },
  • 14. Tip: Capture Additional Context Introduce a coefficient to capture additional image context by inflating the bounding box
  • 16. Scaling to Many Faces Amazon Rekognition Lambda function Amazon ElasticSearch Amazon SNS Lambda function Amazon S3 User’s Face Image Fan Out of Lambda Functions via SNS. 1 Notification per Face detected Metadata from DetectFaces() + S3 Object Ref to Face Image Metadata + Location + Timestamp User’s Face Image
  • 18. Sign In Using Face • Cognito User Pools (CUP) as System of Record for users • Create a Developer-Authenticated Identity Provider (IdP) to perform AuthN using Amazon Rekognition • Federate CUP and Developer IdP through Cognito Identity Federation • CUP user names are unique – make use of the ExternalId parameter in indexFaces()
  • 19. CUP and Developer Authenticated Identities will be linked after this call Linking Identities in Cognito Federation
  • 20. Amazon Cognito User Pool Username and password sent to Cognito User Pools Identity Provider Link Face to Cognito User Mobile App
  • 21. Amazon Cognito User Pool Cognito Identity Token returned Link Face to Cognito User Mobile App
  • 22. Amazon Cognito Identity Pool Cognito Identity Token Link Face to Cognito User Mobile App User’s Face Image Amazon API Gateway Lambda functionUser’s Face Image + Cognito User Pool username stored in the Rekognition collection as the ExternalId for the user’s face vector Amazon Rekognition username as ExternalId Store in Collection Identities linked by call to getOpenIdTokenForDeveloperIdentity()
  • 23. Amazon Cognito Identity Pool Mobile App User’s Face Image Amazon API Gateway Lambda functionUser’s Face Image ExternalId used as the unique user identifier in call to CognitoIdentity::getOpenIdTokenForDeveloperIdentity Amazon Rekognition Sign In Using Face FaceId + ExternalId AccessKeyId / SecretAccessKey / SessionToken
  • 24. Sign In Using Face – Implementation Linking face to Cognito User: • Sign in first using Cognito User Pools via Cognito SDK • Take user’s picture & send image with JWT • Rekognition::indexFaces() to store user’s face vector in collection and use Cognito User Pools username as the External Id • CognitoIdentity::getOpenIdTokenForDeveloperIdentity to create a Cognito Token and link the identities together
  • 25. Sign In Using Face – Implementation Sign in using face: • Rekognition::searchFacesByImage() to get External Id • Cognito::getOpenIdTokenForDeveloperIdentity() with retrieved External Id to generate the Token and Identity Id the client app needs • Client app then follows standard Cognito process using CognitoCachingCredentialsProvider()
  • 26. Amazon Lex AWS Lambda Polly Amazon CloudWatch Monitoring Text Speech Text Amazon DynamoDB AWS IoT Amazon API Gateway Conversational Interfaces Applications
  • 28. Example Use Case The Smart Assistant
  • 29. Smart Assistant - Key Features • Triggers using any type of input, not just speech − This demo uses a camera, and on-device face detection with OpenCV – http://opencv.org • Hot word detection to get device’s attention − Snowboy - https://snowboy.kitt.ai/ • Silence detection during live speech capture − SoX - http://sox.sourceforge.net/ • NLU provided by Amazon Lex − Speech input SDK not yet available − Don’t let that stop you calling the API directly!
  • 30. Smart Assistant Wait for Hot Word (Snowboy) Wait for Face to appear in camera view Listen for audio command START
  • 31. Smart Assistant Wait for Face to appear in camera view Capture image from webcam (fswebcam) Recognise Face (Amazon Rekognition) Resize to improve process effiiency (Imagemagick) Detect face on device (OpenCV) Known User State Replay Audio Is the face in the collection? YES NO Run User Speech Dialogue Interaction and NLU
  • 32. Smart Assistant Process intent (API Gateway/Lambda) Listen for speech input with silence detection (SoX) Play audio response & loop back to listen for speech input Construct Lex payload and submit to API (HTTPS Request) Parse response headers YES Run User Speech Dialogue Interaction and NLU Is the interaction Ready for Fulfillment ? NO Listen for speech input with silence detection (SoX)