• Overview of AI services
• Deploy a data science environment with the AWS Deep Learning AMI
• Leverage the power of Intel CPUs with NVIDIA Tesla M60 GPUs
• Train and deploy Deep Learning models at scale
6. How does it work?
Historical Data → Model Building → Prediction
What is my color?
And what is mine?
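The flow on this slide (historical data, model building, prediction) can be sketched with a toy example. The block below is only an illustration, assuming a 1-nearest-neighbor rule: the data points, the feature names `x` and `y`, and the functions `buildModel` and `predictColor` are all hypothetical and do not come from the talk.

```javascript
// Hypothetical historical data: two numeric features and a known color per point.
const historicalData = [
  { x: 1.0, y: 1.2, color: "red" },
  { x: 0.8, y: 0.9, color: "red" },
  { x: 4.1, y: 3.9, color: "blue" },
  { x: 3.8, y: 4.2, color: "blue" },
];

// "Model building": for 1-nearest-neighbor this is just storing the examples.
function buildModel(examples) {
  if (examples.length === 0) {
    throw new Error("need at least one example");
  }
  return { examples: examples.slice() };
}

// "Prediction": answer "what is my color?" with the color of the closest known point.
function predictColor(model, point) {
  let best = null;
  let bestDistance = Infinity;
  for (const example of model.examples) {
    const distance = Math.hypot(example.x - point.x, example.y - point.y);
    if (distance < bestDistance) {
      bestDistance = distance;
      best = example;
    }
  }
  return best.color;
}

const model = buildModel(historicalData);
console.log(predictColor(model, { x: 1.1, y: 1.0 })); // red
console.log(predictColor(model, { x: 4.0, y: 4.0 })); // blue
```

Real models generalize from the data rather than memorizing it, but the three stages stay the same.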
7. … but more data …
Less Data, More Data, Even More Data
Linear Models
Categorical Models
Bayesian Methods
Decision Trees
Neural Networks
Cluster Analysis
… More …
Kernel Based Methods
12. Amazon Rekognition
Deep learning-based image recognition service
Search, verify, and organize millions of images
• Object and Scene Detection
• Facial Analysis
• Face Comparison
• Facial Recognition
13. Face Comparison
Measure the likelihood that faces in two images are of the same person
• Add face verification to applications and devices
• Extend physical security controls
• Provide guest access to VIP-only facilities
• Verify users for online exams and polls
14. Amazon Rekognition API
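// AWS SDK for JavaScript (v2) clients used below
var AWS = require('aws-sdk');
var rekognition = new AWS.Rekognition();
var dynamodb = new AWS.DynamoDB();

var compareParams = {
  SimilarityThreshold: 90, // only return matches with at least 90% similarity
  SourceImage: { ... },
  TargetImage: { ... }
};
rekognition.compareFaces(compareParams, function(err, data) {
  if (err) {
    console.log(err, err.stack); // an error occurred
  } else if (data.FaceMatches.length > 0) {
    // a face matched: log its similarity, then get the item in DynamoDB
    console.log("Similarity: " + data.FaceMatches[0].Similarity);
    dynamodb.getItem(paramsItem, function(err, data) { ... });
  }
});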
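The callback above only reads the first entry of `FaceMatches`. As a rough sketch of how the response could be handled more defensively, the helper below picks the strongest match at or above a threshold. The function name `bestFaceMatch` and the sample response are hypothetical; the sample is hand-written to mimic the `FaceMatches` / `Similarity` fields used above and is not real service output.

```javascript
// Return the match with the highest Similarity that meets the threshold,
// or null when there is no such match (or no FaceMatches array at all).
function bestFaceMatch(response, minSimilarity) {
  const matches = (response && response.FaceMatches) || [];
  let best = null;
  for (const match of matches) {
    const meetsThreshold = match.Similarity >= minSimilarity;
    if (meetsThreshold && (best === null || match.Similarity > best.Similarity)) {
      best = match;
    }
  }
  return best;
}

// Hand-written sample in the shape of a compareFaces response (illustrative values).
const sampleResponse = {
  FaceMatches: [
    { Similarity: 93.5, Face: { Confidence: 99.9 } },
    { Similarity: 97.2, Face: { Confidence: 99.8 } },
  ],
  UnmatchedFaces: [],
};

const match = bestFaceMatch(sampleResponse, 90);
console.log(match ? "Similarity: " + match.Similarity : "No match");
```

Keeping this logic in a pure function makes it possible to test the decision separately from the network call.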
17. Deep Learning – Neural Network
Neural Network: Input → Hidden layers → Output
Computing systems inspired by biological neural networks, which learn
to do tasks by considering examples, generally without task-specific
programming
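To make the input, hidden layers, and output structure concrete, here is a minimal forward-pass sketch. It assumes fully connected layers with a sigmoid activation, and the weights are hand-picked for illustration; the names `denseLayer`, `forward`, and `network` are hypothetical. It shows only how an input flows through the layers, not the learning step, in which training would adjust the weights from examples.

```javascript
const sigmoid = (z) => 1 / (1 + Math.exp(-z));

// One fully connected layer:
// output[j] = sigmoid(biases[j] + sum over i of weights[j][i] * input[i])
function denseLayer(input, weights, biases) {
  return weights.map((row, j) =>
    sigmoid(row.reduce((sum, w, i) => sum + w * input[i], biases[j]))
  );
}

// Feed the input through each layer in turn.
function forward(input, layers) {
  return layers.reduce(
    (activation, layer) => denseLayer(activation, layer.weights, layer.biases),
    input
  );
}

// Hypothetical hand-picked weights: 2 inputs -> 2 hidden units -> 1 output.
const network = [
  { weights: [[1, -1], [-1, 1]], biases: [0, 0] }, // hidden layer
  { weights: [[1, 1]], biases: [-1] },             // output layer
];

const output = forward([0.5, 0.5], network);
console.log(output); // a single value between 0 and 1
```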
22. Apache MXNet
• Programmable: simple syntax, multiple languages
• Portable: highly efficient models for mobile and IoT
• High Performance: near-linear scaling across hundreds of GPUs
26. Amazon Polly: Life-like Speech Service
Converts text to life-like speech
• 47 voices
• 24 languages
• Low latency, real time
• Fully managed
27. TEXT: Market grew by > 20%.
WORDS: Market grew by more than twenty percent
PHONEMES: ˈmɑɹ.kət ˈgɹu baɪ ˈmoʊɹ ˈðæn ˈtwɛn.ti pɚ.ˈsɛnt
Stages shown: text processing, prosody contour, unit selection and adaptation (from a speech units inventory), prosody modification, streaming
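The text-processing step on this slide expands symbols and digits into words before phonemes are generated. The sketch below is a deliberately tiny illustration of that idea and is not how Amazon Polly is implemented: the functions `numberToWords` and `normalizeText` are hypothetical, and they handle only the ">" sign and whole percentages from 0 to 99.

```javascript
const ONES = [
  "zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine",
  "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen", "sixteen",
  "seventeen", "eighteen", "nineteen",
];
const TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy", "eighty", "ninety"];

// Spell out an integer from 0 to 99 (the only range this sketch supports).
function numberToWords(n) {
  if (!Number.isInteger(n) || n < 0 || n > 99) {
    throw new RangeError("sketch only handles integers 0..99");
  }
  if (n < 20) return ONES[n];
  const tens = TENS[Math.floor(n / 10)];
  return n % 10 === 0 ? tens : tens + "-" + ONES[n % 10];
}

// Expand ">" and one- or two-digit percentages into words.
function normalizeText(text) {
  return text
    .replace(/>\s*/g, "more than ")
    .replace(/\b(\d{1,2})\s*%/g, (_, digits) => numberToWords(Number(digits)) + " percent")
    .replace(/\s+/g, " ")
    .trim();
}

console.log(normalizeText("Market grew by > 20%."));
// Market grew by more than twenty percent.
```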
28. Markup language: SSML
Speech Synthesis Markup Language (SSML) is a W3C recommendation: an
XML-based markup language for speech synthesis applications
<speak>
My name is Coqueiro. It is spelled
<prosody rate='x-slow'>
<say-as interpret-as="characters">Coqueiro</say-as>
</prosody>
</speak>
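The SSML document above could be generated for any name and handed to Polly. The block below is a sketch under stated assumptions: `escapeXml`, `spellOutSsml`, and `pollyParams` are hypothetical helpers, and the voice "Joanna" and the mp3 output format are arbitrary choices for illustration. The parameter names follow Polly's SynthesizeSpeech operation; the actual service call is left out so the sketch runs offline.

```javascript
// Escape characters that have special meaning in XML.
function escapeXml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Build SSML that says a name and then spells it slowly, character by character.
function spellOutSsml(name) {
  const safe = escapeXml(name);
  return (
    "<speak>" +
    "My name is " + safe + ". It is spelled " +
    '<prosody rate="x-slow">' +
    '<say-as interpret-as="characters">' + safe + "</say-as>" +
    "</prosody>" +
    "</speak>"
  );
}

// Request parameters for SynthesizeSpeech; TextType "ssml" tells Polly
// to interpret Text as markup. VoiceId and OutputFormat are assumptions.
function pollyParams(ssml) {
  return { OutputFormat: "mp3", Text: ssml, TextType: "ssml", VoiceId: "Joanna" };
}

const params = pollyParams(spellOutSsml("Coqueiro"));
console.log(params.Text);
```

With the AWS SDK for JavaScript, an object like `params` would be passed to the Polly client's `synthesizeSpeech` call.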