SlideShare a Scribd company logo
1 of 67
Download to read offline
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
James Chiang (蔣宗恩)
Solutions Architect, Amazon Web Services
AWS AI & ML:
Overview of the 10 Most Common ML
Algorithms
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
Agenda
• AI Quick Introduction
• Supervised Learning Algorithm
• Unsupervised Learning Algorithm
• How Sagemaker Run Algorithm
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
I Will Be Back !!!!
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AI, Machine Learning, and Deep Learning
Fast Compute
Ubiquitous Data
Advanced Learning Algorithms
Key Drivers for Deep Learning
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
Types of Machine Learning
Supervised
Human Intervention and validation required
Photo classification or tagging
Vs.
Unsupervised
No human intervention required
Auto-classification of documents based on context
Supervised Learning
• Training data provides “examples” and “outcomes”
• The machine learns to predict the outcome of new data based on the past
examples
Supervised Learning
Training data has one feature that is the “outcome”
Split the data into a training and test set
Model the training set / Predict the test test
Compares the predictions to the known values
Algorithms
Model / Ensemble
Logistic Regression
Time Series
Sometimes referred to as the “label” or “objective”
Goal is to build a model which can predict the outcome
If categorical: model is a “classification” or model is a “regression”
Because the data has a known value, model can be evaluated
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Unsupervised Learning
Training data provides “examples” - no specific “outcome”
The machine tries to find “interesting” patterns in the data
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Unsupervised Learning
Training data has only examples and no specific “outcome”
This is common - labels are typically expensive
Goal is to perform discovery, find patterns, etc
Tends to be more difficult
Algorithms
Clusters
Anomaly Detection
Association Discovery
Topic Models
Because the data has no “outcome”, can not be evaluated
Each discovery method has it’s own quality measures
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Data Visualization &
Analysis
Business Problem –
ML problem framing Data Collection
Data Integration
Data Preparation &
Cleaning
Feature Engineering
Model Training &
Parameter Tuning
Model Evaluation
Are Business
Goals met?
Model Deployment
Monitoring &
Debugging
– Predictions
YesNo
DataAugmentation
Feature
Augmentation
The Machine Learning Process
Re-training
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Introducing: Amazon SageMaker
A managed service
that provides the quickest and easiest way for
your data scientists and developers to get
ML models from idea to production.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Data Visualization &
Analysis
Business Problem –
ML problem framing Data Collection
Data Integration
Data Preparation &
Cleaning
Feature Engineering
Model Training &
Parameter Tuning
Model Evaluation
Are Business
Goals met?
Model Deployment
Monitoring &
Debugging
– Predictions
YesNo
DataAugmentation
Feature
Augmentation
Retraining
• Setup and manage
Notebook Environments
• Setup and manage
Training Clusters
• Write Data Connectors
• Scale ML algorithms to
large datasets
• Distribute ML training
algorithm to multiple
machines
• Secure Model artifacts
Why We built Amazon SageMaker: The Model Training Undifferentiated Heavy Lifting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Built-in ML Algorithm
h t t p s : / / d o c s . a w s . a m a z o n . c o m / s a g e m a k e r / l a t e s t / d g / a l g o s . h t m l
Problem Algorithm Learning Typ
Discrete Classification,
Regression
Linear Learner Supervised
XGBoost Algorithm Supervised
Discrete Recommendations Factorization Machines Supervised
Image Classification Image Classification Algorithm Supervised, CNN
Neural Machine Translation Sequence to Sequence Supervised, seq2seq
Time-series Prediction DeepAR Supervised, RNN
Discrete Groupings K-Means Algorithm Unsupervised
Dimensionality Reduction PCA (Principal Component Analysis) Unsupervised
Topic Determination Latent Dirichlet Allocation (LDA) Unsupervised
Neural Topic Model (NTM) Unsupervised,
Neural Network Based
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Supervised Learning Algorithm
Linear Learner
Regression:
Estimate a real valuedfunction
Binary classification:
Predict a 0/1 class
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Train
Fit thresholds
and select
Select modelwith best validation performance
Linear Learner
More than 8x speed-up over naive parallel training!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Linear Learner
Regression (mean squarederror)
SageMaker Other
1.02 1.06
1.09 1.02
0.332 0.183
0.086 0.129
83.3 84.5
Classification (F1 Score)
SageMaker Other
0.980 0.981
0.870 0.930
0.997 0.997
0.978 0.964
0.914 0.859
0.470 0.472
0.903 0.908
0.508 0.508
30-GB datasets for web-spam and web-URL classification
1.2
1
0.8
0.6
0.4
0.2
0
0 5 10 20 25 30
CostinDollars
15
Billable time inMinutes
sagemaker-url sagemaker-spam other-url other-spam
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Boosted Decision Trees
Throughput vs. number of machinesXGBoost is one of the most
commonly used
implementations of boosted
decision trees in theworld.
It is now available inAmazon
SageMaker!
400
200
0
600
800
1000
1200
1400
0 10 60 70
ThroughputinMB/Sec
20 30 40 50
Number of machines (C4.8xLarge)
Regression Tree (CART)
When Trees forms a Forest(Tree Ensembles)
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
split
weight
Training Loss measures how
Well model fit on training data
Regularization, measures
complexity of trees
XGboots introduction
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
XGboots introduction
Factorization Machines
Log_loss F1 Score Seconds
SageMaker 0.494 0.277 820
Other (10 Iter) 0.516 0.190 650
Other (20 Iter) 0.507 0.254 1300
Other (50 Iter) 0.481 0.313 3250
Click Prediction 1-TBadvertising dataset,
m4.4xlarge machines, perfect scaling
$200.00
$180.00
$160.00
$140.00
$120.00
$100.00
$80.00
$60.00
$40.00
$20.00
$-
1 2 3 6 7 8
Costindollars
4 5
Billable time in hours
sa
50 40 30 20 10
m chines machine s machin
e
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Where factorization is useful
A user movie rating table
Matrix factorization
Predictions for recommendations
Star Wars Inception Godfather Notebook
U1 4.97 2.98 2.18 0.98
U2 3.97 2.4 1.97 0.99
U3 1.02 0.93 5.02 4.93
U4 1.00 0.85 4.59 3.93
U5 1.36 1.07 4.89 4.12
Sequence to Sequence problems
0
5
10
15
20
25
0 5 10 15 20 25
BLEUScore
Billable Time in HoursP2.16x P2.8x P2.x
Best known result!
Based on Sockeye and Apache
incubated MxNet, Multi-GPU,
and can be used for Neural
Machine Translation.
Supports both RNN/CNN
as encoder/decoder
RNN Introduction
Ex: Arrive Taipei at November 2nd
Leave Taipei at November 2nd
LSTM
Sequence to Sequence problems
(1) Vanilla mode of processing without RNN, from fixed-sized input to fixed-sized output (e.g. image classification).
(2) Sequence output (e.g. image captioning takes an image and outputs a sentence of words).
(3) Sequence input (e.g. sentiment analysis where a given sentence is classified as expressing positive
or negative sentiment).
(4) Sequence input and sequence output (e.g. Machine Translation: an RNN reads a sentence in English and
then outputs a sentence in French).
(5) Synced sequence input and output (e.g. video classification where we wish to label each frame of the video).
Image Classification
Implementation in MxNet of
ResNet.
Other networks such as
DenseNet and Inception will
be added in the future.
Transfer learning: begin with
a model already trained on
ImageNet!
0
0.5
1
1.5
2
2.5
3
3.5
0 1 2 3 4 5
Speedup
Number of Machine (P2)
Speedup with Horizontal Scaling
Image Classification problem
Resnet
IMAGENET Challenge
11 million labelled image
across 11000 categories!
Transfer Learning process
DeepAR
1. DeepAR model effectively learns a global model from related
time series
2. Is able to lean complex patterns such as seasonality and
uncertainty growth over time from the data
3. Interestingly, the method works with little or no hyper
parameters tuning on wide variety of dataset, and in is
applicable to medium-size dataset containing only few hundred
time series
4. Scales up to datasets comprising 100000+ time series
Time series global model for related time series
Related time series
e.g. demand for various
Product sold by you
Time Series Forecasting
Mean absolute
percentage error
P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits
of websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
One hour on p2.xlarge, $1
Input
Network
Types of forecasts
1. Cold start forecasting
2. Probability forecasts
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Unsupervised Learning Algorithm
K-Means Clustering
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
K-Means Clustering
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Method Accurate? Passes Efficient
tuning
Comments
Lloyds [1] Yes* 5–10 No
K-Means ++ [2] Yes k+5 to k+10 No scikit-learn
K-Means|| [3] Yes 7–12 No spark.ml
Online [4] No 1 No
Streaming [5,6] No 1 No Impractical
Webscale [7] No 1 No Spark streaming
Coresets [8] No 1 Yes Impractical
Amazon SageMaker Yes 1 Yes
1 Lloyd, IEEE TIT, 1982
2 Arthur et. al. ACM-SIAM, 2007
3 Bahmani et.al.,VLDB, 2012
4 Liberty et. al., 2015
5 Shindler et. al, NIPS, 2011
6 Guha et. al, IEEE Trans. Knowl. Data Eng. 2003
7 Sculley, WWW, 2010
8 Feldman et. al.
2
1
0
3
6
5
4
7
8
10 500
BillableTimeinMinutes
100
Number of clusters
sagemaker other
K-Means Clustering
9.77E2
Running time vs. Number ofclustersk SageMaker Other
Text
1.2 GB
10 1.18E3 1.18E3
100 1.00E3
500 9.18.E2 9.03E2
Images
9 GB
10 3.29E2 3.28E2
100 2.72E2 2.71E2
500 2.17E2 Failed
Videos
27 GB
10 2.19E2 2.18E2
100 2.03E2 2.02E2
500 1.86E2 1.85E2
Advertising
127 GB
10 1.72E7 Failed
100 1.30E7 Failed
500 1.03E7 Failed
Synthetic
1,100 GB
10 3.81E7 Failed
100 3.51E7 Failed
500 2.81E7 Failed
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
~10x faster!
Principal Component Analysis (PCA)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Principal Component Analysis (PCA)
Data Distribution
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
The variation for PC1 = 15,
The variation for PC2 = 3
Principal Component Analysis (PCA)
Visual explanation
Principal Component Analysis (PCA)
More than 10x
fa
ster
at afractionthe cost!
0.00
20.00
40.00
60.00
80.00
100.00
120.00
8 20
Mb/Sec/Machine
10
Number of machines
other sagemaker-deterministic sagemaker-randomized
Cost vs.time Throughput and scalability
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
0 5 10 35 40 45
CostinDollars
other
15 20 25 30
Billable time inminutes
sagemaker-deterministic sagemaker-randomized
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Spectral LDA
Training Time vs. Number of Topics
0
50
100
150
200
250
0 20 40 60 80 100TrainingTimeinMinutes
Number of Topics
lda-data-a lda-data-b other-data-a other-data-b
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
Topic eat sleep play meow bark
Topic 1 0.1 0.3 0.2 0.4 0.0
Topic 2 0.2 0.1 0.4 0.0 0.3
Sample Output
About cats, as cats like
To meow and sleep
About dogs, as dogs like
To play and bark
Neural Topic Modeling
• Perplexity vs. number of topics
Input term counts vector
Document
posterior
Sampled document
representation
Decoder:
Softmax
Output term counts vector
6000
4000
2000
0
8000
10000
12000
0 50 150 200
Perplexity
100
Number oftopics
NTM Other
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using Amazon SageMaker
Sagemaker Jupyter Notebook
Input data
From the Command Line
profile=<your_profile>
arn_role=<your_arn_role>
training_image=382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1
training_job_name=clustering_text_documents_`date '+%Y_%m_%d_%H_%M_%S'` aws --
profile $profile 
--region us-east-1 
sagemaker create-training-job 
--training-job-name $training_job_name 
--algorithm-specification TrainingImage=$training_image,TrainingInputMode=File
--hyper-parameters k=10,feature_dim=1024,mini_batch_size=1000 
--role-arn $arn_role 
--input-data-config '{"ChannelName": "train", "DataSource":
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train",
"S3DataDistributionType": "ShardedByS3Key"}}, "CompressionType": "None", "RecordWrapperType": "None"}'

--output-data-config S3OutputPath=s3://training_output/$training_job_name
--resource-config InstanceCount=2,InstanceType=ml.c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInSeconds=3600
Hardware
Algorithm
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
From Amazon EMR
Hardware
Parameters
Start training
Apply model
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
From Amazon SageMaker Notebooks
Hardware
Parameters
Start training
Host model
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Automatic Model Tuning
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ML compute by the
second starting
at $0.0464/hr
ML storage by the
second
at $0.14
per GB-month
Data processed in
notebooks and hosting
at $0.016 per GB
Free trial to get
started quickly
Pay-as -You-Go (Inexpensive, Too)
Amazon SageMaker – Try It Free
aws.amazon.com/sagemaker
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon ML Lab
Lots of companies
doing Machine
Learning
Unable to unlock
business potential
Brainstorming Modeling Teaching
Lack ML
expertise
Leverage Amazon experts with decades of ML
experience with technologies like Amazon Echo,
Amazon Alexa, Prime Air, and Amazon Go
Amazon ML Lab
provides the missing
ML expertise
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!

More Related Content

What's hot

如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)
如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)
如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)Amazon Web Services
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAmazon Web Services
 
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...Amazon Web Services
 
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...Amazon Web Services
 
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Amazon Web Services
 
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianSRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianAmazon Web Services
 
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...Amazon Web Services
 
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...Amazon Web Services
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudAmazon Web Services
 
AWS re:Invent 2017 Recap - Solutions Updates
AWS re:Invent 2017 Recap - Solutions UpdatesAWS re:Invent 2017 Recap - Solutions Updates
AWS re:Invent 2017 Recap - Solutions UpdatesAmazon Web Services
 
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Amazon Web Services
 
建構全球跨區域 x Active-Active架構的無伺服器化後台服務
建構全球跨區域  x Active-Active架構的無伺服器化後台服務建構全球跨區域  x Active-Active架構的無伺服器化後台服務
建構全球跨區域 x Active-Active架構的無伺服器化後台服務Amazon Web Services
 
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...Amazon Web Services
 
Build, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleBuild, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleAmazon Web Services
 
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Amazon Web Services
 
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...Amazon Web Services
 
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...Amazon Web Services
 
Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Amazon Web Services
 
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018Amazon Web Services
 

What's hot (20)

如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)
如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)
如何以 serverless 架構打造快速回應客戶需求的零售情境 (Level: 200)
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMaker
 
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...
Deep Learning Applications Using TensorFlow, ft. Advanced Microgrid Solutions...
 
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
 
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
 
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianSRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
 
Managed NoSQL databases
Managed NoSQL databasesManaged NoSQL databases
Managed NoSQL databases
 
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...
Building an AWS Greengrass Machine Learning Solution at the Edge (IOT403-R1) ...
 
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...
How Peak.AI Uses Amazon SageMaker for Product Personalization (GPSTEC316) - A...
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the Cloud
 
AWS re:Invent 2017 Recap - Solutions Updates
AWS re:Invent 2017 Recap - Solutions UpdatesAWS re:Invent 2017 Recap - Solutions Updates
AWS re:Invent 2017 Recap - Solutions Updates
 
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
 
建構全球跨區域 x Active-Active架構的無伺服器化後台服務
建構全球跨區域  x Active-Active架構的無伺服器化後台服務建構全球跨區域  x Active-Active架構的無伺服器化後台服務
建構全球跨區域 x Active-Active架構的無伺服器化後台服務
 
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...
Better, Faster, Cheaper – Cost Optimizing Compute with Amazon EC2 Fleet #savi...
 
Build, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleBuild, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at Scale
 
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
 
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...
Migrate from Netezza to Amazon Redshift: Best Practices with Financial Engine...
 
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...
Building State-of-the-Art Computer Vision Models Using MXNet and Gluon (AIM36...
 
Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances
 
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018
MySQL High Availability & Disaster Recovery (DAT361) - AWS re:Invent 2018
 

Similar to Amazon SageMaker 內建機器學習演算法 (Level 400)

Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Amazon Web Services
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitAmazon Web Services
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Amazon Web Services
 
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Amazon Web Services
 
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...Amazon Web Services
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsJulien SIMON
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Amazon Web Services
 
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Amazon Web Services
 
From Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerFrom Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerAmazon Web Services
 
Quickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleQuickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleAWS Germany
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...Amazon Web Services Korea
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Amazon Web Services
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitAmazon Web Services
 
An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)Julien SIMON
 
Amazon SageMaker (December 2018)
Amazon SageMaker (December 2018)Amazon SageMaker (December 2018)
Amazon SageMaker (December 2018)Julien SIMON
 
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...Codiax
 
From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerAmazon Web Services
 

Similar to Amazon SageMaker 內建機器學習演算法 (Level 400) (20)

Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
 
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
 
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...
Speed up your Machine Learning workflows with built-in algorithms - Tel Aviv ...
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithms
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
 
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
 
From Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerFrom Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMaker
 
Quickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleQuickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scale
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
 
Amazon SageMaker
Amazon SageMakerAmazon SageMaker
Amazon SageMaker
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
 
An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)
 
Amazon SageMaker (December 2018)
Amazon SageMaker (December 2018)Amazon SageMaker (December 2018)
Amazon SageMaker (December 2018)
 
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
 
From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon Sagemaker
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Amazon SageMaker 內建機器學習演算法 (Level 400)

  • 1. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. James Chiang (蔣宗恩) Solutions Architect, Amazon Web Services AWS AI & ML: Overview of the 10 Most Common ML Algorithms
  • 2. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. Agenda • AI Quick Introduction • Supervised Learning Algorithm • Unsupervised Learning Algorithm • How Sagemaker Run Algorithm
  • 3. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. I Will Be Back !!!!
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AI, Machine Learning, and Deep Learning
  • 5. Fast Compute Ubiquitous Data Advanced Learning Algorithms Key Drivers for Deep Learning
  • 6.
  • 7. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. Types of Machine Learning Supervised Human Intervention and validation required Photo classification or tagging Vs. Unsupervised No human intervention required Auto-classification of documents based on context
  • 8. Supervised Learning • Training data provides “examples” and “outcomes” • The machine learns to predict the outcome of new data based on the past examples
  • 9. Supervised Learning Training data has one feature that is the “outcome” Split the data into a training and test set Model the training set / Predict the test test Compares the predictions to the known values Algorithms Model / Ensemble Logistic Regression Time Series Sometimes referred to as the “label” or “objective” Goal is to build a model which can predict the outcome If categorical: model is a “classification” or model is a “regression” Because the data has a known value, model can be evaluated
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Unsupervised Learning Training data provides “examples” - no specific “outcome” The machine tries to find “interesting” patterns in the data
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Unsupervised Learning Training data has only examples and no specific “outcome” This is common - labels are typically expensive Goal is to perform discovery, find patterns, etc Tends to be more difficult Algorithms Clusters Anomaly Detection Association Discovery Topic Models Because the data has no “outcome”, can not be evaluated Each discovery method has it’s own quality measures
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Data Visualization & Analysis Business Problem – ML problem framing Data Collection Data Integration Data Preparation & Cleaning Feature Engineering Model Training & Parameter Tuning Model Evaluation Are Business Goals met? Model Deployment Monitoring & Debugging – Predictions YesNo DataAugmentation Feature Augmentation The Machine Learning Process Re-training
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Introducing: Amazon SageMaker A managed service that provides the quickest and easiest way for your data scientists and developers to get ML models from idea to production.
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Data Visualization & Analysis Business Problem – ML problem framing Data Collection Data Integration Data Preparation & Cleaning Feature Engineering Model Training & Parameter Tuning Model Evaluation Are Business Goals met? Model Deployment Monitoring & Debugging – Predictions YesNo DataAugmentation Feature Augmentation Retraining • Setup and manage Notebook Environments • Setup and manage Training Clusters • Write Data Connectors • Scale ML algorithms to large datasets • Distribute ML training algorithm to multiple machines • Secure Model artifacts Why We built Amazon SageMaker: The Model Training Undifferentiated Heavy Lifting
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Built-in ML Algorithm h t t p s : / / d o c s . a w s . a m a z o n . c o m / s a g e m a k e r / l a t e s t / d g / a l g o s . h t m l Problem Algorithm Learning Typ Discrete Classification, Regression Linear Learner Supervised XGBoost Algorithm Supervised Discrete Recommendations Factorization Machines Supervised Image Classification Image Classification Algorithm Supervised, CNN Neural Machine Translation Sequence to Sequence Supervised, seq2seq Time-series Prediction DeepAR Supervised, RNN Discrete Groupings K-Means Algorithm Unsupervised Dimensionality Reduction PCA (Principal Component Analysis) Unsupervised Topic Determination Latent Dirichlet Allocation (LDA) Unsupervised Neural Topic Model (NTM) Unsupervised, Neural Network Based
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Supervised Learning Algorithm
  • 17. Linear Learner Regression: Estimate a real valuedfunction Binary classification: Predict a 0/1 class © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. Train Fit thresholds and select Select modelwith best validation performance Linear Learner More than 8x speed-up over naive parallel training! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. Linear Learner Regression (mean squarederror) SageMaker Other 1.02 1.06 1.09 1.02 0.332 0.183 0.086 0.129 83.3 84.5 Classification (F1 Score) SageMaker Other 0.980 0.981 0.870 0.930 0.997 0.997 0.978 0.964 0.914 0.859 0.470 0.472 0.903 0.908 0.508 0.508 30-GB datasets for web-spam and web-URL classification 1.2 1 0.8 0.6 0.4 0.2 0 0 5 10 20 25 30 CostinDollars 15 Billable time inMinutes sagemaker-url sagemaker-spam other-url other-spam © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Boosted Decision Trees Throughput vs. number of machinesXGBoost is one of the most commonly used implementations of boosted decision trees in theworld. It is now available inAmazon SageMaker! 400 200 0 600 800 1000 1200 1400 0 10 60 70 ThroughputinMB/Sec 20 30 40 50 Number of machines (C4.8xLarge)
  • 22. When Trees forms a Forest(Tree Ensembles)
  • 23. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. split weight Training Loss measures how Well model fit on training data Regularization, measures complexity of trees XGboots introduction
  • 24. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. XGboots introduction
  • 25. Factorization Machines Log_loss F1 Score Seconds SageMaker 0.494 0.277 820 Other (10 Iter) 0.516 0.190 650 Other (20 Iter) 0.507 0.254 1300 Other (50 Iter) 0.481 0.313 3250 Click Prediction 1-TBadvertising dataset, m4.4xlarge machines, perfect scaling $200.00 $180.00 $160.00 $140.00 $120.00 $100.00 $80.00 $60.00 $40.00 $20.00 $- 1 2 3 6 7 8 Costindollars 4 5 Billable time in hours sa 50 40 30 20 10 m chines machine s machin e © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 26. Where factorization is useful A user movie rating table
  • 28. Predictions for recommendations Star Wars Inception Godfather Notebook U1 4.97 2.98 2.18 0.98 U2 3.97 2.4 1.97 0.99 U3 1.02 0.93 5.02 4.93 U4 1.00 0.85 4.59 3.93 U5 1.36 1.07 4.89 4.12
  • 29. Sequence to Sequence problems 0 5 10 15 20 25 0 5 10 15 20 25 BLEUScore Billable Time in HoursP2.16x P2.8x P2.x Best known result! Based on Sockeye and Apache incubated MxNet, Multi-GPU, and can be used for Neural Machine Translation. Supports both RNN/CNN as encoder/decoder
  • 30. RNN Introduction Ex: Arrive Taipei at November 2nd Leave Taipei at November 2nd LSTM
  • 31. Sequence to Sequence problems (1) Vanilla mode of processing without RNN, from fixed-sized input to fixed-sized output (e.g. image classification). (2) Sequence output (e.g. image captioning takes an image and outputs a sentence of words). (3) Sequence input (e.g. sentiment analysis where a given sentence is classified as expressing positive or negative sentiment). (4) Sequence input and sequence output (e.g. Machine Translation: an RNN reads a sentence in English and then outputs a sentence in French). (5) Synced sequence input and output (e.g. video classification where we wish to label each frame of the video).
  • 32. Image Classification Implementation in MxNet of ResNet. Other networks such as DenseNet and Inception will be added in the future. Transfer learning: begin with a model already trained on ImageNet! 0 0.5 1 1.5 2 2.5 3 3.5 0 1 2 3 4 5 Speedup Number of Machine (P2) Speedup with Horizontal Scaling
  • 34. IMAGENET Challenge 11 million labelled image across 11000 categories!
  • 36. DeepAR 1. DeepAR model effectively learns a global model from related time series 2. Is able to lean complex patterns such as seasonality and uncertainty growth over time from the data 3. Interestingly, the method works with little or no hyper parameters tuning on wide variety of dataset, and in is applicable to medium-size dataset containing only few hundred time series 4. Scales up to datasets comprising 100000+ time series
  • 37. Time series global model for related time series Related time series e.g. demand for various Product sold by you
  • 38. Time Series Forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA One hour on p2.xlarge, $1 Input Network
  • 39. Types of forecasts 1. Cold start forecasting 2. Probability forecasts
  • 40. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Unsupervised Learning Algorithm
  • 41. K-Means Clustering © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 42. K-Means Clustering © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Method Accurate? Passes Efficient tuning Comments Lloyds [1] Yes* 5–10 No K-Means ++ [2] Yes k+5 to k+10 No scikit-learn K-Means|| [3] Yes 7–12 No spark.ml Online [4] No 1 No Streaming [5,6] No 1 No Impractical Webscale [7] No 1 No Spark streaming Coresets [8] No 1 Yes Impractical Amazon SageMaker Yes 1 Yes 1 Lloyd, IEEE TIT, 1982 2 Arthur et. al. ACM-SIAM, 2007 3 Bahmani et.al.,VLDB, 2012 4 Liberty et. al., 2015 5 Shindler et. al, NIPS, 2011 6 Guha et. al, IEEE Trans. Knowl. Data Eng. 2003 7 Sculley, WWW, 2010 8 Feldman et. al.
  • 43. 2 1 0 3 6 5 4 7 8 10 500 BillableTimeinMinutes 100 Number of clusters sagemaker other K-Means Clustering 9.77E2 Running time vs. Number ofclustersk SageMaker Other Text 1.2 GB 10 1.18E3 1.18E3 100 1.00E3 500 9.18.E2 9.03E2 Images 9 GB 10 3.29E2 3.28E2 100 2.72E2 2.71E2 500 2.17E2 Failed Videos 27 GB 10 2.19E2 2.18E2 100 2.03E2 2.02E2 500 1.86E2 1.85E2 Advertising 127 GB 10 1.72E7 Failed 100 1.30E7 Failed 500 1.03E7 Failed Synthetic 1,100 GB 10 3.81E7 Failed 100 3.51E7 Failed 500 2.81E7 Failed © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. ~10x faster!
  • 44. Principal Component Analysis (PCA) © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 45. Principal Component Analysis (PCA) Data Distribution
  • 50. Principal Component Analysis (PCA) The variation for PC1 = 15, The variation for PC2 = 3
  • 53. Principal Component Analysis (PCA) More than 10x fa ster at afractionthe cost! 0.00 20.00 40.00 60.00 80.00 100.00 120.00 8 20 Mb/Sec/Machine 10 Number of machines other sagemaker-deterministic sagemaker-randomized Cost vs.time Throughput and scalability 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 5 10 35 40 45 CostinDollars other 15 20 25 30 Billable time inminutes sagemaker-deterministic sagemaker-randomized © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 54. Spectral LDA Training Time vs. Number of Topics 0 50 100 150 200 250 0 20 40 60 80 100TrainingTimeinMinutes Number of Topics lda-data-a lda-data-b other-data-a other-data-b
  • 55. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. Topic eat sleep play meow bark Topic 1 0.1 0.3 0.2 0.4 0.0 Topic 2 0.2 0.1 0.4 0.0 0.3 Sample Output About cats, as cats like To meow and sleep About dogs, as dogs like To play and bark
  • 56. Neural Topic Modeling • Perplexity vs. number of topics Input term counts vector Document posterior Sampled document representation Decoder: Softmax Output term counts vector 6000 4000 2000 0 8000 10000 12000 0 50 150 200 Perplexity 100 Number oftopics NTM Other © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 57. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using Amazon SageMaker
  • 59.
  • 60. Input data From the Command Line profile=<your_profile> arn_role=<your_arn_role> training_image=382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1 training_job_name=clustering_text_documents_`date '+%Y_%m_%d_%H_%M_%S'` aws -- profile $profile --region us-east-1 sagemaker create-training-job --training-job-name $training_job_name --algorithm-specification TrainingImage=$training_image,TrainingInputMode=File --hyper-parameters k=10,feature_dim=1024,mini_batch_size=1000 --role-arn $arn_role --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "ShardedByS3Key"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://training_output/$training_job_name --resource-config InstanceCount=2,InstanceType=ml.c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInSeconds=3600 Hardware Algorithm © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 61. From Amazon EMR Hardware Parameters Start training Apply model © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 62. From Amazon SageMaker Notebooks Hardware Parameters Start training Host model © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 64. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ML compute by the second starting at $0.0464/hr ML storage by the second at $0.14 per GB-month Data processed in notebooks and hosting at $0.016 per GB Free trial to get started quickly Pay-as -You-Go (Inexpensive, Too)
  • 65. Amazon SageMaker – Try It Free aws.amazon.com/sagemaker © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 66. Amazon ML Lab Lots of companies doing Machine Learning Unable to unlock business potential Brainstorming Modeling Teaching Lack ML expertise Leverage Amazon experts with decades of ML experience with technologies like Amazon Echo, Amazon Alexa, Prime Air, and Amazon Go Amazon ML Lab provides the missing ML expertise
  • 67. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!