SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Andre Dufour, AWS Auto Scaling
Hook Hua, Jet Propulsion Laboratory
December 2016
Auto Scaling
CMP201
The Fleet Management Solution for Planet Earth
Availability with fleet management
Keeping up with demand with dynamic scaling
Future of Auto Scaling
Myth Fact
My application doesn’t need scaling,
so I don’t benefit from Auto Scaling
It’s hard to use
My instances are stateful or unique;
I can’t use Auto Scaling
Auto Scaling groupAuto Scaling group
Auto Scaling
Fleet management Dynamic scaling
ELB
EC2 instances
ELB
CPU
Utilization
EC2 instances
Is Fleet Management For You?
“I’ve got instances serving a
business-impacting application”
“If my instances become unhealthy,
I’d like them replaced automatically”
“I would like my instances distributed
to maximize resilience”
Demo
Dynamic Scaling
Scaling on a Schedule
Recurring scaling
events
Schedule
individual events
Auto Scaling group
ELB
EC2 instances
Reactive Scaling Policies
Collect metrics
Alarm fires when
threshold is crossed
Auto Scaling
Alarm is
connected to
Auto Scaling
policy
Scaling event is triggered
ELB
Auto Scaling group
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
JPL: Auto Scaling Advantage
Advanced Rapid Imaging and Analysis
© 2016 California Institute of Technology. Government sponsorship acknowledged.
Reference herein to any specific commercial product, or service by trade name, trademark, manufacturer, or otherwise, does not constitute or
imply its endorsement by the United States Government or the Jet Propulsion Laboratory, California Institute of Technology.
Challenge: Remote Sensing for Disaster Response
Automated data system are required to analyze large quantities
of data from NASA NISAR, other satellite missions, and rapidly
expandingGPS networks.
Mean Access Time (Day)
∞ 4 2 1.3 1
Going from Artisan to Automation: Use system engineering
approach to translate specialized data analysis into
operational capability.
Orbit Data Coherence
Library
AZO / MAI
EQ
Kinema c
Modeling
Level-0 Data
Kinema c
Model
Catalog &
Waveform
Sta c
Model
InSAR Processing
Time Series
Con guous Pairs Sta c
Modeling
Int. Phase
Stress Change
III. Rou ne Data Processing
Coherence
Change Maps
Seismometer
Radar Sensor
I. Geode c
Sensors
II. Data Providers / Archives
Global CMT
USGS NEIC
IRIS (Waveform)
Fault
Geometry
Mw>5.5 Mw>6.5
Yes
Yes
No Small EQ
Modeling
Amp / Arr. T
correc on
IV. Triggered
Legend
Ready to be used ( > 70 % readiness )
Need modifica on ( 30 – 70 % readiness)
Doesn’t exist ( < 30 % readiness) Latency / Bo leneck
Out of scope of R&TD
Customer Products
Level-0 Data
Op cal Im.
Processing
Offset Image
Event Driven
Op cal Sensor
SPOT Image (SPOT 1-5,
Formosat 2)
NSPO (SPOT)
DigitalGlobe (Quickbird,
Worldview 1)
RINEX Data
GPS Processing
Daily
Point Pos.
1 Hz (GDGPS)
Hourly
Daily
GPS Sta on
Orbit Data
Sub-daily
Point Pos.
SOPAC
+ many others
GPS Constella on
NASA - JPL
V. Customer
Products
Damage
Assessment
3D Surface
Disp. Maps
Ground
Mo on
3D Surface
Disp. Maps
EQ catalog
CTBT
correc on
surface
Coseismic
Interferogram
Coseismic GPS
vector map
Coseismic
Interferogram
Coseismic GPS
vector map
Temporal Records
of Ground Deformation
Spatial Maps of Ground
Deformation
Earthquake
Models
Coseismic Ground
Deformation
Demonstrate response to hazards with standardized set of data products for decision & policy makers.
Coseismic
Damage
Automated
Data Collection
& Processing
Integrating
Space Geodesy
Seismology
Modeling
Radar
Sensors
Building Damage
& Inundation
Radar
GPS Seismology
Permanent
Ground Deformation
High-Resolution
Hazard Assessment
from Fault Models
Monitoring &
Near Real-Time
Assessment
“We have high hopes
for ARIA.”
Keiko Saito
World Bank
“This is exactly the kind of
product we are looking for.”
Anne Rosinski
Calif. Geol. Survey
Examples from the 2011 M9.0
Tohoku-Oki (Japan) earthquake
GPS
Networks
Seismic
Networks
Optical
Sensors
Advanced Rapid Imaging and Analysis (ARIA)
Example Analysis Scenarios
Afar/Asal Rifts
Volcanism
2013 Sudan Floods
Disaster
CaliMap
Tectonics
Hydrology
Disaster
Copahue
VolcanismRutford Ice Stream
Glacier Dynamics
2013 Oklahoma Tornado
Disaster
2012 New York City
Hurricane Sandy
Disaster2013 Colorado Floods
Disaster
2013 Pakistan Earthquake
Tectonics
Sierra Negra
Volcanism
CSK 1
CSK 2
CSK 3
CSK 4
North Anatolian Fault, Turkey
Tectonics, Disaster
Kilauea
Volcanism
Potenza, Italy
Landslide Dynamics
Slumgullion
Landslide
Dynamics
CSK 1
CSK 4
CSK 3
CSK 2
• Sample pipelined PGE
• Each major PGE processing step is deployed in its own AWS auto-scaling group (ASG)
Process Control & Management in AWS
Availability Zone
Auto Scaling group
instances
Compute Workers
Availability Zone
Auto Scaling group
instances
Compute Workers
Availability Zone
Auto Scaling group
instances
Compute Workers
bucket with
objects
Metadata Data
Availability Zone
instances
Process Control &
Management services
Search Catalog of Datasets
instances
Resource Management
instances
Analytics / Metrics
Release to Distribution Data Centers
Peg12
Dop12
Int-
gen
I12,A1
2
tropoMap I12,A1
2
tc
tc
unwrap I12,A1
2
tc,u
tc,u
int
Telemetry
Job control
Auto Scaling Science Data System
The size of the science data system compute nodes can automatically grow/shrink based on processing demand
Auto Scaling group policies
• Auto scaling size = 21
• Metric alarm of queue size > 20
• Auto scaling rest period of 60-seconds
ASG max set to 1000 instances x 32 vCPUs
Auto scaling enabling runs
of over 100,000 vCPUs
Hybrid Cloud Science Data System
OCO-2 L2 Full Physics processing operational in AWS
• Processing of L2 full physics data products in Amazon cloud across multiple regions
• Scaled up thousands of compute nodes
• Demonstrated capability of higher internal data throughput rates than NISAR needs
Number of
compute
nodes over
time
Per node
transfer rate
over time
Scalable
internal data
throughput
@ 32,000 PGEs on
1,000 nodes
Optimizing for Scaling In/Out Events
• Scaling group resets and timeout periods
• Scale up in group sizes of multiples of
availability zones (AZ) to minimize AZ
load rebalancing terminations
Scaling up (scale out) • What policy to set to scale down?
• E.g. CPU / network utilization
• Domain knowledge only known within the
compute instances
• Self-terminating instances
• “harakiri” / “seppuku”
Scaling down (scale in)
Spot Market
• Major cost savings (75%-90% savings over on-demand)…if can use
spot instances
• On spot market, AWS will terminate compute instances if market
prices exceed bid threshold
• HySDS able to self-recover from spot instance terminations by using
Auto Scaling
High-Resiliency
X
X
X X
X
Availability Zone a Availability Zone b Availability Zone c
Running in spot market forces the data
system to be more resilient to failures
“Market Maker”
This OCO-2 data production
run of 1000 x 32vCPUs
affected the market prices
Strategy:
• Mitigate impact on spot market
• Diversification of resources
• “Spot fleet”
“Thundering Herd”
Fleet of ASG compute
instances calling same
services at same time
• “API rate limit exceeded”
“Jittering” the API calls
• Introduce randomizations
to API calls
• Distributes load on
infrastructure
Service
Compute Fleet Instances
Next Generation Missions
0
10
20
30
40
50
60
70
80
90
100
OCO-2 SMAP NISAR
Estimated Daily Volume (TB)
• The volume of data
produced is larger
than previous
missions
• Data storage,
processing,
movement, and
costs are the
biggest challenges
(2021)(2009) (2015)
Future of Auto Scaling:
Application Auto Scaling
Application Auto Scaling: Amazon ECS
Scale out
Scale in
Service
ECS cluster
Task
Task
Task
Task
Task
Application Auto Scaling: Spot Fleet
Spot Fleet
Spot Fleet
Auto Scaling
Application Auto Scaling: Amazon EMR
EMR Cluster
Summary
• Availability with Fleet Management
• Resources on aws.amazon.com/autoscaling
• Keeping up with demand with Dynamic Scaling
• Case studies on Auto Scaling website
• Future of Auto Scaling
• Blog posts on Application Auto Scaling
• Questions:
• autoscaling-reinvent2016-questions@amazon.com
Thank you!
Additional authors: Hook Hua1, Gerald Manipon1, Michael Starch1, Lan Dang1, Justin Linick1, Sang-Ho Yun1, Gian
Franco Sacco1, Piyush Agram1, Susan Owen1, Brian Bue1,, Eric Fielding1, Paul Lundgren1, Angelyn Moore1, Paul
Rosen1, Zhen Liu1, Eric Gurrola1, Tom Farr1, Vincent Realmuto1, Frank Webb1, Mark Simons2, Pietro Milillo1
1 Jet Propulsion Laboratory
2 California Institute of Technology
Remember to complete
your evaluations!

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Cost Optimization at Scale
Cost Optimization at ScaleCost Optimization at Scale
Cost Optimization at Scale
 
Optimize Content Processing in the Cloud with GPU and Spot Instances
Optimize Content Processing in the Cloud with GPU and Spot InstancesOptimize Content Processing in the Cloud with GPU and Spot Instances
Optimize Content Processing in the Cloud with GPU and Spot Instances
 
AWS re:Invent 2016: How DataXu scaled its Attribution System to handle billio...
AWS re:Invent 2016: How DataXu scaled its Attribution System to handle billio...AWS re:Invent 2016: How DataXu scaled its Attribution System to handle billio...
AWS re:Invent 2016: How DataXu scaled its Attribution System to handle billio...
 
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
 
Agile BI - Pop-up Loft Tel Aviv
Agile BI - Pop-up Loft Tel AvivAgile BI - Pop-up Loft Tel Aviv
Agile BI - Pop-up Loft Tel Aviv
 
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
 
Real-time Data Processing Using AWS Lambda
Real-time Data Processing Using AWS LambdaReal-time Data Processing Using AWS Lambda
Real-time Data Processing Using AWS Lambda
 
AWS re:Invent 2016: Cross-Region Replication with Amazon DynamoDB Streams (DA...
AWS re:Invent 2016: Cross-Region Replication with Amazon DynamoDB Streams (DA...AWS re:Invent 2016: Cross-Region Replication with Amazon DynamoDB Streams (DA...
AWS re:Invent 2016: Cross-Region Replication with Amazon DynamoDB Streams (DA...
 
Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)
 
SEC301 Security @ (Cloud) Scale
SEC301 Security @ (Cloud) ScaleSEC301 Security @ (Cloud) Scale
SEC301 Security @ (Cloud) Scale
 
SMC303 Real-time Data Processing Using AWS Lambda
SMC303 Real-time Data Processing Using AWS LambdaSMC303 Real-time Data Processing Using AWS Lambda
SMC303 Real-time Data Processing Using AWS Lambda
 
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
 
Introduction to Amazon EC2 Spot
Introduction to Amazon EC2 Spot Introduction to Amazon EC2 Spot
Introduction to Amazon EC2 Spot
 
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
 
Running Relational Databases on AWS
Running Relational Databases on AWS  Running Relational Databases on AWS
Running Relational Databases on AWS
 
ENT203 Monitoring and Autoscaling, a Match Made in Heaven
ENT203 Monitoring and Autoscaling, a Match Made in HeavenENT203 Monitoring and Autoscaling, a Match Made in Heaven
ENT203 Monitoring and Autoscaling, a Match Made in Heaven
 
Getting Started with AWS Database Migration Service
Getting Started with AWS Database Migration ServiceGetting Started with AWS Database Migration Service
Getting Started with AWS Database Migration Service
 
AWS re:Invent 2016: Large-Scale, Cloud-Based Analysis of Cancer Genomes: Less...
AWS re:Invent 2016: Large-Scale, Cloud-Based Analysis of Cancer Genomes: Less...AWS re:Invent 2016: Large-Scale, Cloud-Based Analysis of Cancer Genomes: Less...
AWS re:Invent 2016: Large-Scale, Cloud-Based Analysis of Cancer Genomes: Less...
 
AWS re:Invent 2016: Using AWS Lambda to Build Control Systems for Your AWS In...
AWS re:Invent 2016: Using AWS Lambda to Build Control Systems for Your AWS In...AWS re:Invent 2016: Using AWS Lambda to Build Control Systems for Your AWS In...
AWS re:Invent 2016: Using AWS Lambda to Build Control Systems for Your AWS In...
 
AWS re:Invent 2016: Disrupting Big Data with Cost-effective Compute (CMP302)
AWS re:Invent 2016: Disrupting Big Data with Cost-effective Compute (CMP302)AWS re:Invent 2016: Disrupting Big Data with Cost-effective Compute (CMP302)
AWS re:Invent 2016: Disrupting Big Data with Cost-effective Compute (CMP302)
 

Andere mochten auch

Programming Amazon Web Services for Beginners (1)
Programming Amazon Web Services for Beginners (1)Programming Amazon Web Services for Beginners (1)
Programming Amazon Web Services for Beginners (1)
Markus Klems
 
Fleet Management in Russia, CIS and Eastern Europe
Fleet Management in Russia, CIS and Eastern EuropeFleet Management in Russia, CIS and Eastern Europe
Fleet Management in Russia, CIS and Eastern Europe
johanfagerberg
 

Andere mochten auch (20)

(CMP201) All You Need To Know About Auto Scaling
(CMP201) All You Need To Know About Auto Scaling(CMP201) All You Need To Know About Auto Scaling
(CMP201) All You Need To Know About Auto Scaling
 
Day 5 - AWS Autoscaling Master Class - The New Capacity Plan
Day 5 - AWS Autoscaling Master Class - The New Capacity PlanDay 5 - AWS Autoscaling Master Class - The New Capacity Plan
Day 5 - AWS Autoscaling Master Class - The New Capacity Plan
 
Auto Scaling on AWS
Auto Scaling on AWSAuto Scaling on AWS
Auto Scaling on AWS
 
Auto scaling using Amazon Web Services ( AWS )
Auto scaling using Amazon Web Services ( AWS )Auto scaling using Amazon Web Services ( AWS )
Auto scaling using Amazon Web Services ( AWS )
 
How Does Amazon EC2 Auto Scaling Work
How Does Amazon EC2 Auto Scaling WorkHow Does Amazon EC2 Auto Scaling Work
How Does Amazon EC2 Auto Scaling Work
 
Amazon S3 Masterclass
Amazon S3 MasterclassAmazon S3 Masterclass
Amazon S3 Masterclass
 
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
AWS re:Invent 2016: Deep Dive on Amazon Elastic Block Store (STG301)
 
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
AWS re:Invent 2016: Elastic Load Balancing Deep Dive and Best Practices (NET403)
 
Aws Autoscaling
Aws AutoscalingAws Autoscaling
Aws Autoscaling
 
CTO Night & Day Morning Session "Auto Scaling & Spot Instances Deep Dive"
CTO Night & Day Morning Session "Auto Scaling & Spot Instances Deep Dive"CTO Night & Day Morning Session "Auto Scaling & Spot Instances Deep Dive"
CTO Night & Day Morning Session "Auto Scaling & Spot Instances Deep Dive"
 
Deep Dive on Amazon S3 (May 2016)
Deep Dive on Amazon S3 (May 2016)Deep Dive on Amazon S3 (May 2016)
Deep Dive on Amazon S3 (May 2016)
 
Programming Amazon Web Services for Beginners (1)
Programming Amazon Web Services for Beginners (1)Programming Amazon Web Services for Beginners (1)
Programming Amazon Web Services for Beginners (1)
 
Cloud Analytics
Cloud AnalyticsCloud Analytics
Cloud Analytics
 
Cloud analytics
Cloud analyticsCloud analytics
Cloud analytics
 
Cloud Analytics - Using cloud based services to analyse big data
Cloud Analytics - Using cloud based services to analyse big dataCloud Analytics - Using cloud based services to analyse big data
Cloud Analytics - Using cloud based services to analyse big data
 
AWS re:Invent 2016: Case Study: Librato's Experience Running Cassandra Using ...
AWS re:Invent 2016: Case Study: Librato's Experience Running Cassandra Using ...AWS re:Invent 2016: Case Study: Librato's Experience Running Cassandra Using ...
AWS re:Invent 2016: Case Study: Librato's Experience Running Cassandra Using ...
 
Toyota Reach Trucks – The Simply Effective BT Reflex
Toyota Reach Trucks – The Simply Effective BT ReflexToyota Reach Trucks – The Simply Effective BT Reflex
Toyota Reach Trucks – The Simply Effective BT Reflex
 
Fleet Management in Russia, CIS and Eastern Europe
Fleet Management in Russia, CIS and Eastern EuropeFleet Management in Russia, CIS and Eastern Europe
Fleet Management in Russia, CIS and Eastern Europe
 
Презентация Такси Всегда для корпоративных клиентов
Презентация Такси Всегда для корпоративных клиентовПрезентация Такси Всегда для корпоративных клиентов
Презентация Такси Всегда для корпоративных клиентов
 
ESAM 6900 rus
ESAM 6900 rusESAM 6900 rus
ESAM 6900 rus
 

Ähnlich wie AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet Earth (CMP201)

Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Yahoo Developer Network
 

Ähnlich wie AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet Earth (CMP201) (20)

Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:In...
Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:In...Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:In...
Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:In...
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
 
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
 
#lspe Q1 2013 dynamically scaling netflix in the cloud
#lspe Q1 2013   dynamically scaling netflix in the cloud#lspe Q1 2013   dynamically scaling netflix in the cloud
#lspe Q1 2013 dynamically scaling netflix in the cloud
 
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
 
The Evolution of Apache Kylin
The Evolution of Apache KylinThe Evolution of Apache Kylin
The Evolution of Apache Kylin
 
Modernizing upstream workflows with aws storage - john mallory
Modernizing upstream workflows with aws storage -  john malloryModernizing upstream workflows with aws storage -  john mallory
Modernizing upstream workflows with aws storage - john mallory
 
AWS Public Sector Symposium 2014 Canberra | Managing Seasonal Workloads on AWS
AWS Public Sector Symposium 2014 Canberra | Managing Seasonal Workloads on AWS AWS Public Sector Symposium 2014 Canberra | Managing Seasonal Workloads on AWS
AWS Public Sector Symposium 2014 Canberra | Managing Seasonal Workloads on AWS
 
Launching Your First Big Data Project on AWS
Launching Your First Big Data Project on AWSLaunching Your First Big Data Project on AWS
Launching Your First Big Data Project on AWS
 
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of AmazonBig Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
 
Apache Kylin 1.5 Updates
Apache Kylin 1.5 UpdatesApache Kylin 1.5 Updates
Apache Kylin 1.5 Updates
 
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
 
How Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon RedshiftHow Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon Redshift
 
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and moreBig Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
Big Data & Analytics - Use Cases in Mobile, E-commerce, Media and more
 
Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS
 
AWS AutoScalling- Tech Talks Maio 2019
AWS AutoScalling- Tech Talks Maio 2019AWS AutoScalling- Tech Talks Maio 2019
AWS AutoScalling- Tech Talks Maio 2019
 
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
 
Analytics in the Cloud
Analytics in the CloudAnalytics in the Cloud
Analytics in the Cloud
 
AWS Summit London 2014 | Improving Availability and Lowering Costs (300)
AWS Summit London 2014 | Improving Availability and Lowering Costs (300)AWS Summit London 2014 | Improving Availability and Lowering Costs (300)
AWS Summit London 2014 | Improving Availability and Lowering Costs (300)
 
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
More Nines for Your Dimes: Improving Availability and Lowering Costs using Au...
 

Mehr von Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet Earth (CMP201)

  • 1. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Andre Dufour, AWS Auto Scaling Hook Hua, Jet Propulsion Laboratory December 2016 Auto Scaling CMP201 The Fleet Management Solution for Planet Earth
  • 2. Availability with fleet management Keeping up with demand with dynamic scaling Future of Auto Scaling
  • 3. Myth Fact My application doesn’t need scaling, so I don’t benefit from Auto Scaling It’s hard to use My instances are stateful or unique; I can’t use Auto Scaling
  • 4. Auto Scaling groupAuto Scaling group Auto Scaling Fleet management Dynamic scaling ELB EC2 instances ELB CPU Utilization EC2 instances
  • 5. Is Fleet Management For You? “I’ve got instances serving a business-impacting application” “If my instances become unhealthy, I’d like them replaced automatically” “I would like my instances distributed to maximize resilience”
  • 8. Scaling on a Schedule Recurring scaling events Schedule individual events Auto Scaling group ELB EC2 instances
  • 9. Reactive Scaling Policies Collect metrics Alarm fires when threshold is crossed Auto Scaling Alarm is connected to Auto Scaling policy Scaling event is triggered ELB Auto Scaling group
  • 10. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. JPL: Auto Scaling Advantage Advanced Rapid Imaging and Analysis © 2016 California Institute of Technology. Government sponsorship acknowledged. Reference herein to any specific commercial product, or service by trade name, trademark, manufacturer, or otherwise, does not constitute or imply its endorsement by the United States Government or the Jet Propulsion Laboratory, California Institute of Technology.
  • 11. Challenge: Remote Sensing for Disaster Response Automated data system are required to analyze large quantities of data from NASA NISAR, other satellite missions, and rapidly expandingGPS networks. Mean Access Time (Day) ∞ 4 2 1.3 1 Going from Artisan to Automation: Use system engineering approach to translate specialized data analysis into operational capability. Orbit Data Coherence Library AZO / MAI EQ Kinema c Modeling Level-0 Data Kinema c Model Catalog & Waveform Sta c Model InSAR Processing Time Series Con guous Pairs Sta c Modeling Int. Phase Stress Change III. Rou ne Data Processing Coherence Change Maps Seismometer Radar Sensor I. Geode c Sensors II. Data Providers / Archives Global CMT USGS NEIC IRIS (Waveform) Fault Geometry Mw>5.5 Mw>6.5 Yes Yes No Small EQ Modeling Amp / Arr. T correc on IV. Triggered Legend Ready to be used ( > 70 % readiness ) Need modifica on ( 30 – 70 % readiness) Doesn’t exist ( < 30 % readiness) Latency / Bo leneck Out of scope of R&TD Customer Products Level-0 Data Op cal Im. Processing Offset Image Event Driven Op cal Sensor SPOT Image (SPOT 1-5, Formosat 2) NSPO (SPOT) DigitalGlobe (Quickbird, Worldview 1) RINEX Data GPS Processing Daily Point Pos. 1 Hz (GDGPS) Hourly Daily GPS Sta on Orbit Data Sub-daily Point Pos. SOPAC + many others GPS Constella on NASA - JPL V. Customer Products Damage Assessment 3D Surface Disp. Maps Ground Mo on 3D Surface Disp. Maps EQ catalog CTBT correc on surface Coseismic Interferogram Coseismic GPS vector map Coseismic Interferogram Coseismic GPS vector map Temporal Records of Ground Deformation Spatial Maps of Ground Deformation Earthquake Models Coseismic Ground Deformation Demonstrate response to hazards with standardized set of data products for decision & policy makers. Coseismic Damage
  • 12. Automated Data Collection & Processing Integrating Space Geodesy Seismology Modeling Radar Sensors Building Damage & Inundation Radar GPS Seismology Permanent Ground Deformation High-Resolution Hazard Assessment from Fault Models Monitoring & Near Real-Time Assessment “We have high hopes for ARIA.” Keiko Saito World Bank “This is exactly the kind of product we are looking for.” Anne Rosinski Calif. Geol. Survey Examples from the 2011 M9.0 Tohoku-Oki (Japan) earthquake GPS Networks Seismic Networks Optical Sensors Advanced Rapid Imaging and Analysis (ARIA)
  • 13. Example Analysis Scenarios Afar/Asal Rifts Volcanism 2013 Sudan Floods Disaster CaliMap Tectonics Hydrology Disaster Copahue VolcanismRutford Ice Stream Glacier Dynamics 2013 Oklahoma Tornado Disaster 2012 New York City Hurricane Sandy Disaster2013 Colorado Floods Disaster 2013 Pakistan Earthquake Tectonics Sierra Negra Volcanism CSK 1 CSK 2 CSK 3 CSK 4 North Anatolian Fault, Turkey Tectonics, Disaster Kilauea Volcanism Potenza, Italy Landslide Dynamics Slumgullion Landslide Dynamics CSK 1 CSK 4 CSK 3 CSK 2
  • 14. • Sample pipelined PGE • Each major PGE processing step is deployed in its own AWS auto-scaling group (ASG) Process Control & Management in AWS Availability Zone Auto Scaling group instances Compute Workers Availability Zone Auto Scaling group instances Compute Workers Availability Zone Auto Scaling group instances Compute Workers bucket with objects Metadata Data Availability Zone instances Process Control & Management services Search Catalog of Datasets instances Resource Management instances Analytics / Metrics Release to Distribution Data Centers Peg12 Dop12 Int- gen I12,A1 2 tropoMap I12,A1 2 tc tc unwrap I12,A1 2 tc,u tc,u int Telemetry Job control
  • 15. Auto Scaling Science Data System The size of the science data system compute nodes can automatically grow/shrink based on processing demand Auto Scaling group policies • Auto scaling size = 21 • Metric alarm of queue size > 20 • Auto scaling rest period of 60-seconds ASG max set to 1000 instances x 32 vCPUs Auto scaling enabling runs of over 100,000 vCPUs
  • 16. Hybrid Cloud Science Data System OCO-2 L2 Full Physics processing operational in AWS • Processing of L2 full physics data products in Amazon cloud across multiple regions • Scaled up thousands of compute nodes • Demonstrated capability of higher internal data throughput rates than NISAR needs Number of compute nodes over time Per node transfer rate over time Scalable internal data throughput @ 32,000 PGEs on 1,000 nodes
  • 17. Optimizing for Scaling In/Out Events • Scaling group resets and timeout periods • Scale up in group sizes of multiples of availability zones (AZ) to minimize AZ load rebalancing terminations Scaling up (scale out) • What policy to set to scale down? • E.g. CPU / network utilization • Domain knowledge only known within the compute instances • Self-terminating instances • “harakiri” / “seppuku” Scaling down (scale in)
  • 18. Spot Market • Major cost savings (75%-90% savings over on-demand)…if can use spot instances • On spot market, AWS will terminate compute instances if market prices exceed bid threshold • HySDS able to self-recover from spot instance terminations by using Auto Scaling
  • 19. High-Resiliency X X X X X Availability Zone a Availability Zone b Availability Zone c Running in spot market forces the data system to be more resilient to failures
  • 20. “Market Maker” This OCO-2 data production run of 1000 x 32vCPUs affected the market prices Strategy: • Mitigate impact on spot market • Diversification of resources • “Spot fleet”
  • 21. “Thundering Herd” Fleet of ASG compute instances calling same services at same time • “API rate limit exceeded” “Jittering” the API calls • Introduce randomizations to API calls • Distributes load on infrastructure Service Compute Fleet Instances
  • 22. Next Generation Missions 0 10 20 30 40 50 60 70 80 90 100 OCO-2 SMAP NISAR Estimated Daily Volume (TB) • The volume of data produced is larger than previous missions • Data storage, processing, movement, and costs are the biggest challenges (2021)(2009) (2015)
  • 23. Future of Auto Scaling: Application Auto Scaling
  • 24. Application Auto Scaling: Amazon ECS Scale out Scale in Service ECS cluster Task Task Task Task Task
  • 25. Application Auto Scaling: Spot Fleet Spot Fleet Spot Fleet Auto Scaling
  • 26. Application Auto Scaling: Amazon EMR EMR Cluster
  • 27. Summary • Availability with Fleet Management • Resources on aws.amazon.com/autoscaling • Keeping up with demand with Dynamic Scaling • Case studies on Auto Scaling website • Future of Auto Scaling • Blog posts on Application Auto Scaling • Questions: • autoscaling-reinvent2016-questions@amazon.com
  • 28. Thank you! Additional authors: Hook Hua1, Gerald Manipon1, Michael Starch1, Lan Dang1, Justin Linick1, Sang-Ho Yun1, Gian Franco Sacco1, Piyush Agram1, Susan Owen1, Brian Bue1,, Eric Fielding1, Paul Lundgren1, Angelyn Moore1, Paul Rosen1, Zhen Liu1, Eric Gurrola1, Tom Farr1, Vincent Realmuto1, Frank Webb1, Mark Simons2, Pietro Milillo1 1 Jet Propulsion Laboratory 2 California Institute of Technology