Deep Dive with Amazon EC2 Container Service Hands-on Workshop

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Pahud Hsieh(謝洪恩), Solutions Architect
Aug 01, 2017
Hong Kong
Amazon ECS Advanced Workshop

What to Expect from the Workshop
• Introduction to ECS
• Lab1 Getting Started with ECS and
CloudFormation
• Lab2 ECS Windows Container
• Lab3 ECS CI/CD
• Lab4 ECS service autoscaling and host
autoscaling
• Lab5 Log consolidation

What to Expect from the Workshop
• Lab6 ECS Events
• Lab7 Spot Fleet support
• Lab8 host scale in with container draining
• Lab9 credentials management
• Lab10 ECS Service with external and internal
ALB
• Lab11 service discovery with etcd3
• Q & A

Prerequisite
• AWS Account (personal preferred)
• IAM User with AdministratorAccess policy

Amazon ECS: Under the Hood
ALB ALB
AZ 1 AZ 2
user / scheduler
Scheduler
Cluster State Service
Placement Engine
Event Stream

https://github.com/pahud/ecs-cfn-refarch

Lab1 – ECS with CloudFormation

Lab2 – ECS with Windows Container

Amazon EC2 Container Service (ECS)
EC2 INSTANCES
ECS
AGENT
ECS
AGENT
Amazon
ECS
ECS
AGENT
DEPLOYMENT
AUTOMATION

Deployment – In Place – Doubling
Availability Zone Availability Zone
Scenario
Service’s task definition is
updated to a new revision with
parameters:
Desired Count = 2
Minimum Healthy Percent = 100%
Maximum Percent = 200%
These settings permit the service
to grow to double its desired size
during deployment
EXISTING EXISTING

Two new tasks are started
growing the number of tasks to
200% of its desired count which is
the maximum permitted
EXISTING EXISTINGNEW NEW
Desired Count = 2

After the new tasks are verified to
be healthy by the Elastic Load
Balancer health check, the two
previous tasks with the older task
definition are drained and stopped
NEW NEW
Desired Count = 2

Deployment – In Place – Rolling
Scenario
Service’s task definition is
updated to a new revision with
parameters:
Desired Count = 2
These settings constrain the
service to not exceed its desired
size but allows it to halve the
number of tasks during
deployment
EXISTING EXISTING

First, an existing task is stopped
which brings the healthy
percentage of the service to 50%
and makes room on the cluster for
new tasks
EXISTING
Desired Count = 2

A task using the new task
definition is started bringing the
service back to 100%
EXISTING
Desired Count = 2
NEW

After the new task is verified to be
healthy by the Elastic Load
Balancer health check, the next
existing task with the older task
definition is drained and stopped
Desired Count = 2
NEW

The second new task is started on
the cluster bringing the service
back to 100%
NEW NEW
Desired Count = 2

Deployment – Canary
Scenario
The new revision runs as a small
subset of production by deploying
a canary service in the same
target group
Deployment is completed by
updating the primary service’s
task definition and scaling down
the canary service. EXISTING EXISTINGEXISTING

A standalone service with the new
task definition is deployed using
the same Application Load
Balancer target group of the
existing service
EXISTING EXISTINGEXISTING CANARY

After some period of monitoring
the metrics from the canary
instance, the existing service’s
task definition is updated to the
new revision
NEW NEWNEW CANARY

After the deployment, all tasks are
running the same task definition
with the new revision of the
application and the canary can be
destroyed
NEW NEWNEW

Deployment – Blue/Green – DNS Swap
Availability Zone
EXISTING EXISTING
www.myproduct.com
Scenario
Two services are defined each
with their own Application Load
Balancer
swapping the Route 53 alias
record between the two
Application Load Balancers
Availability Zone

Availability Zone
EXISTING EXISTING
www.myproduct.com
An identical Application Load
Balancer and a service with a task
definition using the new revision is
deployed
Availability Zone
NEW NEW
next.myproduct.com

Availability Zone
EXISTING EXISTING
next.myproduct.com
After automated or manual
testing, the deployment is
completed by swapping the Route
53 alias record between the two
Application Load Balancers
Availability Zone
NEW NEW
www.myproduct.com

Availability Zone
The previous service and its
Application Load Balancer can
then be destroyed
Availability Zone
NEW NEW
www.myproduct.com

Deployment – Blue/Green – Target Group Swap
Availability Zone
EXISTING EXISTING
Scenario
Two services are defined each
with their own target group
registered in the same Application
Load Balancer using Host-based
routing
swapping the listener rules
between the two target groups
Availability Zone

Availability Zone
EXISTING EXISTING
The second service is deployed
with a new target group and
registered to the same Application
Load Balancer
Using Host-based routing, requests
to www.myproduct.com are
directed to our blue service while
requests to next.myproduct.com
are directed to our green service NEW NEW
Availability Zone

Availability Zone
After automated or manual testing,
the deployment can be completed
by swapping the listener rules on
the Application Load Balancer and
sending traffic to the green service
NEW NEW
Availability Zone
EXISTING EXISTING

Availability Zone
The previous service and its target
group can then be destroyed
NEW NEW
Availability Zone

Best Practices
• In-place Doubling – beware the CPU and
Memory reservation with host autoscaling
ahead
• Monitor the health of ecs-agent
• Monitor your CloudWatch metrics and Logs
• Canary deployment – test your pilot
workload with the same backend capacity
• Prepare your rollback plan

Lab4 – ECS Service and Host Autoscaling

Best Practices
• CloudWatch Logs Agent
1. In the same Docker image
2. In the same Task Definition
3. Standalone task with distinct
placement
• Alternative – Fluentd

Takeaways
• Monitor state changes for containers
and tasks
• On state changes, trigger SNS and
Lambda for advanced coordination or
scheduling
• Monitor cloudtrail logs(API usage) as
well

Takeaways
• Use lowestPrice allocation strategy for
dev/testing
• Use diversified allocation strategy for
pilot, staging or pre-prod
• Beware of the host termination – you
have no enough time for container
draining
• Be careful to deploy in Production

Lab8 – ASG Scale In and Container Draining

Takeaways
• Container Draining may take up to 5-10
min
• Beware of the draining of batch tasks
• Stay patient and test

Lab9 – Credentials Management

Takeaways
• Restrict the access of parameter and
KMS key in the ECS Task Role
• Use Parameter Hierarchy, Tagging
and Notification
$ aws ssm get-parameter-by-path –path /Dev/Web/Nginx
$ aws ssm get-parameter-by-path –path /Testing/Web/Nginx
$ aws ssm get-parameter-by-path –path /Prod/Web/Nginx –with-encryption

Lab10 – External and Internal ALB

Takeaways
• Use ALB host-based and path-based
routing
• Fanout your task events for multiple
operations – e.g. Route53 update
• We have a PFR for this

Deep Dive with Amazon EC2 Container Service Hands-on Workshop

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Deep Dive with Amazon EC2 Container Service Hands-on Workshop

Ähnlich wie Deep Dive with Amazon EC2 Container Service Hands-on Workshop (20)

Mehr von Amazon Web Services

Mehr von Amazon Web Services (20)

Deep Dive with Amazon EC2 Container Service Hands-on Workshop