Learning Objectives:
- How to scale out your batch workflows on AWS with minimal effort
- How to think about container and job management within managed, high-throughput workflows
- How to build a scalable orchestration framework with AWS Step Functions
2. What we will cover
• What Are High-Throughput Workflows?
• Architecture Overview
• Service Overview – AWS Batch
• Service Overview – AWS Step Functions
• Architecture Deep Dive
3. What are high-throughput workflows?
[Diagram: a linear workflow — Start → Pre-processing → Long-running operation → Post-processing → Copy results to S3 → End]
Now run this same workflow for thousands
of inputs while also:
• Starting each step at the right time
• Running each step on appropriate
compute resources
• Managing concurrency
• Scaling infrastructure up and down
• Handling errors
• Providing notifications
• Accelerating workflow development
[Diagram labels: the steps have differing resource profiles — network I/O and CPU; disk I/O and large memory; GPU-accelerated; network I/O]
4. High-throughput workflows are everywhere
• Media & Entertainment
• Transportation & Logistics
• Manufacturing & Design
• Financial Services
• Life Sciences
• Earth Sciences & Geospatial Analytics
7. Introducing AWS Batch
• Fully Managed: No software to install or servers to manage. AWS Batch provisions and scales your infrastructure.
• Integrated with AWS: AWS Batch jobs can easily and securely interact with services such as Amazon S3, DynamoDB, and Rekognition.
• Cost-Efficient: AWS Batch launches compute resources tailored to your jobs and can provision Amazon EC2 and EC2 Spot instances.
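Submitting work to AWS Batch is a single API call per job. As a hedged sketch (the job name, queue, and definition below are hypothetical placeholders), the payload for boto3's `batch.submit_job` can be built separately from the call itself, which keeps it easy to inspect and test without AWS credentials:

```python
import json

def build_submit_job_request(job_name, job_queue, job_definition, command,
                             vcpus=2, memory_mib=4096):
    """Build the keyword arguments for boto3's batch.submit_job call."""
    return {
        "jobName": job_name,
        "jobQueue": job_queue,
        "jobDefinition": job_definition,
        "containerOverrides": {
            "command": command,
            # Per-job resource overrides let one job definition serve
            # workflow steps with different CPU/memory profiles.
            "resourceRequirements": [
                {"type": "VCPU", "value": str(vcpus)},
                {"type": "MEMORY", "value": str(memory_mib)},
            ],
        },
    }

# With AWS credentials configured, the job would be submitted like this:
#   import boto3
#   batch = boto3.client("batch")
#   response = batch.submit_job(**build_submit_job_request(...))
request = build_submit_job_request(
    "preprocess-0001", "my-job-queue", "my-job-def:1",
    ["python", "preprocess.py"],
)
print(json.dumps(request, indent=2))
```

Separating payload construction from the API call is also what makes the batch layer easy to automate for thousands of inputs: loop over inputs, build one request each, submit.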
12. AWS Step Functions makes it easy to coordinate the components of distributed applications using visual workflows.
13. Application Lifecycle in AWS Step Functions
• Define in JSON
• Visualize in the Console
• Monitor Executions
14. Seven State Types
• Task: A single unit of work
• Choice: Adds branching logic
• Parallel: Fork and join the data across tasks
• Wait: Delay for a specified time
• Fail: Stops an execution and marks it as a failure
• Succeed: Stops an execution successfully
• Pass: Passes its input to its output
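The state types above compose into a JSON state machine written in the Amazon States Language. A minimal sketch combining Task, Choice, Succeed, and Fail states might look like this (the Lambda ARN and state names are hypothetical):

```python
import json

# Hypothetical Lambda ARN; in a real account this would point at your function.
TASK_ARN = "arn:aws:lambda:us-east-1:123456789012:function:ProcessInput"

definition = {
    "Comment": "Minimal workflow exercising Task, Choice, Succeed, and Fail",
    "StartAt": "ProcessInput",
    "States": {
        # Task: a single unit of work.
        "ProcessInput": {
            "Type": "Task",
            "Resource": TASK_ARN,
            "Next": "CheckStatus",
        },
        # Choice: branch on the task's output.
        "CheckStatus": {
            "Type": "Choice",
            "Choices": [
                {"Variable": "$.status", "StringEquals": "OK", "Next": "Done"}
            ],
            "Default": "JobFailed",
        },
        "Done": {"Type": "Succeed"},
        "JobFailed": {"Type": "Fail", "Error": "ProcessingError"},
    },
}

asl_json = json.dumps(definition, indent=2)
print(asl_json)
```

This is the same JSON you would paste into the Step Functions console, where it renders as the visual workflow shown on the next slide.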
15. Build Visual Workflows Using State Types
[Diagram: an example state machine in the AWS Step Functions console combining Task, Choice, Parallel, and Fail states, with branches labeled Mountains, People, and Snow]
20. Considerations for Batch Layer: Data Sharing
Consideration: Jobs are managed at the container level, not the instance level, so there is no guarantee that consecutive containers in a workflow will run on the same instance.
Solution: Stage all data in Amazon S3, and read and write everything from there. This is also important for traceability, logging, etc.
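One way to make S3 staging systematic is a deterministic key convention, so every step knows where to read its input and write its output without any instance-local state. The prefix layout below is a hypothetical convention, not an AWS requirement:

```python
def s3_prefix(bucket, workflow_id, step_name):
    """Deterministic S3 prefix so every step in a workflow run knows
    where to read and write without sharing local disk."""
    return f"s3://{bucket}/workflows/{workflow_id}/{step_name}/"

# A step reads from the previous step's prefix and writes to its own:
inp = s3_prefix("my-batch-bucket", "run-0042", "pre-processing")
out = s3_prefix("my-batch-bucket", "run-0042", "long-running-operation")
# The actual transfer would use boto3, e.g.:
#   boto3.client("s3").download_file("my-batch-bucket", key, local_path)
print(inp)
print(out)
```

A side benefit for traceability: every intermediate artifact of every run is addressable under one prefix, which makes logging and debugging a failed run straightforward.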
21. Considerations for Batch Layer: Multitenancy
Consideration: Multiple containers may run batch processes on the same instance in the same base working directory.
Solution: Within the scratch directory, each batch process creates a subfolder with a unique ID and writes all scratch data to that subdirectory.
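A minimal sketch of that isolation pattern, assuming a shared scratch volume mounted into each container (here simulated with a temporary directory):

```python
import os
import tempfile
import uuid

def make_scratch_subdir(base_scratch_dir):
    """Create a uniquely named subdirectory so co-located containers
    sharing the same base scratch directory never collide."""
    subdir = os.path.join(base_scratch_dir, uuid.uuid4().hex)
    os.makedirs(subdir)
    return subdir

base = tempfile.mkdtemp()  # stand-in for the instance's scratch volume
a = make_scratch_subdir(base)
b = make_scratch_subdir(base)
print(a != b)  # each process gets its own isolated directory
```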
22. Considerations for Batch Layer: Volume Reuse
Consideration: Scratch data should live only as long as the job using it, in order to optimize instance and Amazon EBS storage costs.
Solution: Within the scratch directory, each batch process creates a subfolder with a unique ID and writes all scratch data to that subdirectory, then deletes the subdirectory at the end of the job.
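The create-use-delete lifecycle maps naturally onto a context manager, which guarantees cleanup even if the job's work raises an exception. A sketch under the same shared-scratch-volume assumption:

```python
import contextlib
import os
import shutil
import tempfile
import uuid

@contextlib.contextmanager
def scratch_space(base_scratch_dir):
    """Yield a unique scratch subdirectory and delete it when the job
    ends, so storage on the (possibly reused) volume is reclaimed."""
    subdir = os.path.join(base_scratch_dir, uuid.uuid4().hex)
    os.makedirs(subdir)
    try:
        yield subdir
    finally:
        shutil.rmtree(subdir, ignore_errors=True)

base = tempfile.mkdtemp()  # stand-in for the instance scratch volume
with scratch_space(base) as work_dir:
    # The job writes its intermediate files here...
    open(os.path.join(work_dir, "intermediate.dat"), "w").close()
    existed = os.path.isdir(work_dir)
print(existed, os.listdir(base))  # subdirectory is gone once the job ends
```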
25. A Flexible Workflow Deployment Model
• Decouple batch engine and workflow orchestration
• Workflow creation is now done in JSON
• Easier to deploy
• Easier to automate
• Easier to test
• Can integrate non-Batch applications as well
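One payoff of workflows-as-JSON is that they can be checked in CI before deployment. As a hedged sketch (this checks only basic Amazon States Language structure, not the full specification):

```python
import json

def validate_workflow(definition_json):
    """Lightweight pre-deployment check for a Step Functions definition:
    verifies it parses and that StartAt names a defined state."""
    d = json.loads(definition_json)
    errors = []
    if "StartAt" not in d or "States" not in d:
        errors.append("definition must contain StartAt and States")
    elif d["StartAt"] not in d["States"]:
        errors.append(f"StartAt state {d['StartAt']!r} is not defined")
    return errors

workflow = json.dumps(
    {"StartAt": "Step1", "States": {"Step1": {"Type": "Succeed"}}}
)
print(validate_workflow(workflow))  # an empty list means the checks pass
```

Validated definitions can then be deployed programmatically (e.g. via boto3's Step Functions client), which is what makes this model easy to automate.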