This document discusses using Amazon Web Services for scientific computing. It describes how AWS provides scalable, cost-effective, and reliable cloud infrastructure that researchers can use on-demand. Specific AWS services that can benefit scientific applications are mentioned, including Amazon EC2 for flexible computing power, S3 for data storage, and Elastic MapReduce for hosted Hadoop services. Examples are given of scientific projects that have successfully used AWS, such as genome analysis and astrophysics simulations. The document argues that AWS enables new approaches to doing science by providing access to vast, on-demand computing resources and platforms for distributed applications and data sharing.
121. Crossbow: Rapid whole genome SNP analysis
Preprocessed reads
Map: Bowtie
Sort: Bin and partition
Reduce: SoapSNP
Langmead B, Trapnell C, Pop M, Salzberg SL. Genome Biol 2009, 10(3):R25.
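The three-stage pipeline above can be sketched as a toy MapReduce flow. This is a hypothetical, heavily simplified illustration: the real Crossbow runs Bowtie (map) and SoapSNP (reduce) on a Hadoop cluster, and the naive string alignment and SNP call below are stand-ins, not the actual algorithms.

```python
from collections import defaultdict

# Toy reference genome and reads (hypothetical data for illustration only).
REFERENCE = "ACGTACGTACGT"

def map_align(read):
    """Map stage (Bowtie in Crossbow): align a read, emit (position, sequence).
    A naive substring search stands in for real short-read alignment."""
    pos = REFERENCE.find(read[:4])
    return (pos, read)

def sort_partition(alignments, bin_size=4):
    """Sort stage: bin alignments by genomic region so that each reducer
    receives all reads overlapping its partition of the genome."""
    bins = defaultdict(list)
    for pos, seq in alignments:
        if pos >= 0:  # drop unaligned reads
            bins[pos // bin_size].append((pos, seq))
    return bins

def reduce_call_snps(bins):
    """Reduce stage (SoapSNP in Crossbow): per bin, compare aligned bases
    against the reference and report mismatches as candidate SNPs."""
    snps = []
    for _, aligned in sorted(bins.items()):
        for pos, seq in aligned:
            for offset, base in enumerate(seq):
                ref_i = pos + offset
                if ref_i < len(REFERENCE) and base != REFERENCE[ref_i]:
                    snps.append((ref_i, REFERENCE[ref_i], base))
    return snps

reads = ["ACGTAGGT", "TACGTACG"]
alignments = [map_align(r) for r in reads]
snps = reduce_call_snps(sort_partition(alignments))
print(snps)  # each tuple: (position, reference base, observed base)
```

On Hadoop, the "sort" stage is the framework's shuffle: keying each alignment by genomic bin is what routes all reads for a region to the same reducer.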
122. Crossbow condenses over 1,000 hours of resequencing computation into a few hours, without requiring the user to own or operate a computer cluster
125. BLAT @ U. PENN
Map 100 million 100-base paired-end reads
A quad-core machine with 5 GB of RAM would take 16 days
30 high-memory instances; 32 hours; $195
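The slide's numbers can be sanity-checked with simple arithmetic. All inputs come from the slide; the per-instance-hour rate is derived from them, not a quoted AWS price.

```python
# Back-of-the-envelope check of the BLAT figures above.
sequential_hours = 16 * 24  # one quad-core machine: 16 days
cloud_hours = 32            # wall-clock time on the cloud
instances = 30              # high-memory instances used
total_cost = 195            # total bill in USD

# Wall-clock speedup from running 30 instances in parallel.
speedup = sequential_hours / cloud_hours

# Effective cost per instance-hour implied by the total bill.
cost_per_instance_hour = total_cost / (instances * cloud_hours)

print(f"wall-clock speedup: {speedup:.0f}x")
print(f"effective rate: ${cost_per_instance_hour:.2f}/instance-hour")
```

The implied rate of roughly $0.20 per instance-hour is the kind of figure worth checking against current pricing, since on-demand rates change over time.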
126. GALAXY MAPPING
Goal: Create an astrometric catalog of a billion stars with micro-arcsecond precision
Gaia satellite launched 2011; observations through 2017; catalog ready 2019
Problem: A single pass through the data for image processing would take 30 years (on one CPU)
Solution: Use AWS
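The appeal of the cloud here is a simple scaling argument: a 30-CPU-year job shrinks to days when spread across many instances. The core counts below are illustrative assumptions, not figures from the Gaia project, and the calculation assumes near-perfect parallelism.

```python
# Rough scaling of a 30-CPU-year, single-pass workload across N cores.
cpu_years = 30  # from the slide: one pass on one CPU takes 30 years

for cores in (100, 1000, 10000):  # hypothetical cluster sizes
    days = cpu_years * 365 / cores  # assumes the pass parallelizes cleanly
    print(f"{cores:>6} cores: ~{days:.1f} days")
```

Embarrassingly parallel image processing (each image handled independently) is close to this ideal, which is why a single data pass maps so well onto on-demand instances.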
127. [Figure: two plots of capacity, resources, and demand over time. A static data center provisions fixed capacity above peak demand, leaving unused resources; a data center in the cloud scales capacity to track demand.]
128. HEAVY-ION COLLISIONS
Problem: Quark Matter physics conference imminent but no compute resources handy
Solution: The Nimbus context broker allowed researchers to provision 300 nodes and get the simulations done