Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge
- 2. http://aws.amazon.com/solutions/case-studies/unilever
“Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs.”
– Pete Keeley, eScience IT Lead for Cloud, Unilever Research
Transforming Informatics for Unilever Research
William Spooner, CTO and Founder, Eagle Genomics
Introduction to AWS | Cambridge 30th May 2012
©Eagle Genomics Ltd
- 3. AWS Case Study: Unilever
Anglo/Dutch multinational
Consumer goods
Over 400 brands in 180 countries
Covers all facets of daily life, from food to cleaning to health and well-being
Unilever Research
Over 6,000 specialists
In twenty countries
http://aws.amazon.com/solutions/case-studies/unilever
- 4. Technology Partner: Eagle
Babraham-based consultancy
Informatics: life science R&D
Customers in US, Europe, Asia
Operating for 4 years
12 Employees
- 5. The DNA Path
1 mile
10,000 letters
1 gene; BRCA2
BReast CAncer 2
Tumor suppressor
© Keith Edkins (CC BY-SA 2.0)
- 6. The Human Genome
3,000,000,000 letters
20,000 genes
At DNA Path scale: about 10 times round the world
First sequence (Human Genome Project, HGP):
Released in 2000
Took 10 years
Cost $100M
© webdesignhot.com (CC SA 3.0)
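The "round the world" figure can be checked from the DNA Path scale of roughly 10,000 letters per mile. The sketch below is only a back-of-envelope check; the Earth circumference of about 24,900 miles is an assumption that does not come from the slides.

```python
# Scale from the DNA Path slide: roughly 10,000 letters per mile.
LETTERS_PER_MILE = 10_000
# Human genome length from this slide.
GENOME_LETTERS = 3_000_000_000
# Assumption (not from the slides): Earth's equatorial circumference in miles.
EARTH_CIRCUMFERENCE_MILES = 24_900


def genome_path_miles(letters: int = GENOME_LETTERS) -> float:
    """Length in miles of a DNA Path long enough to hold `letters` letters."""
    return letters / LETTERS_PER_MILE


def laps_round_the_world(letters: int = GENOME_LETTERS) -> float:
    """Number of times that path would wrap around the equator."""
    return genome_path_miles(letters) / EARTH_CIRCUMFERENCE_MILES


if __name__ == "__main__":
    print(f"Path length: {genome_path_miles():,.0f} miles")
    print(f"Laps round the world: {laps_round_the_world():.1f}")
```

Under these assumptions the path comes out at 300,000 miles, the same order of magnitude as the slide's rounded figure of ten laps.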
- 7. Next Generation DNA Sequencing (NGS)
Latest figures (2012)
Takes 12 days
Costs $10,000
Costs still falling rapidly
© S. Ballard (CC BY-SA 2.0)
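To put the 2012 figures next to the first human sequence (10 years, $100M, per the Human Genome slide), the ratios can be worked out directly. This is a rough sketch that treats a year as 365 days; the variable names are illustrative only.

```python
# Figures quoted on the slides.
HGP_COST_USD = 100_000_000   # first human sequence
HGP_DAYS = 10 * 365          # "took 10 years", ignoring leap days
NGS_COST_USD = 10_000        # 2012 next-generation sequencing
NGS_DAYS = 12

# How many times cheaper and faster the 2012 figures are.
cost_ratio = HGP_COST_USD / NGS_COST_USD
time_ratio = HGP_DAYS / NGS_DAYS

if __name__ == "__main__":
    print(f"Cost reduction: {cost_ratio:,.0f}x")
    print(f"Time reduction: about {time_ratio:.0f}x")
```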
- 8. 1000 Human Genomes
200 TB sequence data
AWS public data set
Data analysis, e.g. fragment assembly
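As a toy illustration of what fragment assembly means, the sketch below greedily merges the pair of reads with the longest suffix/prefix overlap until one sequence remains. It is a teaching example under simplifying assumptions (error-free reads, no repeats, single strand), not the method used on the 1000 Genomes data or by production assemblers; the function names and example reads are hypothetical.

```python
def overlap(a: str, b: str) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`."""
    for n in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(fragments: list[str]) -> str:
    """Repeatedly merge the two fragments with the largest overlap."""
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    # Drop fragments that are fully contained in another fragment.
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]
    while len(frags) > 1:
        best_n, best_i, best_j = -1, 0, 1
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    n = overlap(a, b)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        # Join the best pair, writing the shared overlap only once.
        merged = frags[best_i] + frags[best_j][best_n:]
        frags = [f for k, f in enumerate(frags) if k not in (best_i, best_j)]
        frags.append(merged)
    return frags[0] if frags else ""


if __name__ == "__main__":
    reads = ["ATGGCC", "GCCTTA", "TTAGGA"]  # hypothetical overlapping reads
    print(greedy_assemble(reads))
```

The pairwise comparison is quadratic in the number of reads, which hints at why assembly at the scale of hundreds of terabytes needs substantial compute.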
- 9. HPC at Sanger
Over 10,000 cores
Over 10 PB storage
Supported by a large team
© T. Harris
© Genome Research Ltd.
- 10. HPC with AWS
Virtual supercomputer
50,000 cores
$5,000/hour
Vs. hardware cost of ~$15,000,000
Used for a protein simulation experiment
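The two prices on this slide imply a break-even point: the number of cluster-hours at which renting costs as much as buying. The sketch below uses only the slide's figures and deliberately ignores power, staffing, and depreciation, so it is a rough comparison and not a full cost model. The 8-hour run is a hypothetical example.

```python
# Figures quoted on the slide.
CLOUD_USD_PER_HOUR = 5_000     # 50,000-core virtual supercomputer
HARDWARE_USD = 15_000_000      # approximate cost of equivalent hardware


def cloud_cost(hours: float) -> float:
    """Rental cost of the virtual cluster for a run of `hours` hours."""
    return hours * CLOUD_USD_PER_HOUR


def break_even_hours() -> float:
    """Hours of rental that would cost as much as buying the hardware."""
    return HARDWARE_USD / CLOUD_USD_PER_HOUR


if __name__ == "__main__":
    hours = break_even_hours()
    print(f"Break-even: {hours:,.0f} hours (about {hours / 24:.0f} days of continuous use)")
    print(f"A hypothetical 8-hour run: ${cloud_cost(8):,.0f}")
```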
- 11. Bacterial diversity: Gingivitis
[Figure: bacterial OTUs grouped as Health, Shared, and Gingivitis]
Legend:
Sample (healthy site, n=40)
Sample (gingivitis site, n=36)
OTU coloured by phylum; OTU size proportional to log count
Phyla: Actinobacteria, Bacteroidetes, Cyanobacteria, Firmicutes, Fusobacteria, Proteobacteria, SR1, Spirochaetes, TM7, Tenericutes, Unclassified
© David Taylor, Suzi Adams, Unilever Research. Generated using Cytoscape
- 12. AWS
[Architecture diagram; labels as recovered from the slide]
sFTP, Web UI (on an EC2 instance): main data input/output exchange for user access
Workflow Blackboard Server: job fetching and status updating
S3: data input/output storage
Elastic EC2 instances: instances only launched on workflow demand/load
© David Taylor, Pete Keeley - Unilever Research
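The blackboard arrangement on this slide (workers fetch jobs from a shared server, post status updates, and are only started when there is work) can be sketched in a few lines. This is a local simulation under stated assumptions, not the Eagle or Unilever implementation: threads stand in for elastic EC2 instances, a dictionary stands in for S3, and all class and function names are hypothetical.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class Blackboard:
    """Shared job board: workers claim pending jobs and post status updates."""

    def __init__(self, job_ids):
        self._lock = threading.Lock()
        self.status = {job_id: "pending" for job_id in job_ids}

    def fetch_job(self):
        """Atomically claim one pending job, or return None if none remain."""
        with self._lock:
            for job_id, state in self.status.items():
                if state == "pending":
                    self.status[job_id] = "running"
                    return job_id
        return None

    def update(self, job_id, state):
        """Record a new status for a job."""
        with self._lock:
            self.status[job_id] = state

    def pending_count(self):
        with self._lock:
            return sum(1 for s in self.status.values() if s == "pending")


def worker(board, storage):
    """Stand-in for one elastic instance: work until the board has no jobs."""
    while (job_id := board.fetch_job()) is not None:
        storage[job_id] = f"result-of-{job_id}"  # stand-in for writing to S3
        board.update(job_id, "done")


def run_workflow(job_ids, max_workers=4):
    """Start only as many workers as the pending load needs, up to a cap."""
    board = Blackboard(job_ids)
    storage = {}  # stand-in for S3
    n_workers = min(max_workers, board.pending_count())
    if n_workers:
        with ThreadPoolExecutor(max_workers=n_workers) as pool:
            for _ in range(n_workers):
                pool.submit(worker, board, storage)
    return board.status, storage


if __name__ == "__main__":
    status, storage = run_workflow(["study-1", "study-2", "study-3"])
    print(status)
    print(storage)
```

Because workers pull jobs and never have jobs pushed to them, adding or removing workers needs no change to the board itself, which is one reason this pattern pairs well with instances launched on demand.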
- 13. Results of Pilot
June 2011                    | Jan 2012
10 studies per year          | 50 studies per year
Runtime: weeks (real time)   | Runtime: hours (real time)
1 pipeline                   | 2 pipelines
Run jobs sequentially        | Run all jobs in parallel
1 UK lab                     | 6 global labs
12 direct users              | 50 direct users
Few specialists locally      | All biologists globally
© David Taylor, Pete Keeley - Unilever Research
- 14. “Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs.”
“In addition, its robust architecture supports ten times as many scientists, all working simultaneously.”
– Pete Keeley, eScience IT Lead for Cloud, Unilever Research
info@eaglegenomics.com | www.eaglegenomics.com | +44 (0)1223 654481
blog.eaglegenomics.com | facebook.com/eaglegenomics
@eaglegen | @wspoonr
Eagle® is a registered trademark no. 010418135 of Eagle Genomics Ltd.
Postal address: Eagle Genomics Ltd., Babraham Research Campus, Cambridge CB22 3AT, United Kingdom.