SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
1/21Copyright 2013 InVitae, Inc
Reece Hart, Ph.D.
reece@invitae.com
InVitae, Inc.
invitae.com
Developing a Clinical GenomeDeveloping a Clinical Genome
Interpretation Pipeline at AWSInterpretation Pipeline at AWS
2/21Copyright 2013 InVitae, Inc
The MissionThe Mission
To provide comprehensive,
clinically-relevant information from
genomic variation data in a single test.
3/21Copyright 2013 InVitae, Inc
one sample
one requisition
one report
up to 264 conditions
two weeks
one lab, one price
InVitae's process features online requisitioning and reporting, CLIA-certified
sequencing, and a HIPAA-compliant information management.
intake
Requisitioning and Laboratory Information Management System
interpretationsequencing review
4/21Copyright 2013 InVitae, Inc
Where does InVitae fit?Where does InVitae fit?
photos:
Baylor College of Medicine, Univ. Utah, learningradiology.com, sciencephotos.com
Patient presents with symptoms
If genomic interpretation might influence
diagnosis or treatment, doctor refers
patient to genetic counselor
GC takes history; sample is sent
to internal or one of hundreds of
labs that provide specific
genomic tests
Sequencing and other lab
data are processed into
preliminary iterpretation
Report is returned to GC
and/or physician who
verify interpretation and
consult with patient
5/21Copyright 2013 InVitae, Inc
http://www.ncbi.nlm.nih.gov/sites/GeneTests/
6/21Copyright 2013 InVitae, Inc
Examples of published clinical variantsExamples of published clinical variants
➢ Inherited conditions
● NM_000136.2:c.355_360delATGAGAinsT
‒ at risk for Fanconi Anemia
➢ Carrier conditions
● NM_000520.4:c.1277_1278insGATA
‒ carrier for Tay-Sachs
➢ Pharmacogenetic conditions
● HLAB*1502 & HLAB*5701 haplotypes (23 snps)
‒ Abacavir drug-induced hypersensitivity
‒ Flucloxacillin drug-induced liver injury
‒ Carbamazepine drug-induced cutaneous adverse events
7/21Copyright 2013 InVitae, Inc
Reported VariantsReported Variants
8/21Copyright 2013 InVitae, Inc
Our ReportOur Report
9/21Copyright 2013 InVitae, Inc
10/21Copyright 2013 InVitae, Inc
NOW
Early Access
Commercial Program
(CLIA Certified)
264
GENETIC TESTS
$1,500
2014
Goal to update offering
every six months
>1000
GENETIC TESTS
<$1,000
50x minimum, ~425x average
100% coverage of curated variants
>90% of targeted regions
SNVs, indels (<100nt), VUS
11/21Copyright 2013 InVitae, Inc
How?How?
By deeply integrating custom genetic
curation, sequencing assay design, and
analytical pipelines.
12/21Copyright 2013 InVitae, Inc
Curation + Assay Design + PipelineCuration + Assay Design + Pipeline
curation
analysis pipeline
A>T
sequence assay
curation
13/21Copyright 2013 InVitae, Inc
Protected Health Information
on-site, encrypted, restricted access
Architectural OverviewArchitectural Overview
IPSec
Tunnel
4.5GB up
4 MB down
14/21Copyright 2013 InVitae, Inc
Pipeline OverviewPipeline Overview
aligned readsreads
variantsaligned reads
reportvariants
bwa
samtools
gatk
picard
+ assay regions
+ GRCh37 ref
+ 1kg known indels
gatk
freebayes
custom variant calling
+ qualities
+ metrics
+ quality metrics
call haplotypes
match variants
VUS pipeline
render
15/21Copyright 2013 InVitae, Inc
AWS TopologyAWS Topology
1 VPC
3 subnets
NFS
interactive hosts
web services
scaled dynamically
build/test
16/21Copyright 2013 InVitae, Inc
What's worked well at AWS?What's worked well at AWS?
➢ Performance and Capacity Scaling
● on-demand
➢ Security and VPG
● Simple, comprehensive rules for VPCs
➢ Service ecosystem: EC2, EBS, S3, R53, IAM, VPC
● boto!
➢ Future:
● Archive: S3 and Glacier?
● Investigate workflows
● Reanalysis
17/21Copyright 2013 InVitae, Inc
What challenges have we experienced?What challenges have we experienced?
➢ Large shared fileystems
● poor and variable performance (better now)
● familiarity, flexibility, transparency
● existing software expectations
➢ Node failures
● Once upon a time... 0-3 times per day, no warning
● moved zones no longer an issue
➢ Devops
● Built our own, but moving to puppet
18/21Copyright 2013 InVitae, Inc
19/21Copyright 2013 InVitae, Inc
One source of variation...One source of variation...
attribution unknown
21/21Copyright 2013 InVitae, Inc

Weitere ähnliche Inhalte

Was ist angesagt?

Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made EasyGolden Helix Inc
 
Drs. Jeff Zimmerman & Rodger Main - Evolution of Biosurveillance
Drs. Jeff Zimmerman & Rodger Main - Evolution of BiosurveillanceDrs. Jeff Zimmerman & Rodger Main - Evolution of Biosurveillance
Drs. Jeff Zimmerman & Rodger Main - Evolution of BiosurveillanceJohn Blue
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank NothaftThe Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank NothaftDatabricks
 
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Pistoia Alliance
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceIRIDA_community
 
ApolloDx's mDx Platform - Real-Time Results Transmitted Instantly
ApolloDx's mDx Platform - Real-Time Results Transmitted InstantlyApolloDx's mDx Platform - Real-Time Results Transmitted Instantly
ApolloDx's mDx Platform - Real-Time Results Transmitted InstantlyApolloDx
 
PDA Annual Meeting Orlando March 2010 Mj
PDA Annual Meeting Orlando March 2010 MjPDA Annual Meeting Orlando March 2010 Mj
PDA Annual Meeting Orlando March 2010 MjMWJornitz
 

Was ist angesagt? (11)

Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made Easy
 
Drs. Jeff Zimmerman & Rodger Main - Evolution of Biosurveillance
Drs. Jeff Zimmerman & Rodger Main - Evolution of BiosurveillanceDrs. Jeff Zimmerman & Rodger Main - Evolution of Biosurveillance
Drs. Jeff Zimmerman & Rodger Main - Evolution of Biosurveillance
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank NothaftThe Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
 
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
Sequence analysis in the regulated domain - A Pistoia Alliance Debates webina...
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
 
c5an90032h
c5an90032hc5an90032h
c5an90032h
 
Analytics in Pharmaceutical Industry
Analytics in Pharmaceutical IndustryAnalytics in Pharmaceutical Industry
Analytics in Pharmaceutical Industry
 
ApolloDx's mDx Platform - Real-Time Results Transmitted Instantly
ApolloDx's mDx Platform - Real-Time Results Transmitted InstantlyApolloDx's mDx Platform - Real-Time Results Transmitted Instantly
ApolloDx's mDx Platform - Real-Time Results Transmitted Instantly
 
PDA Annual Meeting Orlando March 2010 Mj
PDA Annual Meeting Orlando March 2010 MjPDA Annual Meeting Orlando March 2010 Mj
PDA Annual Meeting Orlando March 2010 Mj
 

Ähnlich wie AWS Life Sciences

WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...Amazon Web Services
 
Optimizing the Output of Your Molecular Pathology Laboratory
Optimizing the Output of Your Molecular Pathology LaboratoryOptimizing the Output of Your Molecular Pathology Laboratory
Optimizing the Output of Your Molecular Pathology LaboratoryJosh Forsythe
 
VSWarehouse; a scalable, rapid genomic repository solution
VSWarehouse; a scalable, rapid genomic repository solutionVSWarehouse; a scalable, rapid genomic repository solution
VSWarehouse; a scalable, rapid genomic repository solutionGolden Helix
 
Bringing NGS Testing In-House
Bringing NGS Testing In-HouseBringing NGS Testing In-House
Bringing NGS Testing In-HouseJosh Forsythe
 
In Vitro Cardiac Safety Assessment
In Vitro Cardiac Safety Assessment In Vitro Cardiac Safety Assessment
In Vitro Cardiac Safety Assessment Covance
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made EasyGolden Helix
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Production Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionProduction Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionChris Dwan
 
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuJax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuAnne Deslattes Mays
 
Article IVD March 2006
Article IVD March 2006Article IVD March 2006
Article IVD March 2006Fabrice Sultan
 
Aug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationAug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationGenomeInABottle
 
Leveraging Data to Develop, Execute and Exceed the Expectations of Your Regu...
Leveraging Data to Develop, Execute and Exceed the Expectations of  Your Regu...Leveraging Data to Develop, Execute and Exceed the Expectations of  Your Regu...
Leveraging Data to Develop, Execute and Exceed the Expectations of Your Regu...April Bright
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqGolden Helix Inc
 
Melbourne Genomics Health Alliance
Melbourne Genomics Health AllianceMelbourne Genomics Health Alliance
Melbourne Genomics Health AllianceAmazon Web Services
 
Building a Clinical NGS Program
Building a Clinical NGS ProgramBuilding a Clinical NGS Program
Building a Clinical NGS ProgramLisa Owen
 
Golden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix
 
SITIST 2015 Dev - Turning big data into presicion medicine real life examples
SITIST 2015 Dev - Turning big data into presicion medicine real life examplesSITIST 2015 Dev - Turning big data into presicion medicine real life examples
SITIST 2015 Dev - Turning big data into presicion medicine real life examplessitist
 

Ähnlich wie AWS Life Sciences (20)

WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
 
Optimizing the Output of Your Molecular Pathology Laboratory
Optimizing the Output of Your Molecular Pathology LaboratoryOptimizing the Output of Your Molecular Pathology Laboratory
Optimizing the Output of Your Molecular Pathology Laboratory
 
VSWarehouse; a scalable, rapid genomic repository solution
VSWarehouse; a scalable, rapid genomic repository solutionVSWarehouse; a scalable, rapid genomic repository solution
VSWarehouse; a scalable, rapid genomic repository solution
 
Bringing NGS Testing In-House
Bringing NGS Testing In-HouseBringing NGS Testing In-House
Bringing NGS Testing In-House
 
In Vitro Cardiac Safety Assessment
In Vitro Cardiac Safety Assessment In Vitro Cardiac Safety Assessment
In Vitro Cardiac Safety Assessment
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made Easy
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Production Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionProduction Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on Production
 
BioData World Basel 2018
BioData World Basel 2018BioData World Basel 2018
BioData World Basel 2018
 
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuJax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
 
Article IVD March 2006
Article IVD March 2006Article IVD March 2006
Article IVD March 2006
 
Aug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationAug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validation
 
Leveraging Data to Develop, Execute and Exceed the Expectations of Your Regu...
Leveraging Data to Develop, Execute and Exceed the Expectations of  Your Regu...Leveraging Data to Develop, Execute and Exceed the Expectations of  Your Regu...
Leveraging Data to Develop, Execute and Exceed the Expectations of Your Regu...
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
 
Melbourne Genomics Health Alliance
Melbourne Genomics Health AllianceMelbourne Genomics Health Alliance
Melbourne Genomics Health Alliance
 
Building a Clinical NGS Program
Building a Clinical NGS ProgramBuilding a Clinical NGS Program
Building a Clinical NGS Program
 
Ginipacs
GinipacsGinipacs
Ginipacs
 
2015-fall
2015-fall2015-fall
2015-fall
 
Golden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical Labs
 
SITIST 2015 Dev - Turning big data into presicion medicine real life examples
SITIST 2015 Dev - Turning big data into presicion medicine real life examplesSITIST 2015 Dev - Turning big data into presicion medicine real life examples
SITIST 2015 Dev - Turning big data into presicion medicine real life examples
 

Mehr von Reece Hart

HGVS 2015 poster: hgvs, uta, variantanalyzer
HGVS 2015 poster: hgvs, uta, variantanalyzerHGVS 2015 poster: hgvs, uta, variantanalyzer
HGVS 2015 poster: hgvs, uta, variantanalyzerReece Hart
 
Clinical significance of transcript alignment discrepancies gne - 20141016
Clinical significance of transcript alignment discrepancies   gne - 20141016Clinical significance of transcript alignment discrepancies   gne - 20141016
Clinical significance of transcript alignment discrepancies gne - 20141016Reece Hart
 
The Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesThe Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesReece Hart
 
Invitae PSB 2014 poster
Invitae PSB 2014 posterInvitae PSB 2014 poster
Invitae PSB 2014 posterReece Hart
 
ASHG 2012 Poster
ASHG 2012 PosterASHG 2012 Poster
ASHG 2012 PosterReece Hart
 
Building a clinical genome interpretation services company
Building a clinical genome interpretation services companyBuilding a clinical genome interpretation services company
Building a clinical genome interpretation services companyReece Hart
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsReece Hart
 
HVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationHVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationReece Hart
 
Introduction to and Applications of Unison, an Open Source Database for Targe...
Introduction to and Applications of Unison, an Open Source Database for Targe...Introduction to and Applications of Unison, an Open Source Database for Targe...
Introduction to and Applications of Unison, an Open Source Database for Targe...Reece Hart
 
Unison: Enabling easy, rapid, and comprehensive proteomic mining
Unison: Enabling easy, rapid, and comprehensive proteomic miningUnison: Enabling easy, rapid, and comprehensive proteomic mining
Unison: Enabling easy, rapid, and comprehensive proteomic miningReece Hart
 
A Tour of Research Computing at Genentech
A Tour of Research Computing at GenentechA Tour of Research Computing at Genentech
A Tour of Research Computing at GenentechReece Hart
 
Integrating Public and Private Data: Lessons Learned from Unison
Integrating Public and Private Data: Lessons Learned from UnisonIntegrating Public and Private Data: Lessons Learned from Unison
Integrating Public and Private Data: Lessons Learned from UnisonReece Hart
 
Unison: An Integrated Platform for Computational Biology Discovery
Unison: An Integrated Platform for Computational Biology DiscoveryUnison: An Integrated Platform for Computational Biology Discovery
Unison: An Integrated Platform for Computational Biology DiscoveryReece Hart
 
Mining for Novel TNF Ligands
Mining for Novel TNF LigandsMining for Novel TNF Ligands
Mining for Novel TNF LigandsReece Hart
 

Mehr von Reece Hart (14)

HGVS 2015 poster: hgvs, uta, variantanalyzer
HGVS 2015 poster: hgvs, uta, variantanalyzerHGVS 2015 poster: hgvs, uta, variantanalyzer
HGVS 2015 poster: hgvs, uta, variantanalyzer
 
Clinical significance of transcript alignment discrepancies gne - 20141016
Clinical significance of transcript alignment discrepancies   gne - 20141016Clinical significance of transcript alignment discrepancies   gne - 20141016
Clinical significance of transcript alignment discrepancies gne - 20141016
 
The Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesThe Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment Discrepancies
 
Invitae PSB 2014 poster
Invitae PSB 2014 posterInvitae PSB 2014 poster
Invitae PSB 2014 poster
 
ASHG 2012 Poster
ASHG 2012 PosterASHG 2012 Poster
ASHG 2012 Poster
 
Building a clinical genome interpretation services company
Building a clinical genome interpretation services companyBuilding a clinical genome interpretation services company
Building a clinical genome interpretation services company
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome Commons
 
HVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationHVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome Interpretation
 
Introduction to and Applications of Unison, an Open Source Database for Targe...
Introduction to and Applications of Unison, an Open Source Database for Targe...Introduction to and Applications of Unison, an Open Source Database for Targe...
Introduction to and Applications of Unison, an Open Source Database for Targe...
 
Unison: Enabling easy, rapid, and comprehensive proteomic mining
Unison: Enabling easy, rapid, and comprehensive proteomic miningUnison: Enabling easy, rapid, and comprehensive proteomic mining
Unison: Enabling easy, rapid, and comprehensive proteomic mining
 
A Tour of Research Computing at Genentech
A Tour of Research Computing at GenentechA Tour of Research Computing at Genentech
A Tour of Research Computing at Genentech
 
Integrating Public and Private Data: Lessons Learned from Unison
Integrating Public and Private Data: Lessons Learned from UnisonIntegrating Public and Private Data: Lessons Learned from Unison
Integrating Public and Private Data: Lessons Learned from Unison
 
Unison: An Integrated Platform for Computational Biology Discovery
Unison: An Integrated Platform for Computational Biology DiscoveryUnison: An Integrated Platform for Computational Biology Discovery
Unison: An Integrated Platform for Computational Biology Discovery
 
Mining for Novel TNF Ligands
Mining for Novel TNF LigandsMining for Novel TNF Ligands
Mining for Novel TNF Ligands
 

Kürzlich hochgeladen

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 

Kürzlich hochgeladen (20)

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 

AWS Life Sciences

  • 1. 1/21Copyright 2013 InVitae, Inc Reece Hart, Ph.D. reece@invitae.com InVitae, Inc. invitae.com Developing a Clinical GenomeDeveloping a Clinical Genome Interpretation Pipeline at AWSInterpretation Pipeline at AWS
  • 2. 2/21Copyright 2013 InVitae, Inc The MissionThe Mission To provide comprehensive, clinically-relevant information from genomic variation data in a single test.
  • 3. 3/21Copyright 2013 InVitae, Inc one sample one requisition one report up to 264 conditions two weeks one lab, one price InVitae's process features online requisitioning and reporting, CLIA-certified sequencing, and a HIPAA-compliant information management. intake Requisitioning and Laboratory Information Management System interpretationsequencing review
  • 4. 4/21Copyright 2013 InVitae, Inc Where does InVitae fit?Where does InVitae fit? photos: Baylor College of Medicine, Univ. Utah, learningradiology.com, sciencephotos.com Patient presents with symptoms If genomic interpretation might influence diagnosis or treatment, doctor refers patient to genetic counselor GC takes history; sample is sent to internal or one of hundreds of labs that provide specific genomic tests Sequencing and other lab data are processed into preliminary iterpretation Report is returned to GC and/or physician who verify interpretation and consult with patient
  • 5. 5/21Copyright 2013 InVitae, Inc http://www.ncbi.nlm.nih.gov/sites/GeneTests/
  • 6. 6/21Copyright 2013 InVitae, Inc Examples of published clinical variantsExamples of published clinical variants ➢ Inherited conditions ● NM_000136.2:c.355_360delATGAGAinsT ‒ at risk for Fanconi Anemia ➢ Carrier conditions ● NM_000520.4:c.1277_1278insGATA ‒ carrier for Tay-Sachs ➢ Pharmacogenetic conditions ● HLAB*1502 & HLAB*5701 haplotypes (23 snps) ‒ Abacavir drug-induced hypersensitivity ‒ Flucloxacillin drug-induced liver injury ‒ Carbamazepine drug-induced cutaneous adverse events
  • 7. 7/21Copyright 2013 InVitae, Inc Reported VariantsReported Variants
  • 8. 8/21Copyright 2013 InVitae, Inc Our ReportOur Report
  • 10. 10/21Copyright 2013 InVitae, Inc NOW Early Access Commercial Program (CLIA Certified) 264 GENETIC TESTS $1,500 2014 Goal to update offering every six months >1000 GENETIC TESTS <$1,000 50x minimum, ~425x average 100% coverage of curated variants >90% of targeted regions SNVs, indels (<100nt), VUS
  • 11. 11/21Copyright 2013 InVitae, Inc How?How? By deeply integrating custom genetic curation, sequencing assay design, and analytical pipelines.
  • 12. 12/21Copyright 2013 InVitae, Inc Curation + Assay Design + PipelineCuration + Assay Design + Pipeline curation analysis pipeline A>T sequence assay curation
  • 13. 13/21Copyright 2013 InVitae, Inc Protected Health Information on-site, encrypted, restricted access Architectural OverviewArchitectural Overview IPSec Tunnel 4.5GB up 4 MB down
  • 14. 14/21Copyright 2013 InVitae, Inc Pipeline OverviewPipeline Overview aligned readsreads variantsaligned reads reportvariants bwa samtools gatk picard + assay regions + GRCh37 ref + 1kg known indels gatk freebayes custom variant calling + qualities + metrics + quality metrics call haplotypes match variants VUS pipeline render
  • 15. 15/21Copyright 2013 InVitae, Inc AWS TopologyAWS Topology 1 VPC 3 subnets NFS interactive hosts web services scaled dynamically build/test
  • 16. 16/21Copyright 2013 InVitae, Inc What's worked well at AWS?What's worked well at AWS? ➢ Performance and Capacity Scaling ● on-demand ➢ Security and VPG ● Simple, comprehensive rules for VPCs ➢ Service ecosystem: EC2, EBS, S3, R53, IAM, VPC ● boto! ➢ Future: ● Archive: S3 and Glacier? ● Investigate workflows ● Reanalysis
  • 17. 17/21Copyright 2013 InVitae, Inc What challenges have we experienced?What challenges have we experienced? ➢ Large shared fileystems ● poor and variable performance (better now) ● familiarity, flexibility, transparency ● existing software expectations ➢ Node failures ● Once upon a time... 0-3 times per day, no warning ● moved zones no longer an issue ➢ Devops ● Built our own, but moving to puppet
  • 19. 19/21Copyright 2013 InVitae, Inc One source of variation...One source of variation... attribution unknown