SlideShare a Scribd company logo
1 of 35
Download to read offline
The Future of Healthcare
with Big Data and AI
DATA
ENGINEERS
DATA
SCIENTISTS
UNIFIED
ANALYTICS
PLATFORM
EXPERTISE GAP
DOMAIN
EXPERT
UNIFIED
ANALYTICS
PLATFORM
DOMAIN
EXPERT
INDUSTRY-SPECIFIC TOOLS
UNIFIED
ANALYTICS
PLATFORM
Unified Analytics Platform
for Genomics
Massive Investments in Genomic Data
Potential to Transform the Industry
Faster Drug
Discovery
Reduced
Health Claims
Better Patient
Outcomes
40,000 Petabytes / year by 2025
Genomic Data Volumes are Exploding
From $2.7B to <$1,000
Sequencing Data / $
2018
Sequencing Data Processed / $
2018
Sequencing Data / $
Sequencing Data Processed / $
2018
Cost of processing all DNA sequenced in a year
is growing exponentially year-over-year!
Sequencing Data / $
Challenge #1: Complex Pipelines
Complex Genomic Pipelines
Costly and time consuming
Annotation
Alignment
Variant Calling
Quality Control
BWA
Analysis
Raw Data
Challenge #2: Rigid Analytics
Complex Genomic Pipelines
Costly and time consuming
Annotation
Alignment
Variant Calling
Quality Control
BWA
Analysis
Raw Data
Rigid Analytics
Reduced Scope of Research
Challenge #3: Siloed Teams
Complex Genomic Pipelines
Costly and time consuming
Annotation
Alignment
Variant Calling
Quality Control
BWA
Analysis
Raw Data
Rigid Analytics
Reduced Scope of Research
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Solution #1: Prebuilt Pipelines
Complex Genomic Pipelines
Costly and time consuming
Annotation
Alignment
Variant Calling
Quality Control
BWA
Analysis
Raw Data
Rigid Analytics
Reduced Scope of Research
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
Solution #1: Prebuilt Pipelines
Complex Genomic Pipeline
Costly and time consuming
Annotation
Alignment
Variant Calling
Quality Control
BWA
Analysis
Raw Data
Rigid Analytics
Reduced Scope of Research
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
30x Coverage Whole Genome (GVCF)
0:30:00 1:00:00 1:30:00 2:00:00
Processing Time
3.8x faster than
industry leader
Edico
2:29:23
0:39:23
Solution #2: Powerful Analytics
Rigid Analytics
Reduced Scope of Research
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
From interactive queries to AI
Powerful Analytics
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
Solution #2: Powerful Analytics
Rigid Analytics
Reduced Scope of Research
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
From interactive queries to AI
Powerful Analytics
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
“Having the data is the first step,
enabling drug development teams
to answer questions with the data
is how we are building the future of
drug discovery.”
Dr. Jeff Reid, Exec Dir at Regeneron
“Queries on
60B+ genome
associations in
3 seconds vs.
30 minutes”
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Solution #3: Collaborative Workspaces
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
From interactive queries to AI
Powerful Analytics
Lack of ProductivityDramatically Improve Productivity
Collaborative Workspaces
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Siloed Teams
Lack of Productivity
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
Solution #3: Collaborative Workspaces
Raw Data
Analyses
Raw Data Raw Data
Packaged Workflows and Tools
powered by
Databricks Runtime
“One click” execution
Best Practice Pipelines
From interactive queries to AI
Powerful Analytics
Lack of ProductivityDramatically Improve Productivity
Collaborative Workspaces
Researchers
and Clinicians
Bioinformatics
Teams
Computational
Biologists
“Databricks allows us to take
clinical research and turn it into
a clinically validated screen in
far less time.”
Sr. Director of Computational
Bioinformatics, Lynn Carmichael
Unified Analytics Platform for Genomics
All Your
Genomic Data
Visualizations
Machine Learning
Best Practice
Pipelines
Tertiary
Analytics and
AI at Scale
Collaborative
Workspaces
Genomic Analytics
(e.g. GWAS, eQTL)
Unified Analytics Platform for Genomics
All Your
Genomic Data
Visualizations
Machine Learning
Best Practice
Pipelines
Tertiary
Analytics and
AI at Scale
Collaborative
Workspaces
Genomic Analytics
(e.g. GWAS, eQTL)
Genomics-specific optimizations
increase performance by up to 100x
Sign-up for the preview
databricks.com/genomics
Accelerate Discovery
Accelerate Discovery
Demo: Preventing Disease
with Genomics at Scale
Typical patient intake and treatment
Identify Diagnose Treat
Typical patient intake and treatment
Identify Diagnose Treat
...but this is very reactive and costly.
Typical patient intake and treatment
Identify Diagnose Treat
...but this is very reactive and costly.
By the age of 15, over 30% of Europeans
will develop a chronic disease
Let’s shift our thinking
What if we could identify
an individual’s risk for
developing a disease
and prevent that disease
before it ever occurs?
The preventative care process
Predict Prevent
The preventative care process
Predict Prevent
Accelerated treatment improves outcomes
The preventative care process
Predict Prevent
Huge opportunity for genomics
Accelerated treatment improves outcomes
But genomic analysis is really hard
Population Scale Data
Arrives (e.g. Biobank)
Process
for Analysis
Export Model and
Apply to Individual
Generate Dashboard
for Clinician
Let’s try this with the
Databricks Unified Analytics
Platform for Genomics...
Sign-up for the preview
databricks.com/genomics

More Related Content

What's hot

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
Ankur Khanna
 
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Databricks
 

What's hot (20)

Digital transformation of translational medicine
Digital transformation of translational medicineDigital transformation of translational medicine
Digital transformation of translational medicine
 
Neo4j GraphDay Munich - Improve Health Research
Neo4j GraphDay Munich - Improve Health ResearchNeo4j GraphDay Munich - Improve Health Research
Neo4j GraphDay Munich - Improve Health Research
 
Pharma data analytics
Pharma data analyticsPharma data analytics
Pharma data analytics
 
How BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical TrialsHow BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical Trials
 
MedChemica BigData What Is That All About?
MedChemica BigData What Is That All About?MedChemica BigData What Is That All About?
MedChemica BigData What Is That All About?
 
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use CasesFrom Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
 
The FDA’s Data Exploring Tools: ‘Array Track HCA and PCA Packages’
The FDA’s Data Exploring Tools: ‘Array Track HCA and PCA Packages’The FDA’s Data Exploring Tools: ‘Array Track HCA and PCA Packages’
The FDA’s Data Exploring Tools: ‘Array Track HCA and PCA Packages’
 
Baker mckenzie
Baker mckenzieBaker mckenzie
Baker mckenzie
 
Validating microbiome claims – including the latest DNA techniques
Validating microbiome claims – including the latest DNA techniquesValidating microbiome claims – including the latest DNA techniques
Validating microbiome claims – including the latest DNA techniques
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
How we Built a Large Scale Matched Pair Analysis Engine (MCPairs) using OpenE...
How we Built a Large Scale Matched Pair Analysis Engine (MCPairs) using OpenE...How we Built a Large Scale Matched Pair Analysis Engine (MCPairs) using OpenE...
How we Built a Large Scale Matched Pair Analysis Engine (MCPairs) using OpenE...
 
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
 
Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13
 
Pistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric LittlePistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric Little
 
MedChemica Active Learning - Combining MMPA and ML
MedChemica Active Learning - Combining MMPA and MLMedChemica Active Learning - Combining MMPA and ML
MedChemica Active Learning - Combining MMPA and ML
 
Expert Panel on Data Challenges in Translational Research
Expert Panel on Data Challenges in Translational ResearchExpert Panel on Data Challenges in Translational Research
Expert Panel on Data Challenges in Translational Research
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016
 
Elastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
Elastic as a Fundamental Core to Pfizer’s Scientific Data CloudElastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
Elastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
 

Similar to The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft

2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine
Michael Atkins
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
Jordan Engbers
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Databricks
 

Similar to The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft (20)

WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
 
2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
 
Accure ai healthcare offering v4
Accure ai healthcare offering v4Accure ai healthcare offering v4
Accure ai healthcare offering v4
 
Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021
 
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
 
BioData World Basel 2018
BioData World Basel 2018BioData World Basel 2018
BioData World Basel 2018
 
Big data
Big dataBig data
Big data
 
Maximizing Production Efficiency with Big Data Analytics in semiconductor Man...
Maximizing Production Efficiency with Big Data Analytics in semiconductor Man...Maximizing Production Efficiency with Big Data Analytics in semiconductor Man...
Maximizing Production Efficiency with Big Data Analytics in semiconductor Man...
 
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
data.2.pptx
data.2.pptxdata.2.pptx
data.2.pptx
 
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
 
Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
 
David Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scanDavid Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scan
 
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
 
Best Practices for Building an End-to-End Workflow for Microbial Genomics
 Best Practices for Building an End-to-End Workflow for Microbial Genomics Best Practices for Building an End-to-End Workflow for Microbial Genomics
Best Practices for Building an End-to-End Workflow for Microbial Genomics
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"
 

More from Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

More from Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Recently uploaded

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
JohnnyPlasten
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
shambhavirathore45
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
shivangimorya083
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
shivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 

Recently uploaded (20)

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 

The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft

  • 1. The Future of Healthcare with Big Data and AI
  • 6. Massive Investments in Genomic Data
  • 7. Potential to Transform the Industry Faster Drug Discovery Reduced Health Claims Better Patient Outcomes
  • 8. 40,000 Petabytes / year by 2025 Genomic Data Volumes are Exploding From $2.7B to <$1,000
  • 10. Sequencing Data Processed / $ 2018 Sequencing Data / $
  • 11. Sequencing Data Processed / $ 2018 Cost of processing all DNA sequenced in a year is growing exponentially year-over-year! Sequencing Data / $
  • 12. Challenge #1: Complex Pipelines Complex Genomic Pipelines Costly and time consuming Annotation Alignment Variant Calling Quality Control BWA Analysis Raw Data
  • 13. Challenge #2: Rigid Analytics Complex Genomic Pipelines Costly and time consuming Annotation Alignment Variant Calling Quality Control BWA Analysis Raw Data Rigid Analytics Reduced Scope of Research
  • 14. Challenge #3: Siloed Teams Complex Genomic Pipelines Costly and time consuming Annotation Alignment Variant Calling Quality Control BWA Analysis Raw Data Rigid Analytics Reduced Scope of Research Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists
  • 15. Solution #1: Prebuilt Pipelines Complex Genomic Pipelines Costly and time consuming Annotation Alignment Variant Calling Quality Control BWA Analysis Raw Data Rigid Analytics Reduced Scope of Research Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines
  • 16. Solution #1: Prebuilt Pipelines Complex Genomic Pipeline Costly and time consuming Annotation Alignment Variant Calling Quality Control BWA Analysis Raw Data Rigid Analytics Reduced Scope of Research Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines 30x Coverage Whole Genome (GVCF) 0:30:00 1:00:00 1:30:00 2:00:00 Processing Time 3.8x faster than industry leader Edico 2:29:23 0:39:23
  • 17. Solution #2: Powerful Analytics Rigid Analytics Reduced Scope of Research Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists From interactive queries to AI Powerful Analytics Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines
  • 18. Solution #2: Powerful Analytics Rigid Analytics Reduced Scope of Research Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists From interactive queries to AI Powerful Analytics Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines “Having the data is the first step, enabling drug development teams to answer questions with the data is how we are building the future of drug discovery.” Dr. Jeff Reid, Exec Dir at Regeneron “Queries on 60B+ genome associations in 3 seconds vs. 30 minutes”
  • 19. Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists Solution #3: Collaborative Workspaces Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines From interactive queries to AI Powerful Analytics Lack of ProductivityDramatically Improve Productivity Collaborative Workspaces Researchers and Clinicians Bioinformatics Teams Computational Biologists
  • 20. Siloed Teams Lack of Productivity Researchers and Clinicians Bioinformatics Teams Computational Biologists Solution #3: Collaborative Workspaces Raw Data Analyses Raw Data Raw Data Packaged Workflows and Tools powered by Databricks Runtime “One click” execution Best Practice Pipelines From interactive queries to AI Powerful Analytics Lack of ProductivityDramatically Improve Productivity Collaborative Workspaces Researchers and Clinicians Bioinformatics Teams Computational Biologists “Databricks allows us to take clinical research and turn it into a clinically validated screen in far less time.” Sr. Director of Computational Bioinformatics, Lynn Carmichael
  • 21. Unified Analytics Platform for Genomics All Your Genomic Data Visualizations Machine Learning Best Practice Pipelines Tertiary Analytics and AI at Scale Collaborative Workspaces Genomic Analytics (e.g. GWAS, eQTL)
  • 22. Unified Analytics Platform for Genomics All Your Genomic Data Visualizations Machine Learning Best Practice Pipelines Tertiary Analytics and AI at Scale Collaborative Workspaces Genomic Analytics (e.g. GWAS, eQTL) Genomics-specific optimizations increase performance by up to 100x
  • 23. Sign-up for the preview databricks.com/genomics
  • 25. Demo: Preventing Disease with Genomics at Scale
  • 26. Typical patient intake and treatment Identify Diagnose Treat
  • 27. Typical patient intake and treatment Identify Diagnose Treat ...but this is very reactive and costly.
  • 28. Typical patient intake and treatment Identify Diagnose Treat ...but this is very reactive and costly. By the age of 15, over 30% of Europeans will develop a chronic disease
  • 29. Let’s shift our thinking What if we could identify an individual’s risk for developing a disease and prevent that disease before it ever occurs?
  • 30. The preventative care process Predict Prevent
  • 31. The preventative care process Predict Prevent Accelerated treatment improves outcomes
  • 32. The preventative care process Predict Prevent Huge opportunity for genomics Accelerated treatment improves outcomes
  • 33. But genomic analysis is really hard Population Scale Data Arrives (e.g. Biobank) Process for Analysis Export Model and Apply to Individual Generate Dashboard for Clinician
  • 34. Let’s try this with the Databricks Unified Analytics Platform for Genomics...
  • 35. Sign-up for the preview databricks.com/genomics