SlideShare a Scribd company logo
1 of 41
Big Data and the Promise and Pitfalls
when Applied to Disease Prevention
and Promoting Better Health
Philip E. Bourne Ph.D., FACMI
Associate Director for Data Science
National Institutes of Health
philip.bourne@nih.gov
http://www.slideshare.net/pebourne
Agenda
 What are Big Data anyway?
 What are the implications for healthcare
generally?
 What are the implications for NIH
specifically?
 Examples of big data applied to disease
prevention & promoting better health
What are Big Data:
Quantifying the Problem
 Big Data
– Total data from NIH-funded research currently
estimated at 650 PB*
– 20 PB of that is in NCBI/NLM (3%) and it is
expected to grow by 10 PB this year
 Dark Data
– Only 12% of data described in published papers is
in recognized archives – 88% is dark data^
 Cost
– 2007-2014: NIH spent ~$1.2Bn extramurally on
maintaining data archives
* In 2012 Library of Congress was 3 PB
^ http://www.ncbi.nlm.nih.gov/pubmed/26207759
Big Data in Biomedicine…
This speaks to something more fundamental that more
data …
It speaks to new methodologies, new skills, new
emphasis, new cultures, new modes of discovery …
Agenda
 What are Big Data anyway?
 What are the implications for healthcare
generally?
 What are the implications for NIH
specifically?
 Examples of big data applied to disease
prevention & promoting better health
It Follows …
We are entering a period of disruption
in biomedical research and we should
all be thinking about what this means
http://i1.wp.com/chisconsult.com/wp-
content/uploads/2013/05/disruption-is-a-
process.jpg
http://cdn2.hubspot.net/hubfs/418817/disruption1.jpg
We are at a Point of Deception …
 Evidence:
– Google car
– 3D printers
– Waze
– Robotics
– Sensors
From: The Second Machine Age: Work, Progress,
and Prosperity in a Time of Brilliant Technologies
by Erik Brynjolfsson & Andrew McAfee
Disruption: Example - Photography
Digitization
Deception
Disruption
Demonetization
Dematerialization
Democratization
Time
Volume,Velocity,Variety
Digital camera invented by
Kodak but shelved
Megapixels & quality improve slowly;
Kodak slow to react
Film market collapses;
Kodak goes bankrupt
Phones replace
cameras
Instagram,
Flickr become the
value proposition
Digital media becomes bona fide
form of communication
Agenda
 What are Big Data anyway?
 What are the implications for healthcare
generally?
 What are the implications for NIH
specifically?
 Examples of big data applied to disease
prevention & promoting better health
Disruption: Biomedical Research
Digitization of Basic &
Clinical Research & EHR’s
Deception
We Are Here
Disruption
Demonetization
Dematerialization
Democratization
Open science
Patient centered health care
Implications: Sustainability
Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
Implications:
Reproducibility
Changing Value of Scholarship (?)
“And that’s why we’re here today. Because something
called precision medicine … gives us one of the greatest
opportunities for new medical breakthroughs that we
have ever seen.”
President Barack Obama
January 30, 2015
Implications – New Science
Precision Medicine Initiative
 National Research Cohort
– >1 million U.S. volunteers
– Numerous existing cohorts (many funded by NIH)
– New volunteers
 Participants will be centrally involved in design and
implementation of the cohort
 They will be able to share genomic data, lifestyle
information, biological samples – all linked to their
electronic health records
What Are Some General Implications
of Such a Future?
 Open collaborative science becomes of increasing
importance nationally and internationally
 Global cooperation between funders will be needed
to sustain the emergent digital enterprise
 The value of data and associated analytics becomes
of increasing value to scholarship
 Opportunities exist to improve the efficiency of the
research enterprise and hence fund more research
 Current training content and modalities will not
match supply to demand
 Balancing accessibility vs security becomes more
important yet more complex
What are the implications of not
acting?
Use Case:
Aggregate integrated data offers the
potential for new insights into rare
diseases …
As we get more precise every disease becomes a rare disease
Diffuse Intrinsic Pontine Gliomas (DIPG): In
need of a new data-driven approach
• Occur 1:100,000
individuals
• Peak incidence 6-8 years
of age
• Median survival 9-12
months
• Surgery is not an option
• Chemotherapy ineffective
and radiotherapy only
transitive
From Adam Resnick
Timeline of Genomic Studies in DIPG
• Landmark studies
identify histone mutations
as recurrent driver
mutations in DIPG ~2012
• Almost 3 years later, in
largely the same
datasets, but partially
expanded, the same two
groups and 2 others
identify ACVR1
mutations as a
secondary, co-ocurring
mutation
From Adam Resnick
Hypothesis: The Commons would have
revealed ACVR1
• ACVR1 is a targetable kinase
• Inhibition of ACVR1 inhibited tumor
progression in vitro
• ~300 DIPG patients a year
• ~60 are predicted to have ACVR1
• If large scale data sets were only
integrated with TCGA and/or rare
disease data in 2012, ACVR1 mutations
would have been identified
• 60 patients/year X 3 years = 180
children’s lives (who likely succumbed to
the disease during that time) could have
been impacted if only data were FAIR
From Adam Resnick
The Commons –
The Internet of Data
 Findable
 Accessible
 Interoperable
 Reusable
* http://www.ncbi.nlm.nih.gov/pubmed/26978244
The Commons offers a path forward to integrate
discreet cloud-based initiatives using BD2K
developments to make data FAIR*
The internet started as discreet networks that
merged - the same could happen with data
Examples of Commons Based Initiatives
5 PB
40TB AWS
The Role of BD2K
1. Commons
– Resource
Indexing
– Standards
– Cloud & HPC
– Sustainability
2. Data Science
Research
– Centers
– Software
Analysis &
Methods
3. Training & Workforce Development
Agenda
 What are Big Data anyway?
 What are the implications for healthcare
generally?
 What are the implications for NIH
specifically?
 Examples of big data applied to disease
prevention & promoting better health
An Example of That Promise:
Comorbidity Network for 6.2M Danes
Over 14.9 Years
Jensen et al 2014 Nat Comm 5:4022
EHR-based
phenotyping
neuroimage-based
phenotyping
transcriptome-based
phenotyping
epigenome-based
phenotyping
phenotype models for
breast cancer screening
stochastic
modeling
low-dimensional
representations
data management
value of information
Projects
Labs
The Center for Predictive
Computational Phenotyping
EHR-based phenotyping
time
now
prospective phenotyping: predict a
phenotype of interest before it is
exhibited
retrospective phenotyping:
identify subjects who have
exhibited a phenotype of
interest (i.e. identify cases and
controls)
?
genotype
demographics
events in EHR (diagnoses,
procedures, medications,
labs, etc.)
We can predict thousands of diagnoses
months in advance of being recorded in an
EHR
• ~ 1.5 million subjects from Marshfield Clinic
• models learned for all ICD-9 codes (~3500) for which 500 cases and
controls identified
Mobile Sensor Data-to-Knowledge
(MD2K)MobileSensorsMobileSensors
SmartwatchSmartwatch ChestbandsChestbands Smart EyeglassesSmart Eyeglasses
ExposuresExposuresBehaviorsBehaviorsOutcomesOutcomes
Detecting First Lapses in Smoking Cessation
Modeling Challenges
1. Ephemeral (very short duration)
– 3~4 sec for each puff
– 10,000 breaths in 10 hours
– 2,000 hand to mouth gestures
– But, only 6~7 positive instances
– Need high recall & low false alarm
2. Numerous confounders
– Eating, drinking, yawning
Wide person & situation variability
https://www.pinterest.com/pin/52
6710118890712075/
Saleheen, et. al., ACM UbiComp 2015
Key Observations
• First lapse consists of 7 (vs. 15) puffs
• Only 20 (out of 28) reported lapse
• Inaccuracy of self-reported lapse
– 12 min before to 41 min after lapse
– Recall inaccuracy even higher
Main Results
• Applied on smoking cessation data
from 61 smokers
• Detected 28 (out of 32) first lapses
• False alarm rate of 1/6 per day
Summary
 Digital Big Data offers unprecedented
opportunities
 Those opportunities require a cultural shift –
small for some communities large for others –
never easy
 We are implementing an environment to
encourage change
 We would very much like to hear from you
opportunities for disease prevention and
promoting better health
I not only use all the brains
I have, but all I can borrow.
– Woodrow Wilson
ADDS Team
BD2K Representatives
NIHNIH……
Turning Discovery Into HealthTurning Discovery Into Health
philip.bourne@nih.gov
https://datascience.nih.gov/
http://www.ncbi.nlm.nih.gov/research/staff/bourne/
Goal: To strengthen the ability of a
diverse biomedical workforce to develop
and benefit from data science

More Related Content

What's hot

Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
 
Secure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewSecure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewPhilip Bourne
 
Pacific Symposium on Biocomputing 2018
Pacific Symposium on Biocomputing 2018Pacific Symposium on Biocomputing 2018
Pacific Symposium on Biocomputing 2018Warren Kibbe
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global EcosystemPhilip Bourne
 
From Research to Practice - New Models for Data-sharing and Collaboration to ...
From Research to Practice - New Models for Data-sharing and Collaboration to ...From Research to Practice - New Models for Data-sharing and Collaboration to ...
From Research to Practice - New Models for Data-sharing and Collaboration to ...Health Data Consortium
 
One Funder’s View for Advancing Open Science
One Funder’s View for Advancing Open ScienceOne Funder’s View for Advancing Open Science
One Funder’s View for Advancing Open SciencePhilip Bourne
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?Philip Bourne
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHPhilip Bourne
 
IBM Watson in Healthcare
IBM Watson in HealthcareIBM Watson in Healthcare
IBM Watson in HealthcareAnders Quitzau
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
 
Unlocking Scientific Research: A Confrontation Between Publishers and Federa...
Unlocking Scientific Research:  A Confrontation Between Publishers and Federa...Unlocking Scientific Research:  A Confrontation Between Publishers and Federa...
Unlocking Scientific Research: A Confrontation Between Publishers and Federa...Angelica Tavella
 
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013Warren Kibbe
 
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Amit Sheth
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
IBM Terkko Pop-up Presentation by Pekka Leppänen
IBM Terkko Pop-up Presentation by Pekka LeppänenIBM Terkko Pop-up Presentation by Pekka Leppänen
IBM Terkko Pop-up Presentation by Pekka LeppänenTerkkoHub
 
Suzana_Qatar_poster_final
Suzana_Qatar_poster_finalSuzana_Qatar_poster_final
Suzana_Qatar_poster_finalSuzana Brown
 

What's hot (20)

Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
VIVO Keynote
VIVO KeynoteVIVO Keynote
VIVO Keynote
 
BD2K Update
BD2K Update BD2K Update
BD2K Update
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Secure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewSecure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH View
 
Pacific Symposium on Biocomputing 2018
Pacific Symposium on Biocomputing 2018Pacific Symposium on Biocomputing 2018
Pacific Symposium on Biocomputing 2018
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
From Research to Practice - New Models for Data-sharing and Collaboration to ...
From Research to Practice - New Models for Data-sharing and Collaboration to ...From Research to Practice - New Models for Data-sharing and Collaboration to ...
From Research to Practice - New Models for Data-sharing and Collaboration to ...
 
One Funder’s View for Advancing Open Science
One Funder’s View for Advancing Open ScienceOne Funder’s View for Advancing Open Science
One Funder’s View for Advancing Open Science
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIH
 
IBM Watson in Healthcare
IBM Watson in HealthcareIBM Watson in Healthcare
IBM Watson in Healthcare
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
Atul Butte presentation at the Morris Collen 100 birthday celebration
Atul Butte presentation at the Morris Collen 100 birthday celebrationAtul Butte presentation at the Morris Collen 100 birthday celebration
Atul Butte presentation at the Morris Collen 100 birthday celebration
 
Unlocking Scientific Research: A Confrontation Between Publishers and Federa...
Unlocking Scientific Research:  A Confrontation Between Publishers and Federa...Unlocking Scientific Research:  A Confrontation Between Publishers and Federa...
Unlocking Scientific Research: A Confrontation Between Publishers and Federa...
 
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013
National Cancer Policy Forum Summit - Warren Kibbe Keynote November 2013
 
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
IBM Terkko Pop-up Presentation by Pekka Leppänen
IBM Terkko Pop-up Presentation by Pekka LeppänenIBM Terkko Pop-up Presentation by Pekka Leppänen
IBM Terkko Pop-up Presentation by Pekka Leppänen
 
Suzana_Qatar_poster_final
Suzana_Qatar_poster_finalSuzana_Qatar_poster_final
Suzana_Qatar_poster_final
 

Similar to Big Data and the Promise and Pitfalls when Applied to Disease Prevention and Promoting Better Health

Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Conference – iHT2
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014muink
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseLewis Lin 🦊
 
Scripps lj grand rounds
Scripps lj grand roundsScripps lj grand rounds
Scripps lj grand rounds3GDR
 
Are we ready for disruption in Translational Research through Digital Medicine?
Are we ready for disruption in Translational Research through Digital Medicine?Are we ready for disruption in Translational Research through Digital Medicine?
Are we ready for disruption in Translational Research through Digital Medicine?Ashish Atreja, MD, MPH
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBigData_Europe
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Rick Silva
 
Big Data Big Picture - Professor Derek Bell
Big Data Big Picture - Professor Derek BellBig Data Big Picture - Professor Derek Bell
Big Data Big Picture - Professor Derek BellNapier University
 
Biomedicine from Stethoscope to Computer
Biomedicine from Stethoscope to ComputerBiomedicine from Stethoscope to Computer
Biomedicine from Stethoscope to ComputerPhilip Bourne
 
Professor Jeremy Wyatt- Health Futures: Real or Virtual?
Professor Jeremy Wyatt- Health Futures: Real or Virtual? Professor Jeremy Wyatt- Health Futures: Real or Virtual?
Professor Jeremy Wyatt- Health Futures: Real or Virtual? Warwick Knowledge
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
 
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...Kostyantyn Bobyk
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingWarren Kibbe
 
Emerging Technologies and Tools in Precision Medicine Research
Emerging Technologies and Tools in Precision Medicine ResearchEmerging Technologies and Tools in Precision Medicine Research
Emerging Technologies and Tools in Precision Medicine ResearchClinosolIndia
 
2013-04-17: The Promise, Current State, And Future of Personalized Medicine
2013-04-17: The Promise, Current State, And Future of Personalized Medicine2013-04-17: The Promise, Current State, And Future of Personalized Medicine
2013-04-17: The Promise, Current State, And Future of Personalized MedicineBaltimore Lean Startup
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health careAravindharamanan S
 
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로Yoon Sup Choi
 

Similar to Big Data and the Promise and Pitfalls when Applied to Disease Prevention and Promoting Better Health (20)

Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
 
New sources of big data for precision medicine: are we ready?
New sources of big data for precision medicine: are we ready?New sources of big data for precision medicine: are we ready?
New sources of big data for precision medicine: are we ready?
 
Scripps lj grand rounds
Scripps lj grand roundsScripps lj grand rounds
Scripps lj grand rounds
 
Are we ready for disruption in Translational Research through Digital Medicine?
Are we ready for disruption in Translational Research through Digital Medicine?Are we ready for disruption in Translational Research through Digital Medicine?
Are we ready for disruption in Translational Research through Digital Medicine?
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...
 
Big Data Big Picture - Professor Derek Bell
Big Data Big Picture - Professor Derek BellBig Data Big Picture - Professor Derek Bell
Big Data Big Picture - Professor Derek Bell
 
Biomedicine from Stethoscope to Computer
Biomedicine from Stethoscope to ComputerBiomedicine from Stethoscope to Computer
Biomedicine from Stethoscope to Computer
 
Professor Jeremy Wyatt- Health Futures: Real or Virtual?
Professor Jeremy Wyatt- Health Futures: Real or Virtual? Professor Jeremy Wyatt- Health Futures: Real or Virtual?
Professor Jeremy Wyatt- Health Futures: Real or Virtual?
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
From Research to Practice: New Models for Data-sharing and Collaboration to I...
From Research to Practice: New Models for Data-sharing and Collaboration to I...From Research to Practice: New Models for Data-sharing and Collaboration to I...
From Research to Practice: New Models for Data-sharing and Collaboration to I...
 
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...
K Bobyk - %22A Primer on Personalized Medicine - The Imminent Systemic Shift%...
 
Pavia wsp october 2011
Pavia wsp october 2011Pavia wsp october 2011
Pavia wsp october 2011
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
 
Emerging Technologies and Tools in Precision Medicine Research
Emerging Technologies and Tools in Precision Medicine ResearchEmerging Technologies and Tools in Precision Medicine Research
Emerging Technologies and Tools in Precision Medicine Research
 
2013-04-17: The Promise, Current State, And Future of Personalized Medicine
2013-04-17: The Promise, Current State, And Future of Personalized Medicine2013-04-17: The Promise, Current State, And Future of Personalized Medicine
2013-04-17: The Promise, Current State, And Future of Personalized Medicine
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in ResearchPhilip Bourne
 

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 

Recently uploaded

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

Big Data and the Promise and Pitfalls when Applied to Disease Prevention and Promoting Better Health

  • 1. Big Data and the Promise and Pitfalls when Applied to Disease Prevention and Promoting Better Health Philip E. Bourne Ph.D., FACMI Associate Director for Data Science National Institutes of Health philip.bourne@nih.gov http://www.slideshare.net/pebourne
  • 2. Agenda  What are Big Data anyway?  What are the implications for healthcare generally?  What are the implications for NIH specifically?  Examples of big data applied to disease prevention & promoting better health
  • 3. What are Big Data: Quantifying the Problem  Big Data – Total data from NIH-funded research currently estimated at 650 PB* – 20 PB of that is in NCBI/NLM (3%) and it is expected to grow by 10 PB this year  Dark Data – Only 12% of data described in published papers is in recognized archives – 88% is dark data^  Cost – 2007-2014: NIH spent ~$1.2Bn extramurally on maintaining data archives * In 2012 Library of Congress was 3 PB ^ http://www.ncbi.nlm.nih.gov/pubmed/26207759
  • 4. Big Data in Biomedicine… This speaks to something more fundamental that more data … It speaks to new methodologies, new skills, new emphasis, new cultures, new modes of discovery …
  • 5. Agenda  What are Big Data anyway?  What are the implications for healthcare generally?  What are the implications for NIH specifically?  Examples of big data applied to disease prevention & promoting better health
  • 6. It Follows … We are entering a period of disruption in biomedical research and we should all be thinking about what this means http://i1.wp.com/chisconsult.com/wp- content/uploads/2013/05/disruption-is-a- process.jpg http://cdn2.hubspot.net/hubfs/418817/disruption1.jpg
  • 7. We are at a Point of Deception …  Evidence: – Google car – 3D printers – Waze – Robotics – Sensors From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies by Erik Brynjolfsson & Andrew McAfee
  • 8. Disruption: Example - Photography Digitization Deception Disruption Demonetization Dematerialization Democratization Time Volume,Velocity,Variety Digital camera invented by Kodak but shelved Megapixels & quality improve slowly; Kodak slow to react Film market collapses; Kodak goes bankrupt Phones replace cameras Instagram, Flickr become the value proposition Digital media becomes bona fide form of communication
  • 9. Agenda  What are Big Data anyway?  What are the implications for healthcare generally?  What are the implications for NIH specifically?  Examples of big data applied to disease prevention & promoting better health
  • 10. Disruption: Biomedical Research Digitization of Basic & Clinical Research & EHR’s Deception We Are Here Disruption Demonetization Dematerialization Democratization Open science Patient centered health care
  • 11. Implications: Sustainability Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
  • 13. “And that’s why we’re here today. Because something called precision medicine … gives us one of the greatest opportunities for new medical breakthroughs that we have ever seen.” President Barack Obama January 30, 2015 Implications – New Science
  • 14. Precision Medicine Initiative  National Research Cohort – >1 million U.S. volunteers – Numerous existing cohorts (many funded by NIH) – New volunteers  Participants will be centrally involved in design and implementation of the cohort  They will be able to share genomic data, lifestyle information, biological samples – all linked to their electronic health records
  • 15. What Are Some General Implications of Such a Future?  Open collaborative science becomes of increasing importance nationally and internationally  Global cooperation between funders will be needed to sustain the emergent digital enterprise  The value of data and associated analytics becomes of increasing value to scholarship  Opportunities exist to improve the efficiency of the research enterprise and hence fund more research  Current training content and modalities will not match supply to demand  Balancing accessibility vs security becomes more important yet more complex
  • 16. What are the implications of not acting?
  • 17. Use Case: Aggregate integrated data offers the potential for new insights into rare diseases … As we get more precise every disease becomes a rare disease
  • 18. Diffuse Intrinsic Pontine Gliomas (DIPG): In need of a new data-driven approach • Occur 1:100,000 individuals • Peak incidence 6-8 years of age • Median survival 9-12 months • Surgery is not an option • Chemotherapy ineffective and radiotherapy only transitive From Adam Resnick
  • 19. Timeline of Genomic Studies in DIPG • Landmark studies identify histone mutations as recurrent driver mutations in DIPG ~2012 • Almost 3 years later, in largely the same datasets, but partially expanded, the same two groups and 2 others identify ACVR1 mutations as a secondary, co-ocurring mutation From Adam Resnick
  • 20. Hypothesis: The Commons would have revealed ACVR1 • ACVR1 is a targetable kinase • Inhibition of ACVR1 inhibited tumor progression in vitro • ~300 DIPG patients a year • ~60 are predicted to have ACVR1 • If large scale data sets were only integrated with TCGA and/or rare disease data in 2012, ACVR1 mutations would have been identified • 60 patients/year X 3 years = 180 children’s lives (who likely succumbed to the disease during that time) could have been impacted if only data were FAIR From Adam Resnick
  • 21. The Commons – The Internet of Data  Findable  Accessible  Interoperable  Reusable * http://www.ncbi.nlm.nih.gov/pubmed/26978244 The Commons offers a path forward to integrate discreet cloud-based initiatives using BD2K developments to make data FAIR* The internet started as discreet networks that merged - the same could happen with data
  • 22. Examples of Commons Based Initiatives 5 PB 40TB AWS
  • 23. The Role of BD2K 1. Commons – Resource Indexing – Standards – Cloud & HPC – Sustainability 2. Data Science Research – Centers – Software Analysis & Methods 3. Training & Workforce Development
  • 24. Agenda  What are Big Data anyway?  What are the implications for healthcare generally?  What are the implications for NIH specifically?  Examples of big data applied to disease prevention & promoting better health
  • 25. An Example of That Promise: Comorbidity Network for 6.2M Danes Over 14.9 Years Jensen et al 2014 Nat Comm 5:4022
  • 26. EHR-based phenotyping neuroimage-based phenotyping transcriptome-based phenotyping epigenome-based phenotyping phenotype models for breast cancer screening stochastic modeling low-dimensional representations data management value of information Projects Labs The Center for Predictive Computational Phenotyping
  • 27. EHR-based phenotyping time now prospective phenotyping: predict a phenotype of interest before it is exhibited retrospective phenotyping: identify subjects who have exhibited a phenotype of interest (i.e. identify cases and controls) ? genotype demographics events in EHR (diagnoses, procedures, medications, labs, etc.)
  • 28. We can predict thousands of diagnoses months in advance of being recorded in an EHR • ~ 1.5 million subjects from Marshfield Clinic • models learned for all ICD-9 codes (~3500) for which 500 cases and controls identified
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. Mobile Sensor Data-to-Knowledge (MD2K)MobileSensorsMobileSensors SmartwatchSmartwatch ChestbandsChestbands Smart EyeglassesSmart Eyeglasses ExposuresExposuresBehaviorsBehaviorsOutcomesOutcomes
  • 36. Detecting First Lapses in Smoking Cessation Modeling Challenges 1. Ephemeral (very short duration) – 3~4 sec for each puff – 10,000 breaths in 10 hours – 2,000 hand to mouth gestures – But, only 6~7 positive instances – Need high recall & low false alarm 2. Numerous confounders – Eating, drinking, yawning Wide person & situation variability https://www.pinterest.com/pin/52 6710118890712075/ Saleheen, et. al., ACM UbiComp 2015 Key Observations • First lapse consists of 7 (vs. 15) puffs • Only 20 (out of 28) reported lapse • Inaccuracy of self-reported lapse – 12 min before to 41 min after lapse – Recall inaccuracy even higher Main Results • Applied on smoking cessation data from 61 smokers • Detected 28 (out of 32) first lapses • False alarm rate of 1/6 per day
  • 37. Summary  Digital Big Data offers unprecedented opportunities  Those opportunities require a cultural shift – small for some communities large for others – never easy  We are implementing an environment to encourage change  We would very much like to hear from you opportunities for disease prevention and promoting better health
  • 38. I not only use all the brains I have, but all I can borrow. – Woodrow Wilson
  • 40. NIHNIH…… Turning Discovery Into HealthTurning Discovery Into Health philip.bourne@nih.gov https://datascience.nih.gov/ http://www.ncbi.nlm.nih.gov/research/staff/bourne/
  • 41. Goal: To strengthen the ability of a diverse biomedical workforce to develop and benefit from data science

Editor's Notes

  1. $1.25bn per year to capture all data. After a significant effort at reduction, intramurally data is spread across > 60 data centers; imagine the extramural situation.
  2. Photos: FC tweet; RK screen grab
  3. Images of people from Infographic (NOTE: Image is just a placeholder—Jill will tweak) Detailed Notes: National Research Cohort <<OR name of study>> >1 million U.S. volunteers committed to participating in research Will combine a number of existing cohorts Will include Dept of Veterans Affairs Million Veteran Program—note Veteran is singular per http://www.research.va.gov/MVP/
  4. Goal is to be able to easily draw upon data across these initiatives. Internet of data will see these individual initiatives merge.
  5. 16 million hospital inpatient events (24.5% of total), 35 million outpatient clinic events (53.6% of total) and 14 million emergency department events (21.9% of total)
  6. Health history -
  7. Prediction of 1000’s of phenotypes predict ICD9 codes ..5 guessing 1 exact
  8. Azumio – monitors a variety of features Nhanes –National Health and nutritional examination survey – manual collection CDC
  9. Gini Coeff measures inequality among values of a frequency distribution. 0 equality 1 total inequality
  10. Short term: produce a searchable catalog of physical and virtual courses; Funding diversity awards to work with BD2K Centers; Expand IRP training started Jan 2015 e.g. Software carpentry and Train the trainers Long term: evaluation