SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Big Data and Data Science:
Opportunities for Biomedical
Engineering
Philip E. Bourne PhD, FACMI
Stephenson Chair of Data Science
Director, Data Science Institute
Professor of Biomedical Engineering
peb6a@virginia.edu
https://www.slideshare.net/pebourne
04/08/18 AIMBE Academic Council 1
@pebourne
Disclaimer
• This is mostly NOT a talk about my own
research
• It draws upon my now one-year old view of
NIH as the former Associate Director for Data
Science (ADDS)
• It suffers from my drinking my own Kool-aid at
the University of Virginia
04/08/18 AIMBE Academic Council 2
Take home (hopefully)
• Increased awareness of the value of data
science to your activities
• Increased awareness of where NIH is headed
• Some thoughts about how to build out data
science in your own institutions
04/08/18 AIMBE Academic Council 3
Big data and data science are like the
Internet…
If I asked you to define them you would all
say something different, yet you use them
every day…
04/08/18 AIMBE Academic Council 4
http://vadlo.com/cartoons.php?id=357
So what do I mean by big data/data
science?
• Use of the ever increasing amount of open, complex,
diverse digital data
• Finding ways to ask and then answer relevant
questions by combining such diverse data sets
• Arriving at statistically significant conclusions not
otherwise obtainable
• Sharing such findings in a useful way
• Translating such findings into actions that improve
the human condition
04/08/18 AIMBE Academic Council 5
Cause
• There are ~2.7 Zetabytes (2.7 x 106 PB) of digital data
• Volume is doubling every two years
• Sheer volume of digital data e.g., $1000 genome,
wearable sensors, mandatory EHRs
• New tools e.g., Deep Artificial Neural Networks (DNNs)
• New computing power e.g., GPUs
04/08/18 AIMBE Academic Council 6
Effect
• Big data currently estimated as a $50bn
business – could save $3.1tn
• 50% growth in data/yr; 5% growth in IT
expenditure
• US 140,000- 190,000 unfilled deep data
analytics jobs
• UVA DSI has 600 applicants this year for 50
spots; MSDS/MBA highly sought
AIMBE Academic Council 704/08/18
Effect ++
• Big data currently estimated as a $50bn business
– could save $3.1tn – private sector research
• 50% growth in data/yr; 5% growth in IT
expenditure - undervalued
• US 140,000- 190,000 unfilled deep data analytics
jobs – competition for skilled researchers high
• DSI has 600 applicants this year for 50 spots;
MSDS/MBA highly sought – large human capital
AIMBE Academic Council 804/08/18
How much biomedical data?
• Big Data
– Total data from NIH-funded research in 2016
estimated at 650 PB*
– 20 PB of that is in NCBI/NLM (3%) and it is
expected to grow by 10 PB in 2016
• Dark Data
– Only 12% of data described in published papers is
in recognized archives – 88% is dark data^
• Cost
– 2007-2014: NIH spent ~$1.2Bn extramurally on
maintaining data archives
* In 2012 Library of Congress was 3 PB
^ http://www.ncbi.nlm.nih.gov/pubmed/26207759
04/08/18 AIMBE Academic Council 9
Consider some current high profile NIH
examples where and how data science is
being applied
• Moonshot - platforms and
integration, ML
• MODs – automated curation
• Human Microbiome Project –
new cloud based tools, ML
• TOPMed - platforms and
integration
• All-of-Us - platforms and
integration
• ECHO – platforms and
integration
• BRAIN - ML
10
All: Analytics, the Commons, FAIR, sustainability, workforce
04/08/18 AIMBE Academic Council
What of the future?
One view is the 6D’s
04/08/18 AIMBE Academic Council 11
Digitization
Deception
Disruption
Demonetization
Dematerialization
Democratization
Time
Volume,Velocity,Variety
Digital camera invented by
Kodak but shelved
Megapixels & quality improve slowly;
Kodak slow to react
Film market collapses;
Kodak goes bankrupt
Phones replace
cameras
Instagram,
Flickr become the
value proposition
Digital media becomes bona fide
form of communication
From a presentation to the Advisory Board to the NIH Director
Example - photography
1204/08/18 AIMBE Academic Council
A call for making these data open
• Mandates
– NIH, NSF, Data
Management Plans
• Business models can be
protected yet everyone
benefits
• It saves lives ….
04/08/18 AIMBE Academic Council 13
Why a More Open Process?
Use case:
Diffuse Intrinsic Pontine Gliomas (DIPG)
• Occur 1:100,000
individuals
• Peak incidence 6-8 years
of age
• Median survival 9-12
months
• Surgery is not an option
• Chemotherapy ineffective
and radiotherapy only
transitive
From Adam Resnick04/08/18 AIMBE Academic Council 14
Timeline of genomic studies in DIPG
• Landmark studies identify
histone mutations as
recurrent driver mutations in
DIPG ~2012
• Almost 3 years later, in
largely the same datasets,
but partially expanded, the
same two groups and 2
others identify ACVR1
mutations as a secondary, co-
occurring mutation
From Adam Resnick
04/08/18 AIMBE Academic Council 15
What do we need to do differently to
reveal ACVR1?
• ACVR1 is a targetable kinase
• Inhibition of ACVR1 inhibited tumor
progression in vitro
• ~300 DIPG patients a year
• ~60 are predicted to have ACVR1
• If large scale data sets were only
integrated with TCGA and/or rare
disease data in 2012, ACVR1 mutations
would have been identified
• 60 patients/year X 3 years = 180
children’s lives (who likely succumbed to
the disease during that time) could have
been impacted if only data were FAIR
From Adam Resnick
04/08/18 AIMBE Academic Council 16
How to promote
departmental/institutional openness?
• Encourage persistent identifiers e.g., ORCID
• Encourage preprints
• Encourage Open Access (OA)
• Recognize openness in hiring and P&T
• Teach open scholarship
• Promote institutional openness – repositories,
wikimedian in residence
• Support institutional open data governance
04/08/18 AIMBE Academic Council 17
NIH Strategic Plan for Data
• Support a Highly Efficient and
Effective Biomedical Research
Data Infrastructure
• Promote Modernization of the
Data-Resources Ecosystem
• Support the Development and
Dissemination of Advanced
Data Management, Analytics,
and Visualization Tools
• Enhance Workforce
Development for Biomedical
Data Science
• Enact Appropriate Policies to
Promote Stewardship and
Sustainability
04/08/18 AIMBE Academic Council 18
https://grants.nih.gov/grants/rfi/NIH-Strategic-Plan-for-Data-Science.pdf
Research Data Infrastructure …
Both funders and some institutions
see the need to move from pipes to
platforms to accelerate research…
04/08/18 AIMBE Academic Council 19
https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model-
750x410.png
If platforms are the answer we could
ask the question…
Will biomedical research become more
like Airbnb?
04/08/18 AIMBE Academic Council 20
Vivien Bonazzi
Should biomedical research be Like Airbnb?
doi: 10.1371/journal.pbio.2001818
I am not crazy, hear me out
• Airbnb is a platform that supports a trusted relationship
between consumer (renter) and supplier (host)
• The platform focuses on maximizing the exchange of services
between supplier and consumer and maximizing the amount
of trust associated with a given stakeholder
• It seems to be working:
– 60 million users searching 2 million listings in 192 countries
– Average of 500,000 stays per night.
– Evaluation of US $25bn
04/08/18 AIMBE Academic Council 21
Should biomedical research be Like Airbnb?
doi: 10.1371/journal.pbio.2001818
Platforms will ultimately digitally
integrate the scholarly workflow for
human and machine analysis
Should biomedical research be Like Airbnb?
doi: 10.1371/journal.pbio.2001818
AIMBE Academic Council 2204/08/18
Paper Author Paper Reader
Data Provider Data Consumer
Employer Employee
Reagent Provider Reagent Consumer
Software Provider Software Consumer
Grant Writer Grant Reviewer
Supplier Consumer Platform
MS Project
Google Drive
Coursera
Researchgate
Academia.edu
Open Science
Framework
Synapse
F1000
Rio
Educator Student
Pilot Open Data Lab
(ODL) underway
AIMBE Academic Council 23gDOC04/08/18
Why a comparison to Airbnb is not fair
• Airbnb was born digital
• The exchange of services on Airbnb are
simple compared to what is required of a
platform to support biomedical research
Nevertheless there is much to be
learnt
04/08/18 AIMBE Academic Council 24
Impediments to a biomedical platform
• Current work practices by all stakeholders
• Entrenched business models
• Size of the undertaking aka resources
needed
• Trust
• Incentives to use the platform
http://www.forbes.com/sites/johnhall/2013/04/29/1
0-barriers-to-employee-innovation/#8bdbaa811133
04/08/18 AIMBE Academic Council 25
Such platforms combined with
emerging analytics will likely have
significant impact on biomedical
engineering
04/08/18 AIMBE Academic Council 26
Machine learning has been around for
over 20 years – why now?
• Amount of data available for training
• Open source - R and python
• Advances in computing (e.g., GPU’s) allow for deeper
neural nets (deep learning)
• Algorithmic efficiency gains (e.g., in back
propagation)
• Success promotes further research
• Commercialization
04/08/18 AIMBE Academic Council 27
Pastur-Romay et al. 2016 doi:10.3390/ijms17081313
Let me touch on our research in
protein engineering oh so briefly….
04/08/18 AIMBE Academic Council 28
Structural Biology Meets Data Science – Does Anything
Change?
Crowd Source: Current Opinions in Structural Biology 2018
https://docs.google.com/document/d/1rD3Qh1btTYlnGkKefN
GSFVq8v_mqRNa8I0o5MP3ZMW4/edit
Are their new scaffolds out there Nature
has yet to discover that AI could?
There are ~ 20300 possible proteins
>>>> all the atoms in the Universe
96M protein sequences from
73,000 species (source RefSeq)
135,000 protein structures
yield 1221 folds (SCOPe 2.06)
AIMBE Academic Council 2904/08/18
AIMBE Academic Council 30
At DeepMind, which is based in London,
AlphaGo Zero is working out how proteins
fold, a massive scientific challenge that
could give drug discovery a sorely needed
shot in the arm.
04/08/18
04/08/18 AIMBE Academic Council 31
http://cartertoons.com/
How should academic institutions
think about exploiting data science?
04/08/18 AIMBE Academic Council 32
Organization: core data science
verticals
AIMBE Academic Council 33
Data Integration
& Engineering
Machine Learning
& Analytics
Visualization
& Dissemination
Data Acquisition Ethics, Law,
Policy,
Social Implications
04/08/18
Organization: interdisciplinary
horizontals
AIMBE Academic Council 34
Data Integration
& Engineering
Machine Learning
& Analytics
Visualization
Data Acquisition
& Dissemination
Ethics, Law,
Policy,
Social Implications
Biomedical Engineering
04/08/18
Data Acquisition
• Sensors
• Nanotechnology
• Imaging
• Unexpected sources e.g., DMV
AIMBE Academic Council 35gDOC04/08/18
Data Integration and
Engineering
• Ontologies
• Object identifiers
• Indexing schemes
• Common data models
AIMBE Academic Council 36gDOC04/08/18
Biomedical:
Machine Learning &
Analytics
• Neural nets
• Deep learning
• Natural Language
Processing (NLP)
• Gene expression &
neurological disease (Kipnis)
• Predicting opioid overdose
(VA Health)
• Predicting escalating care
and mortality risk of
cirrhosis patients (UVA HS)
• Human microbiome &
mental health in maternal
health (Psychology &
Nursing)
AIMBE Academic Council 37gDOC04/08/18
Biomedical:
Visualization
• Virtual Reality (VR)
• Networks
• Sonics
• Visualizing microbial
stability (Biology &
Systems)
AIMBE Academic Council 38gDOC04/08/18
Ethics, Law,
Policy & Social
Implications
• Data sharing
• Privacy
• Normativity
AIMBE Academic Council 39gDOC
Wendy Novicoff, Ph.D
04/08/18
Conclusion:
Driven by large amounts of open
digital data of different types and new
algorithms and approaches biomedical
researchers are destined to follow the
private sector towards the fourth
paradigm
04/08/18 AIMBE Academic Council 40
Acknowledgements
04/08/18 AIMBE Academic Council 41
The BD2K Team at NIH
My Colleagues at UVA
The 150 folks who have passed through my laboratory
https://docs.google.com/spreadsheets/d/1QZ48UaKcwDl_iFCvBmJsT03FK-bMchdfuIHe9Oxc-rw/edit#gid=0
Thank You
peb6a@virginia.edu
4204/08/18 AIMBE Academic Council

Weitere ähnliche Inhalte

Was ist angesagt?

Life in a Data Driven World
Life in a Data Driven WorldLife in a Data Driven World
Life in a Data Driven WorldPhilip Bourne
 
Perspectives from the African Open Science Platform/Susan Veldsman
Perspectives from the African Open Science Platform/Susan VeldsmanPerspectives from the African Open Science Platform/Susan Veldsman
Perspectives from the African Open Science Platform/Susan VeldsmanAfrican Open Science Platform
 
UK data management environment and support
UK data management environment and supportUK data management environment and support
UK data management environment and supportJisc
 
Research Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group SessionResearch Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group Sessionamiraryani
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
The Future of Data Science @ UVA
The Future of Data Science @ UVAThe Future of Data Science @ UVA
The Future of Data Science @ UVAPhilip Bourne
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?Philip Bourne
 
How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?Philip Bourne
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for librariesLEARN Project
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Academy of Science of South Africa (ASSAf)
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020Philip Bourne
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data Sharjeel Imtiaz
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Moving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisMoving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisPhilip Bourne
 

Was ist angesagt? (20)

It's Just Not FAIR
It's Just Not FAIRIt's Just Not FAIR
It's Just Not FAIR
 
Life in a Data Driven World
Life in a Data Driven WorldLife in a Data Driven World
Life in a Data Driven World
 
Perspectives from the African Open Science Platform/Susan Veldsman
Perspectives from the African Open Science Platform/Susan VeldsmanPerspectives from the African Open Science Platform/Susan Veldsman
Perspectives from the African Open Science Platform/Susan Veldsman
 
UK data management environment and support
UK data management environment and supportUK data management environment and support
UK data management environment and support
 
Research Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group SessionResearch Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group Session
 
African Open Science Platform
African Open Science PlatformAfrican Open Science Platform
African Open Science Platform
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
The Future of Data Science @ UVA
The Future of Data Science @ UVAThe Future of Data Science @ UVA
The Future of Data Science @ UVA
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data
 
The African Open Science Platform/Susan Veldsman
The African Open Science Platform/Susan VeldsmanThe African Open Science Platform/Susan Veldsman
The African Open Science Platform/Susan Veldsman
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Data and communication of research: incentives and disincentives
Data and communication of research: incentives and disincentivesData and communication of research: incentives and disincentives
Data and communication of research: incentives and disincentives
 
Moving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisMoving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT Analysis
 

Ähnlich wie Big Data and Data Science: Opportunities for Biomedical Engineering

Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Big Data and its Role in Biomedical Research
Big Data and its Role in Biomedical ResearchBig Data and its Role in Biomedical Research
Big Data and its Role in Biomedical ResearchPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?Philip Bourne
 
Research Data, or: How I Learned to Stop Worrying and Love the Policy
Research Data, or: How I Learned to Stop Worrying and Love the PolicyResearch Data, or: How I Learned to Stop Worrying and Love the Policy
Research Data, or: How I Learned to Stop Worrying and Love the PolicyTorsten Reimer
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataPhilip Bourne
 
Implications of the Fourth Paradigm
Implications of the Fourth ParadigmImplications of the Fourth Paradigm
Implications of the Fourth ParadigmPhilip Bourne
 
Research Data Management - Gaps and Opportunities
Research Data Management - Gaps and OpportunitiesResearch Data Management - Gaps and Opportunities
Research Data Management - Gaps and OpportunitiesMartin Hamilton
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research RequirementsICPSR
 
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Pistoia Alliance
 
"We have met the enemy and he is us": Professional, social, and financial cos...
"We have met the enemy and he is us": Professional, social, and financial cos..."We have met the enemy and he is us": Professional, social, and financial cos...
"We have met the enemy and he is us": Professional, social, and financial cos...amellison17
 
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Academy of Science of South Africa (ASSAf)
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17Tom Nyongesa
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeLiz Lyon
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxwahiba ben abdessalem
 
Incentives for modern research
Incentives for modern researchIncentives for modern research
Incentives for modern researchJisc
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 

Ähnlich wie Big Data and Data Science: Opportunities for Biomedical Engineering (20)

Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Big Data and its Role in Biomedical Research
Big Data and its Role in Biomedical ResearchBig Data and its Role in Biomedical Research
Big Data and its Role in Biomedical Research
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?
 
Research Data, or: How I Learned to Stop Worrying and Love the Policy
Research Data, or: How I Learned to Stop Worrying and Love the PolicyResearch Data, or: How I Learned to Stop Worrying and Love the Policy
Research Data, or: How I Learned to Stop Worrying and Love the Policy
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Implications of the Fourth Paradigm
Implications of the Fourth ParadigmImplications of the Fourth Paradigm
Implications of the Fourth Paradigm
 
Research Data Management - Gaps and Opportunities
Research Data Management - Gaps and OpportunitiesResearch Data Management - Gaps and Opportunities
Research Data Management - Gaps and Opportunities
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research Requirements
 
African Open Science Platform: Pilot Phase
African Open Science Platform: Pilot PhaseAfrican Open Science Platform: Pilot Phase
African Open Science Platform: Pilot Phase
 
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
 
"We have met the enemy and he is us": Professional, social, and financial cos...
"We have met the enemy and he is us": Professional, social, and financial cos..."We have met the enemy and he is us": Professional, social, and financial cos...
"We have met the enemy and he is us": Professional, social, and financial cos...
 
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data Decade
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Incentives for modern research
Incentives for modern researchIncentives for modern research
Incentives for modern research
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 

Mehr von Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in ResearchPhilip Bourne
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?Philip Bourne
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data SciencePhilip Bourne
 

Mehr von Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data Science
 

Kürzlich hochgeladen

Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 

Kürzlich hochgeladen (20)

Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 

Big Data and Data Science: Opportunities for Biomedical Engineering

  • 1. Big Data and Data Science: Opportunities for Biomedical Engineering Philip E. Bourne PhD, FACMI Stephenson Chair of Data Science Director, Data Science Institute Professor of Biomedical Engineering peb6a@virginia.edu https://www.slideshare.net/pebourne 04/08/18 AIMBE Academic Council 1 @pebourne
  • 2. Disclaimer • This is mostly NOT a talk about my own research • It draws upon my now one-year old view of NIH as the former Associate Director for Data Science (ADDS) • It suffers from my drinking my own Kool-aid at the University of Virginia 04/08/18 AIMBE Academic Council 2
  • 3. Take home (hopefully) • Increased awareness of the value of data science to your activities • Increased awareness of where NIH is headed • Some thoughts about how to build out data science in your own institutions 04/08/18 AIMBE Academic Council 3
  • 4. Big data and data science are like the Internet… If I asked you to define them you would all say something different, yet you use them every day… 04/08/18 AIMBE Academic Council 4 http://vadlo.com/cartoons.php?id=357
  • 5. So what do I mean by big data/data science? • Use of the ever increasing amount of open, complex, diverse digital data • Finding ways to ask and then answer relevant questions by combining such diverse data sets • Arriving at statistically significant conclusions not otherwise obtainable • Sharing such findings in a useful way • Translating such findings into actions that improve the human condition 04/08/18 AIMBE Academic Council 5
  • 6. Cause • There are ~2.7 Zetabytes (2.7 x 106 PB) of digital data • Volume is doubling every two years • Sheer volume of digital data e.g., $1000 genome, wearable sensors, mandatory EHRs • New tools e.g., Deep Artificial Neural Networks (DNNs) • New computing power e.g., GPUs 04/08/18 AIMBE Academic Council 6
  • 7. Effect • Big data currently estimated as a $50bn business – could save $3.1tn • 50% growth in data/yr; 5% growth in IT expenditure • US 140,000- 190,000 unfilled deep data analytics jobs • UVA DSI has 600 applicants this year for 50 spots; MSDS/MBA highly sought AIMBE Academic Council 704/08/18
  • 8. Effect ++ • Big data currently estimated as a $50bn business – could save $3.1tn – private sector research • 50% growth in data/yr; 5% growth in IT expenditure - undervalued • US 140,000- 190,000 unfilled deep data analytics jobs – competition for skilled researchers high • DSI has 600 applicants this year for 50 spots; MSDS/MBA highly sought – large human capital AIMBE Academic Council 804/08/18
  • 9. How much biomedical data? • Big Data – Total data from NIH-funded research in 2016 estimated at 650 PB* – 20 PB of that is in NCBI/NLM (3%) and it is expected to grow by 10 PB in 2016 • Dark Data – Only 12% of data described in published papers is in recognized archives – 88% is dark data^ • Cost – 2007-2014: NIH spent ~$1.2Bn extramurally on maintaining data archives * In 2012 Library of Congress was 3 PB ^ http://www.ncbi.nlm.nih.gov/pubmed/26207759 04/08/18 AIMBE Academic Council 9
  • 10. Consider some current high profile NIH examples where and how data science is being applied • Moonshot - platforms and integration, ML • MODs – automated curation • Human Microbiome Project – new cloud based tools, ML • TOPMed - platforms and integration • All-of-Us - platforms and integration • ECHO – platforms and integration • BRAIN - ML 10 All: Analytics, the Commons, FAIR, sustainability, workforce 04/08/18 AIMBE Academic Council
  • 11. What of the future? One view is the 6D’s 04/08/18 AIMBE Academic Council 11
  • 12. Digitization Deception Disruption Demonetization Dematerialization Democratization Time Volume,Velocity,Variety Digital camera invented by Kodak but shelved Megapixels & quality improve slowly; Kodak slow to react Film market collapses; Kodak goes bankrupt Phones replace cameras Instagram, Flickr become the value proposition Digital media becomes bona fide form of communication From a presentation to the Advisory Board to the NIH Director Example - photography 1204/08/18 AIMBE Academic Council
  • 13. A call for making these data open • Mandates – NIH, NSF, Data Management Plans • Business models can be protected yet everyone benefits • It saves lives …. 04/08/18 AIMBE Academic Council 13
  • 14. Why a More Open Process? Use case: Diffuse Intrinsic Pontine Gliomas (DIPG) • Occur 1:100,000 individuals • Peak incidence 6-8 years of age • Median survival 9-12 months • Surgery is not an option • Chemotherapy ineffective and radiotherapy only transitive From Adam Resnick04/08/18 AIMBE Academic Council 14
  • 15. Timeline of genomic studies in DIPG • Landmark studies identify histone mutations as recurrent driver mutations in DIPG ~2012 • Almost 3 years later, in largely the same datasets, but partially expanded, the same two groups and 2 others identify ACVR1 mutations as a secondary, co- occurring mutation From Adam Resnick 04/08/18 AIMBE Academic Council 15
  • 16. What do we need to do differently to reveal ACVR1? • ACVR1 is a targetable kinase • Inhibition of ACVR1 inhibited tumor progression in vitro • ~300 DIPG patients a year • ~60 are predicted to have ACVR1 • If large scale data sets were only integrated with TCGA and/or rare disease data in 2012, ACVR1 mutations would have been identified • 60 patients/year X 3 years = 180 children’s lives (who likely succumbed to the disease during that time) could have been impacted if only data were FAIR From Adam Resnick 04/08/18 AIMBE Academic Council 16
  • 17. How to promote departmental/institutional openness? • Encourage persistent identifiers e.g., ORCID • Encourage preprints • Encourage Open Access (OA) • Recognize openness in hiring and P&T • Teach open scholarship • Promote institutional openness – repositories, wikimedian in residence • Support institutional open data governance 04/08/18 AIMBE Academic Council 17
  • 18. NIH Strategic Plan for Data • Support a Highly Efficient and Effective Biomedical Research Data Infrastructure • Promote Modernization of the Data-Resources Ecosystem • Support the Development and Dissemination of Advanced Data Management, Analytics, and Visualization Tools • Enhance Workforce Development for Biomedical Data Science • Enact Appropriate Policies to Promote Stewardship and Sustainability 04/08/18 AIMBE Academic Council 18 https://grants.nih.gov/grants/rfi/NIH-Strategic-Plan-for-Data-Science.pdf
  • 19. Research Data Infrastructure … Both funders and some institutions see the need to move from pipes to platforms to accelerate research… 04/08/18 AIMBE Academic Council 19 https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model- 750x410.png
  • 20. If platforms are the answer we could ask the question… Will biomedical research become more like Airbnb? 04/08/18 AIMBE Academic Council 20 Vivien Bonazzi Should biomedical research be Like Airbnb? doi: 10.1371/journal.pbio.2001818
  • 21. I am not crazy, hear me out • Airbnb is a platform that supports a trusted relationship between consumer (renter) and supplier (host) • The platform focuses on maximizing the exchange of services between supplier and consumer and maximizing the amount of trust associated with a given stakeholder • It seems to be working: – 60 million users searching 2 million listings in 192 countries – Average of 500,000 stays per night. – Evaluation of US $25bn 04/08/18 AIMBE Academic Council 21 Should biomedical research be Like Airbnb? doi: 10.1371/journal.pbio.2001818
  • 22. Platforms will ultimately digitally integrate the scholarly workflow for human and machine analysis Should biomedical research be Like Airbnb? doi: 10.1371/journal.pbio.2001818 AIMBE Academic Council 2204/08/18
  • 23. Paper Author Paper Reader Data Provider Data Consumer Employer Employee Reagent Provider Reagent Consumer Software Provider Software Consumer Grant Writer Grant Reviewer Supplier Consumer Platform MS Project Google Drive Coursera Researchgate Academia.edu Open Science Framework Synapse F1000 Rio Educator Student Pilot Open Data Lab (ODL) underway AIMBE Academic Council 23gDOC04/08/18
  • 24. Why a comparison to Airbnb is not fair • Airbnb was born digital • The exchange of services on Airbnb are simple compared to what is required of a platform to support biomedical research Nevertheless there is much to be learnt 04/08/18 AIMBE Academic Council 24
  • 25. Impediments to a biomedical platform • Current work practices by all stakeholders • Entrenched business models • Size of the undertaking aka resources needed • Trust • Incentives to use the platform http://www.forbes.com/sites/johnhall/2013/04/29/1 0-barriers-to-employee-innovation/#8bdbaa811133 04/08/18 AIMBE Academic Council 25
  • 26. Such platforms combined with emerging analytics will likely have significant impact on biomedical engineering 04/08/18 AIMBE Academic Council 26
  • 27. Machine learning has been around for over 20 years – why now? • Amount of data available for training • Open source - R and python • Advances in computing (e.g., GPU’s) allow for deeper neural nets (deep learning) • Algorithmic efficiency gains (e.g., in back propagation) • Success promotes further research • Commercialization 04/08/18 AIMBE Academic Council 27 Pastur-Romay et al. 2016 doi:10.3390/ijms17081313
  • 28. Let me touch on our research in protein engineering oh so briefly…. 04/08/18 AIMBE Academic Council 28 Structural Biology Meets Data Science – Does Anything Change? Crowd Source: Current Opinions in Structural Biology 2018 https://docs.google.com/document/d/1rD3Qh1btTYlnGkKefN GSFVq8v_mqRNa8I0o5MP3ZMW4/edit
  • 29. Are their new scaffolds out there Nature has yet to discover that AI could? There are ~ 20300 possible proteins >>>> all the atoms in the Universe 96M protein sequences from 73,000 species (source RefSeq) 135,000 protein structures yield 1221 folds (SCOPe 2.06) AIMBE Academic Council 2904/08/18
  • 30. AIMBE Academic Council 30 At DeepMind, which is based in London, AlphaGo Zero is working out how proteins fold, a massive scientific challenge that could give drug discovery a sorely needed shot in the arm. 04/08/18
  • 31. 04/08/18 AIMBE Academic Council 31 http://cartertoons.com/
  • 32. How should academic institutions think about exploiting data science? 04/08/18 AIMBE Academic Council 32
  • 33. Organization: core data science verticals AIMBE Academic Council 33 Data Integration & Engineering Machine Learning & Analytics Visualization & Dissemination Data Acquisition Ethics, Law, Policy, Social Implications 04/08/18
  • 34. Organization: interdisciplinary horizontals AIMBE Academic Council 34 Data Integration & Engineering Machine Learning & Analytics Visualization Data Acquisition & Dissemination Ethics, Law, Policy, Social Implications Biomedical Engineering 04/08/18
  • 35. Data Acquisition • Sensors • Nanotechnology • Imaging • Unexpected sources e.g., DMV AIMBE Academic Council 35gDOC04/08/18
  • 36. Data Integration and Engineering • Ontologies • Object identifiers • Indexing schemes • Common data models AIMBE Academic Council 36gDOC04/08/18
  • 37. Biomedical: Machine Learning & Analytics • Neural nets • Deep learning • Natural Language Processing (NLP) • Gene expression & neurological disease (Kipnis) • Predicting opioid overdose (VA Health) • Predicting escalating care and mortality risk of cirrhosis patients (UVA HS) • Human microbiome & mental health in maternal health (Psychology & Nursing) AIMBE Academic Council 37gDOC04/08/18
  • 38. Biomedical: Visualization • Virtual Reality (VR) • Networks • Sonics • Visualizing microbial stability (Biology & Systems) AIMBE Academic Council 38gDOC04/08/18
  • 39. Ethics, Law, Policy & Social Implications • Data sharing • Privacy • Normativity AIMBE Academic Council 39gDOC Wendy Novicoff, Ph.D 04/08/18
  • 40. Conclusion: Driven by large amounts of open digital data of different types and new algorithms and approaches biomedical researchers are destined to follow the private sector towards the fourth paradigm 04/08/18 AIMBE Academic Council 40
  • 41. Acknowledgements 04/08/18 AIMBE Academic Council 41 The BD2K Team at NIH My Colleagues at UVA The 150 folks who have passed through my laboratory https://docs.google.com/spreadsheets/d/1QZ48UaKcwDl_iFCvBmJsT03FK-bMchdfuIHe9Oxc-rw/edit#gid=0