SlideShare ist ein Scribd-Unternehmen logo
1 von 36
1
1
T. K. Prasad (Krishnaprasad Thirunarayan )
Professor of Computer Science and Engineering
Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing
Wright State University, Dayton, OH-45435
Big Data and Smart Healthcare
Honors Institute Symposium on Visions of the Future
Big Data Processing and Smart Healthcare
Krishnaprasad Thirunarayan (T. K. Prasad)
Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing
Outline
• Extent and Economics of Healthcare Problem
• Nature of Health-related Big Data
• Cognitive Computing Goals
• Five V’s of Big Data Research
• Our Research
– Semantic Perception for Scalability
– Lightweight Semantics to Manage Heterogeneity
– Hybrid Knowledge Representation and Reasoning
• Anomaly, Correlation, Causation
03/20/2014 Prasad 3
Acute Decompensated Heart Failure (ADHF) Statistics
• Heart failure affects > 5 million people in the US.
• > 550,000 new cases are diagnosed each year.
• The estimated cost of heart failure in the US for
2008 is $34.8 billion.
• Approximately 25% of patients are re-hospitalized
within 30 days of discharge.
• Approximately 50% of patients are re-hospitalized
within 6 months of discharge.
03/20/2014 Prasad 4
Asthma Statistics
• Asthma affects > 25 million people in the US.
• > 7 million are children.
• The current reactive cost > $56 billion.
• Asthma is the third leading cause of hospitalization
with 800,000 emergency room visits among
children under the age of 15.
03/20/2014 Prasad 5
Obesity Statistics
03/20/2014 Prasad 6
• The number of severely obese (BMI ≥ 40)
patients has quadrupled between 1986 and
2000 from one in 200 to one in 50.
• Obesity-related medical treatment costs
> $150 billion a year.
• Hospitalizations of children and youths with
obesity doubled from 1999 to 2005.
Parkinson’s Disease (PD) Statistics
03/20/2014 Prasad 7
• In 2010, 630,000 people in the US had a
diagnosis of PD.
• The number of people with PD will
double by 2040.
• Just medical costs for people with PD is
$8.1 billion total.
The Patient of the Future
MIT Technology Review, 2012
http://www.technologyreview.com/featuredstory/426968/the-patient-of-the-future/ 8
Healthcare Related Big Data for Potential Exploitation:
Assorted Examples
• Sensor data: M. J. Fox Foundation Parkinson
disease challenge
• Other Applications: The healthcare industry spends
roughly $250 billion per year due to fraud.
03/20/2014 Prasad 9
Structured vs Unstructured Data
Patient Disorders ICD-9 Code
Patient1 Hypertension 401
Patient2 Atrial fibrillation 427.31
Patient1 Pulmonary hypertension 416
Patient3 Edema 782.3
Patient4 hyperthyroidism 242.9
Coronary artery disease, status post four-vessel coronary
artery bypass graft surgery on , by Dr. X with a left internal
mammary artery to the left anterior descending artery,
sequential vein graft to the ramus and first diagonal, and a vein
graft to the posterior descending artery. He had normal left
ventricular function. He is having some symptoms that are
unclear if they are angina or not. I am therefore going to get
him scheduled for an exercise Cardiolite stress test.
VS
Patient Data Distribution
Structured data
Unstructured data
Search Mining
Decision Support
Knowledge Discovery Prediction
NLP
+
Semantics
Nature of Processing
An Example
He is off both Diovan and Lotrel. I am unsure if it is due to underlying renal insufficiency. He
has actually been on atenolol alone for his hypertension.
Raw Text
Concepts
Knowledge
Inference
diovan lotrel
renal
insufficiency
atenolol hypertension
diovanvaltuna
valsartan
antihypertensive
agent
atenolol
tenominatenix
kidney
failure
renal
insufficiency
kidney
disease
disorder
blood pressure
disorder
hypertension
systoloc
hypertension
pulmonary
hypertension
Patient taking diovan
for hypertension
Patient has
kidney disease
Patient is on
antihypertensive drugs
is used to treat
is a
drug
disorder
Purpose of Big Data Analytics Vetted by Domain Experts
Data can help compensate for our overconfidence
in our own intuitions and reduce the extent to
which our desires distort our perceptions.
-- David Brooks of New York Times
However, inferred correlations require clear
justification that they are not coincidental, to
inspire confidence.
03/20/2014 Prasad 14
Cognitive Computing Systems
03/20/2014 Prasad 15
• Leverage Big Data using human experts to
enable better decisions.
– Process natural language and unstructured
data.
– Use of Artificial Intelligence (e.g., Machine
Learning algorithms) to
sense, infer, predict, abduce, and, in some
ways, think.
Check engine light analogy
Research Challenges : 5V’s of Big Data
Volume
Velocity
Variety
Veracity
Value
Big Data => Smart Data
03/20/2014 Prasad 16
Volume : (1) Semantic Perception
Semantic Perception : Volume => Value
Distill voluminous machine-sensed data
into human comprehensible nuggets
necessary for decision-making using
background knowledge
03/20/2014 Prasad 17
Parkinson’s Disease Use Case
03/20/2014 Prasad 20
Heart Failure Use Case
03/20/2014 Prasad 22
Asthma Use Case
03/20/2014 Prasad 23
Volume : (2) Exploiting Embarrassing Parallelism
03/20/2014 Prasad 24
Volume with a Twist
Resource-constrained reasoning on
mobile-devices
03/20/2014 Prasad 25
Cory Henson’s Thesis Statement
Machine perception can be
formalized using semantic web
technologies to derive abstractions
from sensor data using background
knowledge on the Web, and
efficiently executed on resource-
constrained devices.
03/20/2014 Prasad 26
* based on Neisser’s cognitive model of perception
Observe
Property
Perceive
Feature
Explanation
Discrimination
1
2
Perception Cycle* that exploits background knowledge / domain models
Abstracting raw data
for human
comprehension
Focus generation for
disambiguation and action
(incl. human in the loop)
Prior Knowledge
2703/20/2014 Prasad
O(n3) < x < O(n4) O(n)
Efficiency Improvement
• Problem size increased from 10’s to 1000’s of nodes
• Time reduced from minutes to milliseconds
• Complexity reduced from polynomial to linear
Evaluation on a mobile device
Prasad 35
36
kHealth: Health Signal Processing Architecture
Take Medication before going to work Avoid going out in the evening due to
high pollen levels
Domain ExpertsDomain Knowledge
Risk Model
Data Acquisition &
aggregation
Analysis
Personalized
Actionable
Information
Personal level
Signals
Public level
Signals
Population level
Signals
Events from
Social Streams
Contact doctor
kHealth Demo
• kHealth: http://www.youtube.com/watch?v=btnRi64hJp4
38
Variety
Syntactic and semantic heterogeneity
• in textual and sensor data,
• in social media and Web forums data
• In Electronic Medical Records
03/20/2014 Prasad 39
Variety (How?): (1) Granularity of Semantics & Applications
• Lightweight semantics: File and document-level
annotation to enable discovery and sharing
• Richer semantics: Data-level annotation and
extraction for semantic search and summarization
• Fine-grained semantics: Data
integration, interoperability and reasoning in
Linked Open Data
Cost-benefit trade-off and continuum
03/20/2014 Prasad 40
Variety (How?): (2) Hybrid KRR
Blending data-driven models with declarative
knowledge
– Data-gleaned models: Bottom-up, correlation-
based, statistical
– Expert-given KBs: Top-
down, causal/taxonomical, logical
– Refine structure to better estimate parameters
E.g., Medical Data Analytics using PGMs + KBs
03/20/2014 Prasad 42
Veracity
Scalable and Agile Big Data Analytics cannot
deliver value unless we have confidence and
trust in our data.
Open Problem:
Develop expressive frameworks for trust to
make explicit all aspects that go into trust
formation and inferences.
03/20/2014 Prasad 45
Veracity: Confession of sorts!
Trust is well-known,
but is not well-understood.
The utility of a notion testifies
not to its clarity but rather to the
philosophical importance of
clarifying it.
-- Nelson Goodman
(Fact, Fiction and Forecast, 1955)
03/20/2014 Prasad 46
(More on) Value
Discovering gaps and enriching domain models using
data
E.g., Semantics Driven Approach for Knowledge
Acquisition from EMRs
03/20/2014 Prasad 47
(More on) Value
Discovering drug-drug interaction by analyzing
search query logs
• E.g., The antidepressant, paroxetine, and the
cholesterol lowering drug, pravastatin, were
shown to interfere causing high blood sugar, by
correlated searches with “hyperglycemia”, “high
blood sugar” or “blurry vision”.
03/20/2014 Prasad 48
Conclusions
• Glimpse of our research organized around
the 5 V’s of Big Data
• Discussed role in harnessing Value
– Semantic Perception (Volume)
– Continuum of Semantic models to manage
Heterogeneity (Variety)
– Hybrid KRR: Probabilistic + Logical (Variety)
– Trust Models (Veracity)
03/20/2014 Prasad 49
thank you, and please visit us at
http://knoesis.org/
Department of Computer Science and Engineering
Wright State University, Dayton, Ohio, USA
Kno.e.sis: Ohio Center of Excellence in Knowledge-enabled Computing
Special Thanks to: Pramod Anantharam, Sujan Perera,
Dr. Cory Henson, Professor Amit Sheth
03/20/2014 Prasad 50

Weitere ähnliche Inhalte

Was ist angesagt?

Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsCarestream
 
Big Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Big Data Analytics in Hospitals By Dr.Mahboob ali khan PhdBig Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Big Data Analytics in Hospitals By Dr.Mahboob ali khan PhdHealthcare consultant
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsINFOGAIN PUBLICATION
 
2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision MedicineMichael Atkins
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Hellmuth Broda
 
Benefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A RevolutionBenefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A Revolutionijtsrd
 
Intelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIntelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIRJET Journal
 
A novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseA novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseeSAT Journals
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesIJRES Journal
 
A Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction TechniquesA Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction Techniquesijtsrd
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
 
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...Sivagowry Shathesh
 
Big Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical DevicesBig Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical DevicesPremNarayanan6
 
Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397Editor IJARCET
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease PredictionRajandeep Gill
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021Sean Manion PhD
 
How much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuationHow much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuationSean Manion PhD
 
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
 DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI... DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...Nexgen Technology
 

Was ist angesagt? (20)

Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analytics
 
Big Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Big Data Analytics in Hospitals By Dr.Mahboob ali khan PhdBig Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Big Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithms
 
2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine
 
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
 
Benefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A RevolutionBenefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A Revolution
 
Intelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIntelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosis
 
A novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseA novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy database
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
A Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction TechniquesA Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction Techniques
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
 
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
 
Big Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical DevicesBig Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical Devices
 
Final ppt
Final pptFinal ppt
Final ppt
 
Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease Prediction
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021
 
How much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuationHow much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuation
 
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
 DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI... DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
 

Ähnlich wie Big data healthcare

AI in Healthcare
AI in HealthcareAI in Healthcare
AI in HealthcarePaul Agapow
 
InformaticsLecture1.pdf
InformaticsLecture1.pdfInformaticsLecture1.pdf
InformaticsLecture1.pdfOgunsina1
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcareBYTE Project
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health careAravindharamanan S
 
Intelligent Healthcare Systems Mar 25 1a.pdf
Intelligent Healthcare Systems Mar 25 1a.pdfIntelligent Healthcare Systems Mar 25 1a.pdf
Intelligent Healthcare Systems Mar 25 1a.pdfssuser45b2b8
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...Paolo Missier
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Philip Bourne
 
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal health
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal healthJHIAPSMCON 2024 ; Presentation on Need for big data in tribal health
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal healthEx WHO/USAID
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncologyWarren Kibbe
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcarePerficient, Inc.
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AIXavier Amatriain
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Amit Sheth
 
Heart Diseases Diagnosis Using Data Mining Techniques
Heart Diseases Diagnosis Using Data Mining TechniquesHeart Diseases Diagnosis Using Data Mining Techniques
Heart Diseases Diagnosis Using Data Mining Techniquespaperpublications3
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Health
 

Ähnlich wie Big data healthcare (20)

AI in Healthcare
AI in HealthcareAI in Healthcare
AI in Healthcare
 
InformaticsLecture1.pdf
InformaticsLecture1.pdfInformaticsLecture1.pdf
InformaticsLecture1.pdf
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcare
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
Intelligent Healthcare Systems Mar 25 1a.pdf
Intelligent Healthcare Systems Mar 25 1a.pdfIntelligent Healthcare Systems Mar 25 1a.pdf
Intelligent Healthcare Systems Mar 25 1a.pdf
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal health
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal healthJHIAPSMCON 2024 ; Presentation on Need for big data in tribal health
JHIAPSMCON 2024 ; Presentation on Need for big data in tribal health
 
Day 1: Real-World Data Panel
Day 1: Real-World Data Panel Day 1: Real-World Data Panel
Day 1: Real-World Data Panel
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in Healthcare
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AI
 
Building a National Data Infrastructure to Advance Patient-Centered Comparati...
Building a National Data Infrastructure to Advance Patient-Centered Comparati...Building a National Data Infrastructure to Advance Patient-Centered Comparati...
Building a National Data Infrastructure to Advance Patient-Centered Comparati...
 
Nurses and Data Science
Nurses and Data ScienceNurses and Data Science
Nurses and Data Science
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...
 
Heart Diseases Diagnosis Using Data Mining Techniques
Heart Diseases Diagnosis Using Data Mining TechniquesHeart Diseases Diagnosis Using Data Mining Techniques
Heart Diseases Diagnosis Using Data Mining Techniques
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
 

Kürzlich hochgeladen

Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*
Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*
Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*Mumbai Call girl
 
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...Sheetaleventcompany
 
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...India Call Girls
 
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...India Call Girls
 
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...Sheetaleventcompany
 
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...India Call Girls
 
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...India Call Girls
 
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...India Call Girls
 
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...Sheetaleventcompany
 
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...Sheetaleventcompany
 
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...Rashmi Entertainment
 
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...Sheetaleventcompany
 
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...dilpreetentertainmen
 
BLOOD-Physio-D&R-Agam blood physiology notes
BLOOD-Physio-D&R-Agam blood physiology notesBLOOD-Physio-D&R-Agam blood physiology notes
BLOOD-Physio-D&R-Agam blood physiology notessurgeryanesthesiamon
 
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...Sheetaleventcompany
 
Top 20 Famous Indian Female Pornstars Name List 2024
Top 20 Famous Indian Female Pornstars Name List 2024Top 20 Famous Indian Female Pornstars Name List 2024
Top 20 Famous Indian Female Pornstars Name List 2024Sheetaleventcompany
 
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...Escorts In Kolkata
 
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...Sheetaleventcompany
 
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service Available
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service AvailableCall Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service Available
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service AvailableSheetaleventcompany
 
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service Chandigarh
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service ChandigarhCall Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service Chandigarh
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service ChandigarhSheetaleventcompany
 

Kürzlich hochgeladen (20)

Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*
Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*
Ulhasnagar Call girl escort *88638//40496* Call me monika call girls 24*
 
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...
💚Chandigarh Call Girls Service 💯Jiya 📲🔝8868886958🔝Call Girls In Chandigarh No...
 
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...
💞 Safe And Secure Call Girls Jabalpur 🧿 9332606886 🧿 High Class Call Girl Ser...
 
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...
💞 Safe And Secure Call Girls Nanded 🧿 9332606886 🧿 High Class Call Girl Servi...
 
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...
Independent Call Girls Service Chandigarh Sector 17 | 8868886958 | Call Girl ...
 
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls chhindwara 🧿 9332606886 🧿 High Class Call Girl S...
 
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...
💞 Safe And Secure Call Girls gaya 🧿 9332606886 🧿 High Class Call Girl Service...
 
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...
💞 Safe And Secure Call Girls Coimbatore 🧿 9332606886 🧿 High Class Call Girl S...
 
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...
Delhi Call Girl Service 📞8650700400📞Just Call Divya📲 Call Girl In Delhi No💰Ad...
 
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...
Low Rate Call Girls Udaipur {9xx000xx09} ❤️VVIP NISHA CCall Girls in Udaipur ...
 
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9815457724☎️ Call Girl service in Chandigarh☎️ C...
 
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...
❤️Chandigarh Escort Service☎️9814379184☎️ Call Girl service in Chandigarh☎️ C...
 
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...
🍑👄Ludhiana Escorts Service☎️98157-77685🍑👄 Call Girl service in Ludhiana☎️Ludh...
 
BLOOD-Physio-D&R-Agam blood physiology notes
BLOOD-Physio-D&R-Agam blood physiology notesBLOOD-Physio-D&R-Agam blood physiology notes
BLOOD-Physio-D&R-Agam blood physiology notes
 
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...
Gorgeous Call Girls In Pune {9xx000xx09} ❤️VVIP ANKITA Call Girl in Pune Maha...
 
Top 20 Famous Indian Female Pornstars Name List 2024
Top 20 Famous Indian Female Pornstars Name List 2024Top 20 Famous Indian Female Pornstars Name List 2024
Top 20 Famous Indian Female Pornstars Name List 2024
 
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...
Call Girls Service Amritsar Just Call 9352988975 Top Class Call Girl Service ...
 
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...
❤️Chandigarh Escorts☎️9814379184☎️ Call Girl service in Chandigarh☎️ Chandiga...
 
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service Available
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service AvailableCall Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service Available
Call Girls Goa Just Call 9xx000xx09 Top Class Call Girl Service Available
 
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service Chandigarh
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service ChandigarhCall Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service Chandigarh
Call Now ☎ 8868886958 || Call Girls in Chandigarh Escort Service Chandigarh
 

Big data healthcare

  • 1. 1 1 T. K. Prasad (Krishnaprasad Thirunarayan ) Professor of Computer Science and Engineering Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing Wright State University, Dayton, OH-45435 Big Data and Smart Healthcare Honors Institute Symposium on Visions of the Future
  • 2. Big Data Processing and Smart Healthcare Krishnaprasad Thirunarayan (T. K. Prasad) Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing
  • 3. Outline • Extent and Economics of Healthcare Problem • Nature of Health-related Big Data • Cognitive Computing Goals • Five V’s of Big Data Research • Our Research – Semantic Perception for Scalability – Lightweight Semantics to Manage Heterogeneity – Hybrid Knowledge Representation and Reasoning • Anomaly, Correlation, Causation 03/20/2014 Prasad 3
  • 4. Acute Decompensated Heart Failure (ADHF) Statistics • Heart failure affects > 5 million people in the US. • > 550,000 new cases are diagnosed each year. • The estimated cost of heart failure in the US for 2008 is $34.8 billion. • Approximately 25% of patients are re-hospitalized within 30 days of discharge. • Approximately 50% of patients are re-hospitalized within 6 months of discharge. 03/20/2014 Prasad 4
  • 5. Asthma Statistics • Asthma affects > 25 million people in the US. • > 7 million are children. • The current reactive cost > $56 billion. • Asthma is the third leading cause of hospitalization with 800,000 emergency room visits among children under the age of 15. 03/20/2014 Prasad 5
  • 6. Obesity Statistics 03/20/2014 Prasad 6 • The number of severely obese (BMI ≥ 40) patients has quadrupled between 1986 and 2000 from one in 200 to one in 50. • Obesity-related medical treatment costs > $150 billion a year. • Hospitalizations of children and youths with obesity doubled from 1999 to 2005.
  • 7. Parkinson’s Disease (PD) Statistics 03/20/2014 Prasad 7 • In 2010, 630,000 people in the US had a diagnosis of PD. • The number of people with PD will double by 2040. • Just medical costs for people with PD is $8.1 billion total.
  • 8. The Patient of the Future MIT Technology Review, 2012 http://www.technologyreview.com/featuredstory/426968/the-patient-of-the-future/ 8
  • 9. Healthcare Related Big Data for Potential Exploitation: Assorted Examples • Sensor data: M. J. Fox Foundation Parkinson disease challenge • Other Applications: The healthcare industry spends roughly $250 billion per year due to fraud. 03/20/2014 Prasad 9
  • 10. Structured vs Unstructured Data Patient Disorders ICD-9 Code Patient1 Hypertension 401 Patient2 Atrial fibrillation 427.31 Patient1 Pulmonary hypertension 416 Patient3 Edema 782.3 Patient4 hyperthyroidism 242.9 Coronary artery disease, status post four-vessel coronary artery bypass graft surgery on , by Dr. X with a left internal mammary artery to the left anterior descending artery, sequential vein graft to the ramus and first diagonal, and a vein graft to the posterior descending artery. He had normal left ventricular function. He is having some symptoms that are unclear if they are angina or not. I am therefore going to get him scheduled for an exercise Cardiolite stress test. VS
  • 11. Patient Data Distribution Structured data Unstructured data
  • 12. Search Mining Decision Support Knowledge Discovery Prediction NLP + Semantics Nature of Processing
  • 13. An Example He is off both Diovan and Lotrel. I am unsure if it is due to underlying renal insufficiency. He has actually been on atenolol alone for his hypertension. Raw Text Concepts Knowledge Inference diovan lotrel renal insufficiency atenolol hypertension diovanvaltuna valsartan antihypertensive agent atenolol tenominatenix kidney failure renal insufficiency kidney disease disorder blood pressure disorder hypertension systoloc hypertension pulmonary hypertension Patient taking diovan for hypertension Patient has kidney disease Patient is on antihypertensive drugs is used to treat is a drug disorder
  • 14. Purpose of Big Data Analytics Vetted by Domain Experts Data can help compensate for our overconfidence in our own intuitions and reduce the extent to which our desires distort our perceptions. -- David Brooks of New York Times However, inferred correlations require clear justification that they are not coincidental, to inspire confidence. 03/20/2014 Prasad 14
  • 15. Cognitive Computing Systems 03/20/2014 Prasad 15 • Leverage Big Data using human experts to enable better decisions. – Process natural language and unstructured data. – Use of Artificial Intelligence (e.g., Machine Learning algorithms) to sense, infer, predict, abduce, and, in some ways, think. Check engine light analogy
  • 16. Research Challenges : 5V’s of Big Data Volume Velocity Variety Veracity Value Big Data => Smart Data 03/20/2014 Prasad 16
  • 17. Volume : (1) Semantic Perception Semantic Perception : Volume => Value Distill voluminous machine-sensed data into human comprehensible nuggets necessary for decision-making using background knowledge 03/20/2014 Prasad 17
  • 18. Parkinson’s Disease Use Case 03/20/2014 Prasad 20
  • 19. Heart Failure Use Case 03/20/2014 Prasad 22
  • 21. Volume : (2) Exploiting Embarrassing Parallelism 03/20/2014 Prasad 24
  • 22. Volume with a Twist Resource-constrained reasoning on mobile-devices 03/20/2014 Prasad 25
  • 23. Cory Henson’s Thesis Statement Machine perception can be formalized using semantic web technologies to derive abstractions from sensor data using background knowledge on the Web, and efficiently executed on resource- constrained devices. 03/20/2014 Prasad 26
  • 24. * based on Neisser’s cognitive model of perception Observe Property Perceive Feature Explanation Discrimination 1 2 Perception Cycle* that exploits background knowledge / domain models Abstracting raw data for human comprehension Focus generation for disambiguation and action (incl. human in the loop) Prior Knowledge 2703/20/2014 Prasad
  • 25. O(n3) < x < O(n4) O(n) Efficiency Improvement • Problem size increased from 10’s to 1000’s of nodes • Time reduced from minutes to milliseconds • Complexity reduced from polynomial to linear Evaluation on a mobile device Prasad 35
  • 26. 36 kHealth: Health Signal Processing Architecture Take Medication before going to work Avoid going out in the evening due to high pollen levels Domain ExpertsDomain Knowledge Risk Model Data Acquisition & aggregation Analysis Personalized Actionable Information Personal level Signals Public level Signals Population level Signals Events from Social Streams Contact doctor
  • 27. kHealth Demo • kHealth: http://www.youtube.com/watch?v=btnRi64hJp4 38
  • 28. Variety Syntactic and semantic heterogeneity • in textual and sensor data, • in social media and Web forums data • In Electronic Medical Records 03/20/2014 Prasad 39
  • 29. Variety (How?): (1) Granularity of Semantics & Applications • Lightweight semantics: File and document-level annotation to enable discovery and sharing • Richer semantics: Data-level annotation and extraction for semantic search and summarization • Fine-grained semantics: Data integration, interoperability and reasoning in Linked Open Data Cost-benefit trade-off and continuum 03/20/2014 Prasad 40
  • 30. Variety (How?): (2) Hybrid KRR Blending data-driven models with declarative knowledge – Data-gleaned models: Bottom-up, correlation- based, statistical – Expert-given KBs: Top- down, causal/taxonomical, logical – Refine structure to better estimate parameters E.g., Medical Data Analytics using PGMs + KBs 03/20/2014 Prasad 42
  • 31. Veracity Scalable and Agile Big Data Analytics cannot deliver value unless we have confidence and trust in our data. Open Problem: Develop expressive frameworks for trust to make explicit all aspects that go into trust formation and inferences. 03/20/2014 Prasad 45
  • 32. Veracity: Confession of sorts! Trust is well-known, but is not well-understood. The utility of a notion testifies not to its clarity but rather to the philosophical importance of clarifying it. -- Nelson Goodman (Fact, Fiction and Forecast, 1955) 03/20/2014 Prasad 46
  • 33. (More on) Value Discovering gaps and enriching domain models using data E.g., Semantics Driven Approach for Knowledge Acquisition from EMRs 03/20/2014 Prasad 47
  • 34. (More on) Value Discovering drug-drug interaction by analyzing search query logs • E.g., The antidepressant, paroxetine, and the cholesterol lowering drug, pravastatin, were shown to interfere causing high blood sugar, by correlated searches with “hyperglycemia”, “high blood sugar” or “blurry vision”. 03/20/2014 Prasad 48
  • 35. Conclusions • Glimpse of our research organized around the 5 V’s of Big Data • Discussed role in harnessing Value – Semantic Perception (Volume) – Continuum of Semantic models to manage Heterogeneity (Variety) – Hybrid KRR: Probabilistic + Logical (Variety) – Trust Models (Veracity) 03/20/2014 Prasad 49
  • 36. thank you, and please visit us at http://knoesis.org/ Department of Computer Science and Engineering Wright State University, Dayton, Ohio, USA Kno.e.sis: Ohio Center of Excellence in Knowledge-enabled Computing Special Thanks to: Pramod Anantharam, Sujan Perera, Dr. Cory Henson, Professor Amit Sheth 03/20/2014 Prasad 50

Hinweis der Redaktion

  1. EVENT: Wright State Honors Institute Symposium “Visions of the Future” on Thursday, March 20, 2014. ABSTRACT:With the rapid proliferation of mobile phones, social media, and sensors, it is critical to collect and convert big data so generated into actionable information that is relevant for decision making. In this session, we explore challenges and approaches for synthesizing relevant background knowledge and inferences that can enable smart healthcare and ultimately benefit community at large.
  2. EVENT: Wright State Honors Institute Symposium “Visions of the Future” on Thursday, March 20, 2014. ABSTRACT:With the rapid proliferation of mobile phones, social media, and sensors, it is critical to collect and convert big data so generated into actionable information that is relevant for decision making. In this session, we explore challenges and approaches for synthesizing relevant background knowledgeand inferences that can enable smart healthcare and ultimately benefit community at large.-----------------------Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Social Applications Big Data Research: Sensor, Social, and Cyber-Physical Systems-----Our research thro the lens of big data.
  3. Statistics in terms of the number of people effected and costs involved Heterogeniety: Sensor data, social media data, text documents / forum posts, Semi-structured Electronic Medical RecordsIBM Vision: Machine-sensed data to human action by distilling the data into nuggets of actionable information and progressively improving decision making by learningNature of computational problems to be addressedOur technical work : Web 3.0
  4. Population of US : 315 million GDP : $16 trillionObama legislation Affordable Care Act : Hospital will not be reimbursed by medicare/medicaid insurance if patient readmitted within 30 daysChronic condition – can we help reduce preventable readmissions?CHF: Congestive Heart Failure
  5. Can we determine cause/potential triggers, predict asthma exacerbation to avoid, treat, or control symptoms.chronic obstructive pulmonary disease (COPD)
  6. Awareness important because it impacts overall healthQuantified Self
  7. Quality of life
  8. Larry Smarr is a professor at the University of California, San DiegoAnd he diagnosed himself with Crohn’s DiseaseHe is a pioneer in the area of Quantified-Self, which uses sensors to monitor physiological symptomsThrough this self-tracking process he discovered inflammation, which led him to discovery of Crohn’sDisease
  9. EMR: capture information exchanged during Doctor’s visit and tests data : disease/symptom/prescribed medications/suggested regimen(PHR: Personal Health Record)Social media engagement : self-reported data from public at large----------------------------Huge amount of raw data generated by continuous monitoring =&gt; (what we are lacking is) actionable nuggets of information for decision making (treatment/control/avoidance/change in lifestyle)-----------Quantified SelfMonitoring for disease diagnosis, severity, and progression-------Semantics-based approaches needed to deal with variety or to transcend abstraction levels--------
  10. ---------------------discovering “unexpected” correlations, and then seeking a transparent basis for them, seems worthy of pursuit. For instance, consider the controversies surrounding assertions such as ‘smoking causes cancer’, ‘high debt causes low growth’, ‘low growth causes high debt’, and ‘religious fanaticism breeds terrorists’.
  11. Jeopardy : WATSON beat out (crème de la crème) human competitorsBig Data growth is accelerating as more of the world&apos;s activity is expressed digitally.Process and make sense of it, and enhance and extend the expertise of humans. -----------------http://www.forbes.com/sites/matthewherper/2014/03/19/what-watson-cant-tell-us-about-our-genes-yet/-----------------Check engine light signals/alerts : on detecting -&gt; anomaly / problem =&gt; for further analysis / action--------
  12. Size, rate of flow/accumulation and change, (syntactic and semantic) heterogeneity, trustworthiness/quality (signal to noise ratio), end-use (nuggets of wisdom)(develop techniques to harness data to derive value for decision making in the presence of these challenges)
  13. What does semantic perception entail?Making sense of large amounts of low level data and communicating it in a meaningful waye.g. Ranges, aggregate/statistical measures ---------------------Semantic Perception: Converting Sensory Observations to Abstractions Using perception cycle and domain models: derive explanation, determine focus to disambiguate and discriminate for taking actionsHybrid reasoning: interleaved abductive and deductive components[**complex domain models reflecting comorbidities : high-fidelity models**] [**Gleaning Patterns from data**] [**Personalization**]
  14. Saffir Simpson Hurricane Wind ScaleHurricane/Typoon/Cyclone(5 catergories) / Tropical storm / Tropical depression vs TsunamiNational Oceanic and Atmospheric Administration (NOAA)
  15. ---------------------------ParkinsonMild(person) = Tremor(person) ∧ PoorBalance(person)ParkinsonModerate(person) = MoveSlow(person) ∧ PoorSleep(person) ∧ MonotoneSpeech(person)ParkinsonAdvanced(person) = Fall(person)----------------------------Loss of speech / food intake impossible / lack of balance =&gt; is there value in continuous monitoring? =&gt; Signatures for proactive control?----------------------------Dataset Characteristics: 8 weeks of data from 5 sensors on a smart phone, collected for 12 patients resulting in ~150 GB (with lot of missing data).--------------------------Control group vs PD patients distinguished on the basis of restricted motion, monotone speech, etc.
  16. Main idea: Prior knowledge of PD was used to facilitate its detection from massive sensor data by reducing the search spaceDetails:Declarative knowledge of PD includes PD severity and their symptoms as shown in the logical rule aboveEach PD severity level is a conjunction of a set of PD symptomsEach symptom was mapped to its manifestation in sensor observationsThe availability of declarative knowledge significantly improved the analytics by aiding feature selection processThe graphs above contrasts the physical movements and voice of two control group members and two PD patients
  17.  congestive heart failure / acute decompensated heart failure-- weight change due to water retention-------------------------------------------- cardiologist evaluate risk based on periodic monitoring data (+ human sensed health info inputs)--------------------------------------------Reduce preventable readmissions: 25% patients readmitted 30 day after discharge 50% patients readmitted within 6 months-------------------
  18. EVIDENCE-BASED Approach to diagnosis, treatment and control (IRB)Environmental: CO, CO2, NO, pollen counts, mold, dust, smoke, humidity, temperature, pressure, etc. (sensordrone, dust –smoke sensor, air quality egg)Physiological: Wheezometer (breathing), heart rate, etc25 million people in the U.S. are diagnosed with asthma (7 million are children).300 million people suffering from asthma worldwide.Asthma related healthcare costs alone are around $50 billion a year.155,000 hospital admissions and 593,000 emergency department visits in 2006.
  19. Volume: (1) semantic perception (2) parallelism
  20. An Efficient Bit Vector Approach to Semantics-Based Machine Perception in Resource-Constrained Devices.Resources: memory, cpu, power, …Healthcare use-case – privacy, mobility, cheap onboard sensors, personalization, power, convenience-considerations dominateAbstracting and summarizing multimodal machine sensed observations + human observations for actionable and human accessible situational awareness and decision making---------Characteristics of a big data problem: size of the data exceeds the resources available/needed to compute
  21. perception cycle contains interleaved iterative execution of two primary phasesExplanation (abductive)translating low-level signals into high-level abstractions inference to the best explanationDiscrimination (declarative)focusing attention on those properties that will help distinguish between multiple possible explanationsused to intelligently task sensors and collect additional observations (rather than brute force approach of blindly collecting all observations)-----------------------Ask human relevant questions
  22. perception cycle contains two primary phasesexplanationtranslating low-level signals into high-level abstractions inference to the best explanationdiscriminationfocusing attention on those properties that will help distinguish between multiple possible explanationsused to intelligently task sensors and collect additional observations (rather than brute force approach of blindly collecting all observations)
  23. Observe units on x and y axis : small vs large problem size; small vs large amount time Step as opposed to linear which reflects allocation in quantum of 1 word (32 or 64 bits)---------Size of the graph is plotted in terms of number of nodes as we hold one of feature/property fixedOtherwise, the size of the graph is o(n^2)
  24. Research on Asthma has three phases Data collection: what signals to collect?Analysis: what analysis to be done?Actionable information: what action to recommend?In the next slide, we take a peek into the analysis that we do for Asthma
  25. Syntactic : different data formatsSemantic :Conceptual modelsSemantic : multimodal sensing + different conceptual models--------------Complementary and corroborative information =&gt; complete and reliable/robust;---------------------------“Semantics Empowered Web 3.0” book
  26. Semantics at different levels of detail and developed in stages : ---------------------Ease of use by domain expertsFaster and wider adoption, promoting evolutionLow upfront cost to supportShallow semantics has wider applicability to a range of documents/data and appeal to a broader communityBottom-line: “Learn to Walk before we Run”------------------------------------------------------Controlled vocabularies &lt;= Lightweight ontologies [ legacy vocab + community agreed semantic relationships] &lt;= Formal ontologiesOriginal document vs its translation =&gt; traceability (provenance)---------Past Research: We have dealt with top-down UMLS ontology vs bottom-up facts from Pubmed in HPCO (Literature-based discovery -&gt; LBD)-----------------------------RECALL: materials and process specs typically describe: composition, processing, testing, and packaging of materialFormalizing a procedure (a process or a test) as an aggregation of characteristic/parameter-value pairs = LOD  Eventually allows combining and comparing specs==============================Biomaterials use case: Gold surface affinity of peptide sequence
  27. Semantic Perception and Hybrid KRR =&gt; Event, disease, human comprehensible features … (e.g., Parkinson, Asthma)--------------Slow traffic vs reason for it (accident vs tree fall): semantics to data : sensors monitoring traffic space-----------Cardiology use case – how a patient is feeling – giddy, depressed, etc.
  28. Idea : Glean statistical correlations from data (PGM) and enrich/validate it using symbolic knowledge (manually curated) orient undirected links, delete conflicting links, + complement nodes and links Explicit declarative knowledge obviates the need to generate it, especially in the context of sparse/skewed data PLUS it will be relaible------------Structure learning uncovers qualitative conditional dependencies integrate with declarative information using progressively expressive graphical models : same abstraction levelParameter learning using refined structure to estimate better fitting model
  29. Taxonomic : relating and organizing terms : nomenclature
  30. e.g., tides and ebbs caused by the alignment of earth, sun and moon, around full moon and new moon; “anomalous” orbits of Solar system planets w.r.t. the “circular” motion of stars in geocentric theory (‘planet’ is ‘wanderer’ in Greek) explained by heliocentrism and theory of gravitation, (Copernicus) correlation of time period and distance of planets (Kepler)and the “anomalous” precision of Mercury’s orbit clarified by General Theory of Relativity; (Einstein) C-peptide protein can be used to estimate insulin produced by a patient’s pancreas =&gt; ANOMALY (Copernicus) and REGULARITY (Kepler) =&gt; CAUSE (Newton)=&gt; (Newtonian Mechanics) =&gt; (General Theory of Relativity)Bold claims all the time in politicsBeer vs diaper; Walmart’s hurricanes vspoptarts ---------------------(4) Stress/spicy foods are correlated with peptic ulcers, but the latter are caused by Helicobacter Pyrolias demonstrated by Nobel Prize winning works of Marshall and Warren.ORIENTATION UNCLEAR: ‘high debt causes low growth’, ‘low growth causes high debt’, ------------------(5) Since the 1950s, both the atmospheric Carbon Dioxide level and obesity levels have increased sharply. (6) Pavlovian learning induced conditional reflex, and some of the financial market moves, seem to be classic cases of correlation turning into causation! ---------PARADOXES : THE SEEDS OF PROGRESSZeno’s paradox, Hydrostatic paradox, light speed constant in all reference frames, CBR, Expanding universe, …
  31. complementary and corroboratory
  32. EMR
  33. http://www.nytimes.com/2013/03/07/science/unreported-side-effects-of-drugs-found-using-internet-data-study-finds.html?_r=0They determined that people who searched for both drugs during the 12-month period were significantly more likely to search for terms related to hyperglycemia than were those who searched for just one of the drugs. They also found that people who did the searches for symptoms relating to both drugs were likely to do the searches in a short time period: 30 percent did the search on the same day, 40 percent during the same week and 50 percent during the same month.
  34. Semantic Perception : Hybrid Abductive/Deductive Reasoning (Volume)Cost-benefit trade-off and Continuum of Semantic models to manage Heterogeneity (Variety)Hybrid Knowledge Representation and Reasoning : Probabilisitc + Logical : structure + parameter estimation (Variety)