SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Late Binding in Data Warehouses:
Designing for Analytic Agility

Dale Sanders, Oct 2013
© 2013 Health Catalyst
www.healthcatalyst.com
© 2013 Health Catalyst
www.healthcatalyst.com
Overview
•

The concept of “binding” in software and data
engineering

•

Examples of data binding in healthcare

•

The two tests for early binding
•

•

The six points of binding in data warehouse design
•

•

Comprehensive & persistent agreement

Data Modeling vs. Late Binding

The importance of binding in analytic progression
•

Eight levels of analytic adoption in healthcare

© 2013 Health Catalyst
www.healthcatalyst.com
Late Binding in Software Engineering
1980s: Object Oriented Programming
●

Alan Kay Universities of Colorado & Utah, Xerox/PARC

●

Small objects of code, reflecting the real world

●

Compiled individually, linked at runtime, only as needed

●

Major agility and adaptability to address new use cases

Steve Jobs
●

NeXT computing

●

Commercial, large-scale adoption of Kay’s concepts

●

Late binding– or as late as practical– becomes the norm

●

Maybe Jobs’ largest contribution to computer science

3
© 2013 Health Catalyst
www.healthcatalyst.com
Late Binding in Data Engineering
Atomic data must be “bound” to business rules about that data and
to vocabularies related to that data in order to create information
Vocabulary binding in healthcare is pretty obvious
●
●
●
●

Unique patient and provider identifiers
Standard facility, department, and revenue center codes
Standard definitions for gender, race, ethnicity
ICD, CPT, SNOMED, LOINC, RxNorm, RADLEX, etc.

Examples of binding data to business rules
●
●
●
●
●
●
●

Length of stay
Patient relationship attribution to a provider
Revenue (or expense) allocation and projections to a department
Revenue (or expense) allocation and projections to a physician
Data definitions of general disease states and patient registries
Patient exclusion criteria from disease/population management
Patient admission/discharge/transfer rules

4
© 2013 Health Catalyst
www.healthcatalyst.com
Data Binding
Software
Programming

Pieces of
meaningless
data

112
60

Vocabulary
Binds
data to

“systolic &
diastolic
blood pressure”

Rules
“normal”

What’s the rule for declaring and managing a
“hypertensive patient”?
© 2013 Health Catalyst
www.healthcatalyst.com
Why Is This Concept Important?
Knowing when to bind data, and how
tightly, to vocabularies and rules is
THE KEY to analytic success and agility
Comprehensive
Agreement
Is the rule or vocabulary widely
accepted as true and accurate in
the organization or industry?

Persistent
Agreement
Is the rule or vocabulary stable
and rarely change?

Two tests for tight, early binding

Acknowledgements to
Mark Beyer of Gartner

6

© 2013 Health Catalyst
www.healthcatalyst.com
Six Binding Points in a Data Warehouse
SOURCE
DATA CONTENT
SUPPLIES

INTERNAL

CUSTOMIZED
DATA MARTS

CLINICAL

CLINICAL

FINANCIAL

FINANCIAL

QlikView

DISEASE REGISTRIES

SUPPLIES

DATA
ANALYSIS

MATERIALS MANAGEMENT

SOURCE SYSTEM
ANALYTICS

Microsoft Access/
ODBC

Excel

OPERATIONAL EVENTS

SAS, SPSS

RESEASRCH REGISTRIES

Et al

HR

OTHERS

EXTERNAL

Web applications

CLINICAL EVENTS

HR

COMPLIANCE AND PAYER
MEASURES

OTHERS

STATE
STATE
ACADEMIC
ACADEMIC

1

2

3

4

5

6

Data Rules and Vocabulary Binding Points
High Comprehension &
Persistence of vocabulary &
business rules? => Early binding

Low Comprehension and
Persistence of vocabulary or
business rules? => Late binding
© 2013 Health Catalyst
www.healthcatalyst.com
Data Modeling for Analytics
Five Basic Methodologies
●

Corporate Information Model

Early binding

‒ Popularized by Bill Inmon and Claudia Imhoff
●

I2B2
‒ Popularized by Academic Medicine

●

Star Schema
‒ Popularized by Ralph Kimball

●

Data Bus Architecture
‒ Popularized by Dale Sanders

●

File Structure Association
‒ Popularized by IBM mainframes in 1960s
‒ Reappearing in Hadoop & NoSQL
‒ No traditional relational data model

Late binding

© 2013 Health Catalyst
www.healthcatalyst.com
Binding to Analytic Relations
In data warehousing, the key is to relate data, not model data
Core Data Elements
Charge code
CPT code
Date & Time

In today’s environment, about 20 data elements
represent 80-90% of analytic use cases. This will
grow over time, but right now, it’s fairly simple.

DRG code
Drug code
Employee ID
Employer ID
Encounter ID

Source data
vocabulary Z
(e.g., EMR)

Gender
ICD diagnosis code
ICD procedure code
Department ID
Facility ID
Lab code
Patient type
Patient/member ID
Payer/carrier ID
Postal code
Provider ID

Source data
vocabulary Y
(e.g., Claims)

Source data
vocabulary X
(e.g., Rx)
The Bus Architecture
Client
Developed
Apps

Vendor Apps

Ad Hoc
Query Tools

Third Party
Apps

EMR

Claims

Rx

Cost

Patient Sat

Provider ID

Payer/carrier ID

Member ID

Patient type

Lab code

Facility ID

Department ID

ICD diagnosis code

Gender

Encounter ID

Employer ID

Employee ID

Drug code

DRG code

Date & Time

CPT code

Late Binding Bus Architecture

Etc.

© 2013 Health Catalyst
www.healthcatalyst.com
Healthcare Analytics Adoption Model
Level 8

Personalized Medicine
& Prescriptive Analytics

Tailoring patient care based on population outcomes and
genetic data. Fee-for-quality rewards health maintenance.

Level 7

Clinical Risk Intervention
& Predictive Analytics

Organizational processes for intervention are supported
with predictive risk models. Fee-for-quality includes fixed
per capita payment.

Level 6

Population Health Management
& Suggestive Analytics

Tailoring patient care based upon population metrics. Feefor-quality includes bundled per case payment.

Level 5

Waste & Care Variability Reduction

Reducing variability in care processes. Focusing on
internal optimization and waste reduction.

Level 4

Automated External Reporting

Efficient, consistent production of reports & adaptability to
changing requirements.

Level 3

Automated Internal Reporting

Efficient, consistent production of reports & widespread
availability in the organization.

Level 2

Standardized Vocabulary
& Patient Registries

Relating and organizing the core data content.

Level 1

Enterprise Data Warehouse

Collecting and integrating the core data content.

Level 0

Fragmented Point Solutions

Inefficient, inconsistent versions of the truth. Cumbersome
internal and external reporting.

© 2013 Health Catalyst
www.healthcatalyst.com
Progression in the Model
The patterns at each level
•

Data content expands
•

•

Data timeliness increases
•

•

To support faster decision cycles and lower “Mean Time To
Improvement”

Data governance expands
•

•

Adding new sources of data to expand our understanding of care
delivery and the patient

Advocating greater data access, utilization, and quality

The complexity of data binding and algorithms increases
•

From descriptive to prescriptive analytics

•

From “What happened?” to “What should we do?”

© 2013 Health Catalyst
www.healthcatalyst.com
The Expanding Ecosystem of Data Content
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

Real time 7x24 biometric monitoring
data for all patients in the ACO
Genomic data
Long term care facility data
Patient reported outcomes data*
Home monitoring data
Familial data
External pharmacy data
Bedside monitoring data
Detailed cost accounting data*
HIE data
Claims data
Outpatient EMR data
Inpatient EMR data
Imaging data
Lab data
Billing data

2-4 years

1-2 years

3-12 months

* - Not currently being addressed by vendor products
© 2013 Health Catalyst
www.healthcatalyst.com
Six Phases of Data Governance
You need to move through
these phases in no more
than two years
•

Phase 6: Acquisition of Data

•

Phase 5: Utilization of Data

•

Phase 4: Quality of Data

•

Phase 3: Stewardship of Data

•

Phase 2: Access to Data

•

2-4 years

Phase 1: Cultural Tone of “Data Driven”

1-2 years

3-12 months

© 2013 Health Catalyst
14
www.healthcatalyst.com
One Page Self Inspection Guide
Principles to Remember
1. Delay binding as long as possible… until a clear analytic

use case requires it
2. Earlier binding is appropriate for business rules or

vocabularies that change infrequently or that the
organization wants to “lock down” for consistent analytics
3. Late binding, in the visualization layer, is appropriate for

“what if” scenario analysis
4. Retain a record of the changes to vocabulary and rules

bindings in the data models of the data warehouse
●

Bake the history of vocabulary and business rules bindings into
the data models so you can retrace your analytic steps if need be

16

© 2013 Health Catalyst
www.healthcatalyst.com
Closing Words of Caution
Healthcare suffers from a low degree of
Comprehensive and Persistent agreement on
many topics that impact analytics

The vast majority of vendors and home grown
data warehouses bind to rules and vocabulary
too early and too tightly, in comprehensive
enterprise data models

Analytic agility and adaptability suffers greatly
• “We’ve been building our EDW for two years.”
• “I asked for that report last month.”

17

© 2013 Health Catalyst
www.healthcatalyst.com

Weitere ähnliche Inhalte

Was ist angesagt?

Data governance
Data governanceData governance
Data governance
SambaSoup
 
Data quality overview
Data quality overviewData quality overview
Data quality overview
Alex Meadows
 
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Alan McSweeney
 
ETIS09 - Data Quality: Common Problems & Checks - Presentation
ETIS09 -  Data Quality: Common Problems & Checks - PresentationETIS09 -  Data Quality: Common Problems & Checks - Presentation
ETIS09 - Data Quality: Common Problems & Checks - Presentation
David Walker
 

Was ist angesagt? (20)

Data warehouse proposal
Data warehouse proposalData warehouse proposal
Data warehouse proposal
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Building a Data Governance Strategy
Building a Data Governance StrategyBuilding a Data Governance Strategy
Building a Data Governance Strategy
 
Data Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceData Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data Intelligence
 
Data Quality Management: Cleaner Data, Better Reporting
Data Quality Management: Cleaner Data, Better ReportingData Quality Management: Cleaner Data, Better Reporting
Data Quality Management: Cleaner Data, Better Reporting
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
Hybrid Data Platform
Hybrid Data Platform Hybrid Data Platform
Hybrid Data Platform
 
Data governance
Data governanceData governance
Data governance
 
DAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best Practices
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Data Architecture Brief Overview
Data Architecture Brief OverviewData Architecture Brief Overview
Data Architecture Brief Overview
 
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
DataEd Webinar:  Reference & Master Data Management - Unlocking Business ValueDataEd Webinar:  Reference & Master Data Management - Unlocking Business Value
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
 
Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning
 
Implementing Effective Data Governance
Implementing Effective Data GovernanceImplementing Effective Data Governance
Implementing Effective Data Governance
 
Data quality overview
Data quality overviewData quality overview
Data quality overview
 
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
 
ETIS09 - Data Quality: Common Problems & Checks - Presentation
ETIS09 -  Data Quality: Common Problems & Checks - PresentationETIS09 -  Data Quality: Common Problems & Checks - Presentation
ETIS09 - Data Quality: Common Problems & Checks - Presentation
 

Ähnlich wie Late Binding in Data Warehouses

Healthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of AnalyticsHealthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of Analytics
Dale Sanders
 
Consumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and RetentionConsumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and Retention
Altegra Health
 
Data Management in Clinical Research
Data Management in Clinical ResearchData Management in Clinical Research
Data Management in Clinical Research
ijtsrd
 
Choosing an Analytics Solution in Healthcare
Choosing an Analytics Solution in HealthcareChoosing an Analytics Solution in Healthcare
Choosing an Analytics Solution in Healthcare
Dale Sanders
 
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
Health Catalyst
 
7 Essential Practices for Data Governance in Healthcare
7 Essential Practices for Data Governance in Healthcare7 Essential Practices for Data Governance in Healthcare
7 Essential Practices for Data Governance in Healthcare
Health Catalyst
 
PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)
Preeti Sirohi
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
Chase Hamilton
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
d-Wise Technologies
 
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataMicrosoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
Health Catalyst
 

Ähnlich wie Late Binding in Data Warehouses (20)

Late Binding: The New Standard For Data Warehousing
Late Binding: The New Standard For Data WarehousingLate Binding: The New Standard For Data Warehousing
Late Binding: The New Standard For Data Warehousing
 
Late Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic AgilityLate Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic Agility
 
Late-Binding Data Warehouse - An Update on the Fastest Growing Trend in Healt...
Late-Binding Data Warehouse - An Update on the Fastest Growing Trend in Healt...Late-Binding Data Warehouse - An Update on the Fastest Growing Trend in Healt...
Late-Binding Data Warehouse - An Update on the Fastest Growing Trend in Healt...
 
Healthcare Analytics Adoption Model
Healthcare Analytics Adoption ModelHealthcare Analytics Adoption Model
Healthcare Analytics Adoption Model
 
Healthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of AnalyticsHealthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of Analytics
 
Healthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of AnalyticsHealthcare 2.0: The Age of Analytics
Healthcare 2.0: The Age of Analytics
 
Accelerating Your Move to Value-Based Care
Accelerating Your Move to Value-Based CareAccelerating Your Move to Value-Based Care
Accelerating Your Move to Value-Based Care
 
Consumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and RetentionConsumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and Retention
 
Data Management in Clinical Research
Data Management in Clinical ResearchData Management in Clinical Research
Data Management in Clinical Research
 
Transforming Big Data into Big Value
Transforming Big Data into Big ValueTransforming Big Data into Big Value
Transforming Big Data into Big Value
 
Choosing an Analytics Solution in Healthcare
Choosing an Analytics Solution in HealthcareChoosing an Analytics Solution in Healthcare
Choosing an Analytics Solution in Healthcare
 
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
Reviewing the Healthcare Analytics Adoption Model: A Roadmap and Recipe for A...
 
7 Essential Practices for Data Governance in Healthcare
7 Essential Practices for Data Governance in Healthcare7 Essential Practices for Data Governance in Healthcare
7 Essential Practices for Data Governance in Healthcare
 
AI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdfAI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdf
 
PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)
 
Clinical Data Capture
Clinical Data CaptureClinical Data Capture
Clinical Data Capture
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
 
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataMicrosoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big Data
 
Data mining and data warehousing
Data mining and data warehousingData mining and data warehousing
Data mining and data warehousing
 

Mehr von Dale Sanders

The 12 Criteria of Population Health Management
The 12 Criteria of Population Health ManagementThe 12 Criteria of Population Health Management
The 12 Criteria of Population Health Management
Dale Sanders
 
An Overview of Disease Registries
An Overview of Disease RegistriesAn Overview of Disease Registries
An Overview of Disease Registries
Dale Sanders
 
OECD Health Indicators at a Glance
OECD Health Indicators at a GlanceOECD Health Indicators at a Glance
OECD Health Indicators at a Glance
Dale Sanders
 
HIMSS National Data Warehousing Webinar
HIMSS National Data Warehousing WebinarHIMSS National Data Warehousing Webinar
HIMSS National Data Warehousing Webinar
Dale Sanders
 

Mehr von Dale Sanders (20)

The Philosophy, Psychology, and Technology of Data in Healthcare
The Philosophy, Psychology, and  Technology of Data in HealthcareThe Philosophy, Psychology, and  Technology of Data in Healthcare
The Philosophy, Psychology, and Technology of Data in Healthcare
 
Healthcare Analytics Summit Keynote Fall 2017
Healthcare Analytics Summit Keynote Fall 2017Healthcare Analytics Summit Keynote Fall 2017
Healthcare Analytics Summit Keynote Fall 2017
 
AMDIS CHIME Fall Symposium
AMDIS CHIME Fall SymposiumAMDIS CHIME Fall Symposium
AMDIS CHIME Fall Symposium
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of Healthcare
 
Healthcare Best Practices in Data Warehousing & Analytics
Healthcare Best Practices in Data Warehousing & AnalyticsHealthcare Best Practices in Data Warehousing & Analytics
Healthcare Best Practices in Data Warehousing & Analytics
 
Is Big Data a Big Deal... or Not?
Is Big Data a Big Deal... or Not?Is Big Data a Big Deal... or Not?
Is Big Data a Big Deal... or Not?
 
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big DataMicrosoft: A Waking Giant in Healthcare Analytics and Big Data
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
 
Predicting the Future of Predictive Analytics in Healthcare
Predicting the Future of Predictive Analytics in HealthcarePredicting the Future of Predictive Analytics in Healthcare
Predicting the Future of Predictive Analytics in Healthcare
 
Precise Patient Registries for Clinical Research and Population Management
Precise Patient Registries for Clinical Research and Population ManagementPrecise Patient Registries for Clinical Research and Population Management
Precise Patient Registries for Clinical Research and Population Management
 
Break All The Rules: What the Leading Health Systems Do Differently with Anal...
Break All The Rules: What the Leading Health Systems Do Differently with Anal...Break All The Rules: What the Leading Health Systems Do Differently with Anal...
Break All The Rules: What the Leading Health Systems Do Differently with Anal...
 
Healthcare Billing and Reimbursement: Starting from Scratch
Healthcare Billing and Reimbursement: Starting from ScratchHealthcare Billing and Reimbursement: Starting from Scratch
Healthcare Billing and Reimbursement: Starting from Scratch
 
The 12 Criteria of Population Health Management
The 12 Criteria of Population Health ManagementThe 12 Criteria of Population Health Management
The 12 Criteria of Population Health Management
 
Population Health Management
Population Health ManagementPopulation Health Management
Population Health Management
 
Managing National Health: An Overview of Metrics & Options
Managing National Health: An Overview of Metrics & OptionsManaging National Health: An Overview of Metrics & Options
Managing National Health: An Overview of Metrics & Options
 
Healthcare Analytics Market Categorization
Healthcare Analytics Market CategorizationHealthcare Analytics Market Categorization
Healthcare Analytics Market Categorization
 
Strategic Options for Analytics in Healthcare
Strategic Options for Analytics in HealthcareStrategic Options for Analytics in Healthcare
Strategic Options for Analytics in Healthcare
 
An Overview of Disease Registries
An Overview of Disease RegistriesAn Overview of Disease Registries
An Overview of Disease Registries
 
OECD Health Indicators at a Glance
OECD Health Indicators at a GlanceOECD Health Indicators at a Glance
OECD Health Indicators at a Glance
 
Healthcare Analytics Adoption Model
Healthcare Analytics Adoption ModelHealthcare Analytics Adoption Model
Healthcare Analytics Adoption Model
 
HIMSS National Data Warehousing Webinar
HIMSS National Data Warehousing WebinarHIMSS National Data Warehousing Webinar
HIMSS National Data Warehousing Webinar
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

Late Binding in Data Warehouses

  • 1. Late Binding in Data Warehouses: Designing for Analytic Agility Dale Sanders, Oct 2013 © 2013 Health Catalyst www.healthcatalyst.com © 2013 Health Catalyst www.healthcatalyst.com
  • 2. Overview • The concept of “binding” in software and data engineering • Examples of data binding in healthcare • The two tests for early binding • • The six points of binding in data warehouse design • • Comprehensive & persistent agreement Data Modeling vs. Late Binding The importance of binding in analytic progression • Eight levels of analytic adoption in healthcare © 2013 Health Catalyst www.healthcatalyst.com
  • 3. Late Binding in Software Engineering 1980s: Object Oriented Programming ● Alan Kay Universities of Colorado & Utah, Xerox/PARC ● Small objects of code, reflecting the real world ● Compiled individually, linked at runtime, only as needed ● Major agility and adaptability to address new use cases Steve Jobs ● NeXT computing ● Commercial, large-scale adoption of Kay’s concepts ● Late binding– or as late as practical– becomes the norm ● Maybe Jobs’ largest contribution to computer science 3 © 2013 Health Catalyst www.healthcatalyst.com
  • 4. Late Binding in Data Engineering Atomic data must be “bound” to business rules about that data and to vocabularies related to that data in order to create information Vocabulary binding in healthcare is pretty obvious ● ● ● ● Unique patient and provider identifiers Standard facility, department, and revenue center codes Standard definitions for gender, race, ethnicity ICD, CPT, SNOMED, LOINC, RxNorm, RADLEX, etc. Examples of binding data to business rules ● ● ● ● ● ● ● Length of stay Patient relationship attribution to a provider Revenue (or expense) allocation and projections to a department Revenue (or expense) allocation and projections to a physician Data definitions of general disease states and patient registries Patient exclusion criteria from disease/population management Patient admission/discharge/transfer rules 4 © 2013 Health Catalyst www.healthcatalyst.com
  • 5. Data Binding Software Programming Pieces of meaningless data 112 60 Vocabulary Binds data to “systolic & diastolic blood pressure” Rules “normal” What’s the rule for declaring and managing a “hypertensive patient”? © 2013 Health Catalyst www.healthcatalyst.com
  • 6. Why Is This Concept Important? Knowing when to bind data, and how tightly, to vocabularies and rules is THE KEY to analytic success and agility Comprehensive Agreement Is the rule or vocabulary widely accepted as true and accurate in the organization or industry? Persistent Agreement Is the rule or vocabulary stable and rarely change? Two tests for tight, early binding Acknowledgements to Mark Beyer of Gartner 6 © 2013 Health Catalyst www.healthcatalyst.com
  • 7. Six Binding Points in a Data Warehouse SOURCE DATA CONTENT SUPPLIES INTERNAL CUSTOMIZED DATA MARTS CLINICAL CLINICAL FINANCIAL FINANCIAL QlikView DISEASE REGISTRIES SUPPLIES DATA ANALYSIS MATERIALS MANAGEMENT SOURCE SYSTEM ANALYTICS Microsoft Access/ ODBC Excel OPERATIONAL EVENTS SAS, SPSS RESEASRCH REGISTRIES Et al HR OTHERS EXTERNAL Web applications CLINICAL EVENTS HR COMPLIANCE AND PAYER MEASURES OTHERS STATE STATE ACADEMIC ACADEMIC 1 2 3 4 5 6 Data Rules and Vocabulary Binding Points High Comprehension & Persistence of vocabulary & business rules? => Early binding Low Comprehension and Persistence of vocabulary or business rules? => Late binding © 2013 Health Catalyst www.healthcatalyst.com
  • 8. Data Modeling for Analytics Five Basic Methodologies ● Corporate Information Model Early binding ‒ Popularized by Bill Inmon and Claudia Imhoff ● I2B2 ‒ Popularized by Academic Medicine ● Star Schema ‒ Popularized by Ralph Kimball ● Data Bus Architecture ‒ Popularized by Dale Sanders ● File Structure Association ‒ Popularized by IBM mainframes in 1960s ‒ Reappearing in Hadoop & NoSQL ‒ No traditional relational data model Late binding © 2013 Health Catalyst www.healthcatalyst.com
  • 9. Binding to Analytic Relations In data warehousing, the key is to relate data, not model data Core Data Elements Charge code CPT code Date & Time In today’s environment, about 20 data elements represent 80-90% of analytic use cases. This will grow over time, but right now, it’s fairly simple. DRG code Drug code Employee ID Employer ID Encounter ID Source data vocabulary Z (e.g., EMR) Gender ICD diagnosis code ICD procedure code Department ID Facility ID Lab code Patient type Patient/member ID Payer/carrier ID Postal code Provider ID Source data vocabulary Y (e.g., Claims) Source data vocabulary X (e.g., Rx)
  • 10. The Bus Architecture Client Developed Apps Vendor Apps Ad Hoc Query Tools Third Party Apps EMR Claims Rx Cost Patient Sat Provider ID Payer/carrier ID Member ID Patient type Lab code Facility ID Department ID ICD diagnosis code Gender Encounter ID Employer ID Employee ID Drug code DRG code Date & Time CPT code Late Binding Bus Architecture Etc. © 2013 Health Catalyst www.healthcatalyst.com
  • 11. Healthcare Analytics Adoption Model Level 8 Personalized Medicine & Prescriptive Analytics Tailoring patient care based on population outcomes and genetic data. Fee-for-quality rewards health maintenance. Level 7 Clinical Risk Intervention & Predictive Analytics Organizational processes for intervention are supported with predictive risk models. Fee-for-quality includes fixed per capita payment. Level 6 Population Health Management & Suggestive Analytics Tailoring patient care based upon population metrics. Feefor-quality includes bundled per case payment. Level 5 Waste & Care Variability Reduction Reducing variability in care processes. Focusing on internal optimization and waste reduction. Level 4 Automated External Reporting Efficient, consistent production of reports & adaptability to changing requirements. Level 3 Automated Internal Reporting Efficient, consistent production of reports & widespread availability in the organization. Level 2 Standardized Vocabulary & Patient Registries Relating and organizing the core data content. Level 1 Enterprise Data Warehouse Collecting and integrating the core data content. Level 0 Fragmented Point Solutions Inefficient, inconsistent versions of the truth. Cumbersome internal and external reporting. © 2013 Health Catalyst www.healthcatalyst.com
  • 12. Progression in the Model The patterns at each level • Data content expands • • Data timeliness increases • • To support faster decision cycles and lower “Mean Time To Improvement” Data governance expands • • Adding new sources of data to expand our understanding of care delivery and the patient Advocating greater data access, utilization, and quality The complexity of data binding and algorithms increases • From descriptive to prescriptive analytics • From “What happened?” to “What should we do?” © 2013 Health Catalyst www.healthcatalyst.com
  • 13. The Expanding Ecosystem of Data Content • • • • • • • • • • • • • • • • Real time 7x24 biometric monitoring data for all patients in the ACO Genomic data Long term care facility data Patient reported outcomes data* Home monitoring data Familial data External pharmacy data Bedside monitoring data Detailed cost accounting data* HIE data Claims data Outpatient EMR data Inpatient EMR data Imaging data Lab data Billing data 2-4 years 1-2 years 3-12 months * - Not currently being addressed by vendor products © 2013 Health Catalyst www.healthcatalyst.com
  • 14. Six Phases of Data Governance You need to move through these phases in no more than two years • Phase 6: Acquisition of Data • Phase 5: Utilization of Data • Phase 4: Quality of Data • Phase 3: Stewardship of Data • Phase 2: Access to Data • 2-4 years Phase 1: Cultural Tone of “Data Driven” 1-2 years 3-12 months © 2013 Health Catalyst 14 www.healthcatalyst.com
  • 15. One Page Self Inspection Guide
  • 16. Principles to Remember 1. Delay binding as long as possible… until a clear analytic use case requires it 2. Earlier binding is appropriate for business rules or vocabularies that change infrequently or that the organization wants to “lock down” for consistent analytics 3. Late binding, in the visualization layer, is appropriate for “what if” scenario analysis 4. Retain a record of the changes to vocabulary and rules bindings in the data models of the data warehouse ● Bake the history of vocabulary and business rules bindings into the data models so you can retrace your analytic steps if need be 16 © 2013 Health Catalyst www.healthcatalyst.com
  • 17. Closing Words of Caution Healthcare suffers from a low degree of Comprehensive and Persistent agreement on many topics that impact analytics The vast majority of vendors and home grown data warehouses bind to rules and vocabulary too early and too tightly, in comprehensive enterprise data models Analytic agility and adaptability suffers greatly • “We’ve been building our EDW for two years.” • “I asked for that report last month.” 17 © 2013 Health Catalyst www.healthcatalyst.com