• Data Vault Modeling & Approach
• DW2.0 & Unstructured Data
• Master Data Management
• Agile DW

Data Vault: Modeling the Agile Data Warehouse
Webinar Event, Q4 2013

Hans P. Hultgren
gohansgo
© 2013 Genesee Academy, LLC
Data Vault: Modeling the Agile DW
AGENDA

• Welcome
• Background
• Unified Decomposition & Modeling Ensemble
• Data Vault Hubs, Links and Satellites
• Working with Data Vault
• Extreme Data Warehouse Agility
• Architecture
• Information Modeling
• Succeeding with the Agile Data Warehouse

About Hans Hultgren:
Author, Advisor, Speaker & Industry Analyst; President, Genesee Academy LLC; Principal at TopofMinds
Book available on Amazon.com

Affecto Webinar Event Q4 2013

The Data Vault modeling approach
• The Data Vault is a data modeling approach
…so it fits into the family of modeling approaches:
– 3rd Normal Form
– Data Vault
– Dimensional

• While 3rd Normal Form is optimal for Operational Systems
…and Dimensional is optimal for Data Marts
…the Data Vault is optimal for the Data Warehouse (EDW)

Data Vault Benefits
• Business
– Ability to adapt quickly to new business needs
– Data is traceable, allowing for a fully auditable, integrated data store
– Allows the EDW to absorb all data, all of the time
– Easily adapts to new data sources and changing business rules without expensive re-engineering
– Results in a Data Warehouse with lower total cost of ownership (TCO)

• Projects
– Ideal for agile development techniques, resulting in lower project risk and more frequent deliverables
– Can be built incrementally without compromising the core architecture

• Architecture
– Parallel loading and restartability
– Architecture that supports future expanded scope
– Can scale to virtually any size without breaking down

A Saga of Data Warehousing
Once upon a time data warehousing was becoming more popular and
everyone was eager to build their own. But whenever they tried they failed.
They called upon their best to fix this but they just couldn’t solve the
problem.
They discovered that meeting the needs of the data warehouse meant that
the tables got too big and too hard to work with. They just could not handle
changes over time. If the smallest thing changed it always meant they had
to change the entire table. When just a single attribute was updated they
had to insert a record for all of the attributes. All seemed lost.
But around the world there were rebels who questioned the conventional
wisdom. And their voices were finally heard: Why not separate the things
that change from the things that don’t change?

Unified Decomposition™
• Separating the things that change from the things that don’t change.
• Break things out into component parts for flexibility, and to capture things that
– are interpreted in different ways, or
– change independently of each other
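
As a rough illustration of this idea, the sketch below splits one flat customer record into a key part and two groups of attributes that change at different rates. The record, the attribute names and the grouping are hypothetical and chosen only for illustration; they do not come from the slides.

```python
from typing import Any

# Hypothetical flat source record; attribute names are illustrative only.
flat_customer = {
    "customer_no": "C-1001",   # business key: identifies the customer, does not change
    "name": "Acme AB",         # assumed to change rarely
    "country": "SE",           # assumed to change rarely
    "rating": "A",             # assumed to change often
    "credit_limit": 50000,     # assumed to change often
}

# Assumed grouping of attributes by how they change (one group per satellite).
SATELLITE_GROUPS = {
    "S_Customer_Profile": ["name", "country"],
    "S_Customer_Rating": ["rating", "credit_limit"],
}


def decompose(record: dict[str, Any], key_field: str) -> dict[str, dict[str, Any]]:
    """Split one flat record into a key part and one part per attribute group."""
    parts: dict[str, dict[str, Any]] = {"H_Customer": {key_field: record[key_field]}}
    for satellite, fields in SATELLITE_GROUPS.items():
        parts[satellite] = {field: record[field] for field in fields}
    return parts


ensemble = decompose(flat_customer, "customer_no")
```

With this split, a change to `rating` touches only the `S_Customer_Rating` part; the key and the profile attributes are left alone.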

Ensemble Modeling™
• The constellation of component parts acts as a whole – an Ensemble.
All the parts of a thing taken together, so that
each part is considered only in relation to the whole.

• With Ensemble Modeling the Core Business Concepts that we define
and model are represented as a whole – an ensemble – including all of
the component parts.
The Data Vault Ensemble
• The Data Vault Ensemble conforms to a single key – embodied in the Hub
construct.

• The component parts for the Data Vault Ensemble include:
– Hub: The Natural Business Key
– Link: The Natural Business Relationships
– Satellite: All Context, Descriptive Data and History

Hubs
– A Hub Construct in Data Vault
• contains Business Key
• only the Business Key
• contains No Context
• is always 1:1 with EWBK (Enterprise Wide Business Key)

– A Hub Table contains only
• Business Key
• Surrogate Key (Data Warehouse)
• Load Date / Time Stamp
• Record Source

[Diagram: table H_Customer with columns H_Customer_SID, Business Key, Date/Time Stamp, Record source]

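
A minimal sketch of a Hub table and its load, assuming an in-memory SQLite database. The table and surrogate key names follow the slide's H_Customer example; the other column names, the `load_hub` helper and the sample keys are assumptions made for illustration.

```python
import sqlite3
from datetime import datetime, timezone

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE H_Customer (
        H_Customer_SID INTEGER PRIMARY KEY,   -- surrogate key (data warehouse)
        business_key   TEXT NOT NULL UNIQUE,  -- exactly one row per business key
        load_dts       TEXT NOT NULL,         -- load date / time stamp
        record_source  TEXT NOT NULL
    )
    """
)


def load_hub(business_key: str, record_source: str) -> int:
    """Insert the business key if it is new, then return its surrogate key."""
    conn.execute(
        "INSERT OR IGNORE INTO H_Customer (business_key, load_dts, record_source) "
        "VALUES (?, ?, ?)",
        (business_key, datetime.now(timezone.utc).isoformat(), record_source),
    )
    row = conn.execute(
        "SELECT H_Customer_SID FROM H_Customer WHERE business_key = ?",
        (business_key,),
    ).fetchone()
    return row[0]


first = load_hub("C-1001", "CRM")
again = load_hub("C-1001", "Billing")  # same key from another source: no new row
other = load_hub("C-2002", "CRM")
hub_rows = conn.execute("SELECT COUNT(*) FROM H_Customer").fetchone()[0]
```

Because the Hub carries no context, loading the same business key a second time is a no-op, which is one way to read the "always 1:1" rule above.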
Links
– A Link Construct in Data Vault
• contains Relationship
• only a Relationship
• contains No Context
• is always 1:1 with Relationship
– Unique
– Specific
– Natural Business Relationship

– A Link Table contains only
• 2-n FKs for the Relationship
• Surrogate Key (Data Warehouse)
• Load Date / Time Stamp
• Record Source

[Diagram: Hub H_Customer (H_Customer_SID, Business Key, Date/Time Stamp, Record source) connected to Link L_Cust_Class (L_Cust_Class_SID, H_Customer_SID, H_Sequence2_SID, Date/Time Stamp, Record source)]

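
A minimal sketch of a Link table between two Hubs, again assuming an in-memory SQLite database. L_Cust_Class and H_Customer are named on the slide; the second Hub (`H_Class`), the sample keys and the `load_link` helper are hypothetical stand-ins.

```python
import sqlite3
from datetime import datetime, timezone

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(
    """
    CREATE TABLE H_Customer (H_Customer_SID INTEGER PRIMARY KEY, business_key TEXT UNIQUE);
    CREATE TABLE H_Class    (H_Class_SID    INTEGER PRIMARY KEY, business_key TEXT UNIQUE);
    CREATE TABLE L_Cust_Class (
        L_Cust_Class_SID INTEGER PRIMARY KEY,                       -- surrogate key
        H_Customer_SID   INTEGER NOT NULL REFERENCES H_Customer,    -- FK 1
        H_Class_SID      INTEGER NOT NULL REFERENCES H_Class,       -- FK 2
        load_dts         TEXT NOT NULL,
        record_source    TEXT NOT NULL,
        UNIQUE (H_Customer_SID, H_Class_SID)   -- one row per unique relationship
    );
    INSERT INTO H_Customer VALUES (1, 'C-1001');
    INSERT INTO H_Class VALUES (1, 'GOLD'), (2, 'SILVER');
    """
)


def load_link(customer_sid: int, class_sid: int, record_source: str) -> None:
    """Record the relationship once; a repeated combination is ignored."""
    conn.execute(
        "INSERT OR IGNORE INTO L_Cust_Class "
        "(H_Customer_SID, H_Class_SID, load_dts, record_source) VALUES (?, ?, ?, ?)",
        (customer_sid, class_sid, datetime.now(timezone.utc).isoformat(), record_source),
    )


load_link(1, 1, "CRM")
load_link(1, 1, "CRM")  # same relationship again: ignored
load_link(1, 2, "CRM")  # customer moves to another class: a new Link row
link_rows = conn.execute("SELECT COUNT(*) FROM L_Cust_Class").fetchone()[0]
```

In this sketch a changed relationship adds a row to the Link and leaves both Hubs untouched.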
Satellites
– A Satellite Construct in Data Vault
• contains Context only
• has no FKs (no relationships)
• Designed by: Rate of Change, Type of Data, System…

– A Satellite Table contains only
• Business Key FK + Load Date / Time Stamp
• Context Data…
• Record Source

[Diagram: Satellite S_Customer (H_Customer_SID, Date/Time Stamp, Context A, Context B, Context C, Context D, Record source) attached to Hub H_Customer (H_Customer_SID, Business Key, Date/Time Stamp, Record source)]

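
A minimal sketch of a Satellite that keeps history, assuming an in-memory SQLite database. The key structure (Hub key plus load date/time stamp) follows the slide; the `S_Customer_Rating` table, its `rating` column and the rule of inserting a row only when the context differs from the latest one are assumptions based on common practice, not something the slides spell out.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE S_Customer_Rating (
        H_Customer_SID INTEGER NOT NULL,   -- key of the parent Hub
        load_dts       TEXT NOT NULL,      -- load date / time stamp
        rating         TEXT,               -- context data
        record_source  TEXT NOT NULL,
        PRIMARY KEY (H_Customer_SID, load_dts)
    )
    """
)


def load_satellite(customer_sid: int, load_dts: str, rating: str, record_source: str) -> bool:
    """Insert a new row only when the context differs from the latest row."""
    latest = conn.execute(
        "SELECT rating FROM S_Customer_Rating WHERE H_Customer_SID = ? "
        "ORDER BY load_dts DESC LIMIT 1",
        (customer_sid,),
    ).fetchone()
    if latest is not None and latest[0] == rating:
        return False  # nothing changed, so no new history row
    conn.execute(
        "INSERT INTO S_Customer_Rating VALUES (?, ?, ?, ?)",
        (customer_sid, load_dts, rating, record_source),
    )
    return True


# Hypothetical daily loads: the rating is unchanged on the second day.
loads = [("2013-10-01", "A"), ("2013-10-02", "A"), ("2013-10-03", "B"), ("2013-10-04", "C")]
inserted = [load_satellite(1, dts, rating, "CRM") for dts, rating in loads]
history = conn.execute(
    "SELECT load_dts, rating FROM S_Customer_Rating ORDER BY load_dts"
).fetchall()
```

Each change adds one narrow row to this Satellite only; other Satellites on the same Hub would not receive a new row.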
Sample: Sales Data Vault Model
[Diagram not reproduced]

Sample Model: Sales DV Model - Backbone
[Diagram not reproduced]

Data Vault means thinking differently
• The minimal construct then for an “entity” such as “Customer” is now a Hub with a set of Satellites
[Diagram: Customer]

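
One way to picture "a Hub with a set of Satellites" is to reassemble the current view of a customer by joining the Hub to the latest row of each Satellite. The sketch below assumes an in-memory SQLite database; the two Satellites, their columns and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE H_Customer (H_Customer_SID INTEGER PRIMARY KEY, business_key TEXT UNIQUE);
    CREATE TABLE S_Customer_Name   (H_Customer_SID INTEGER, load_dts TEXT, name TEXT);
    CREATE TABLE S_Customer_Rating (H_Customer_SID INTEGER, load_dts TEXT, rating TEXT);
    INSERT INTO H_Customer VALUES (1, 'C-1001');
    INSERT INTO S_Customer_Name VALUES (1, '2013-10-01', 'Acme AB');
    INSERT INTO S_Customer_Rating VALUES (1, '2013-10-01', 'A'), (1, '2013-10-03', 'B');
    """
)

# Join the Hub to the most recent row of each Satellite.
CURRENT_CUSTOMER = """
SELECT h.business_key, n.name, r.rating
FROM H_Customer AS h
JOIN S_Customer_Name AS n
  ON n.H_Customer_SID = h.H_Customer_SID
 AND n.load_dts = (SELECT MAX(load_dts) FROM S_Customer_Name
                   WHERE H_Customer_SID = h.H_Customer_SID)
JOIN S_Customer_Rating AS r
  ON r.H_Customer_SID = h.H_Customer_SID
 AND r.load_dts = (SELECT MAX(load_dts) FROM S_Customer_Rating
                   WHERE H_Customer_SID = h.H_Customer_SID)
"""

current = conn.execute(CURRENT_CUSTOMER).fetchall()
```

The history stays in the Satellites; the query only picks the latest row from each when a single "entity" view is needed.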
Comparing the Models
[Diagram panels: Operational, Data Warehouse, Data Mart]

A Customer Rating Changes 3 times…
[Diagram panels: Operational, Data Warehouse, Data Mart]

A New Attribute is Added to Address…
[Diagram panels: Operational, Data Warehouse, Data Mart]

Relationship to Cust_Class Changes…
[Diagram panels: Operational, Data Warehouse, Data Mart]

Fundamental Architecture
[Diagram not reproduced. Labeled components: Staging, Raw, BDW, Data Marts, EDW, Information Model. Labeled process steps: Extract, D/T Stamp, Load, Integrate, Profile, Validate, Cleanse, Convert, Calculate, Transform.]

Succeeding with the Agile DW
Success depends on applying an agile modeling methodology. This can only be accomplished if the program considers the people, processes, tools and techniques together.
[Diagram labels: Data Warehouse, Data Marts, Enterprise Data Warehouse]

About Data Vault Ensemble
An estimated 800 Data Vault-based Data Warehouses around the world

Links and Information
CDVDM Training & Certification: www.GeneseeAcademy.com, Hans@GeneseeAcademy.com
gohansgo
Book: DataVaultBook.blogspot.com
HansHultgren.WordPress.com
HansHultgren
DataVaultAcademy
Online video-lesson training: DataVaultAcademy.com

Data Vault Affecto Nordics Webinar Q4 2013
