SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
DATA WAREHOUSE BASICS
a presentation by
W H Inmon
The data warehouse
- a definition
A subject oriented, non volatile,
integrated, time variant collection
of data for the support of management’s
decisions
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Granular, detailed data and lots of it
Data that can be shaped and reshaped
A foundation of reconcilability
A basis for new, unknown analysis
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
key
time
primary data
secondary data
What a typical record of the data warehouse
looks like
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
key
An identifier
Unique or non unique
Often a compound key
May be natural or blind
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
time
Time variancy
- continuous
- from date/to date
- periodic discrete
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Name
Address
Phone
Zip
Email
…….. A continuous
time span record
from
date
to
date
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Name
Address
Phone
Zip
Email
……..
from
date
to
date
Name
Address
Phone
Zip
Email
……..
from
date
to
date
Name
Address
Phone
Zip
Email
……..
from
date
to
date
A sequence of time span records
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
No overlap
Discontinuity is a possibility
999000 From the beginning of time to the end of time
Continuous time span data
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Periodic discrete structure
Jan 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Feb 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Mar 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Apr 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
The notion of taking a snapshot as of some one
moment in time
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Periodic discrete structure
Jan 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Feb 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Mar 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
Apr 1
Expenses
Revenues
No of employees
Stock price
Price per share
………………….
The structure says nothing about values as of any other
date
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Periodic discrete structure
For few variables
For slow changing variables
Continuous time span data
For many variables
For quickly changing variables
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Primary data
Primary data relates directly to the key
Example – key – ssno
- primary data – name, date of birth
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Secondary data
Secondary data relates directly to
the primary data
Example – key – ssno
- primary data – name, date of birth
- secondary data – address, zip, phone
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
The granular data in the
data warehouse –
- serves as a basis for
many other forms of DSS
- is instantly available
- forms a foundation of
reconcilability
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Relational
structures Star joins
requirements
The data warehouse is shaped by the data model;
The star join world is shaped by requirements
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Often called
Multi dimensional data
Often called
Atomic data
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
applications
Legacy data
Operational data
Transactional data
Atomic
data
Data
warehouse
The source of data warehouse data
is the operational environment
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
m/f
1/0
x/y
male/
female
gender
m/f
integration of data in the data warehouse
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
inches
cms
feet
miles
unit of
measure
cms
units of measurement need
to be integrated
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
ETL
Extract/transform/load
The integration and conversion of data
is the most difficult part of the data warehouse
process
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
Transformation code can
be generated manually or
automatically.
Automatically is always
preferred
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
The functions performed
by the ETL process are
not trivial -
Convert
Reformat
Add time element
Restructure
New key
Add default values
Change dbms
Change operating system
Summarize
Break into multiple records
Convert key structure
Merge records
Collect metadata
Conform to data model
Select data/reject data
Add indexes
Change encoding
Change hardware environments
Resequence data
Ascii to ebcdic;ebcdic to ascii
Partition data
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
ETL performed in host
environment
ETL performed in
source environment
ETL processing can be
performed in different places
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C
data warehouse –
at the center of the
decision making of
the corporation
Forest
Rim
Technology
Copyright Inmon Consulting Services, 2008C

Weitere ähnliche Inhalte

Ähnlich wie [db tech showcase Tokyo 2015] DATA WAREHOUSE BASICS by Wiliiam Inmon

Optim test data management for IMS 2011
Optim test data management for IMS 2011Optim test data management for IMS 2011
Optim test data management for IMS 2011
evgeni77
 
TUW-ASE- Summer 2004: Data marketplaces: core models and concepts
TUW-ASE- Summer 2004: Data marketplaces: core models and conceptsTUW-ASE- Summer 2004: Data marketplaces: core models and concepts
TUW-ASE- Summer 2004: Data marketplaces: core models and concepts
Hong-Linh Truong
 
Optim Insync10 Paul Griffin presentation
Optim Insync10 Paul Griffin presentationOptim Insync10 Paul Griffin presentation
Optim Insync10 Paul Griffin presentation
InSync Conference
 

Ähnlich wie [db tech showcase Tokyo 2015] DATA WAREHOUSE BASICS by Wiliiam Inmon (20)

ISSA Boston - PCI and Beyond: A Cost Effective Approach to Data Protection
ISSA Boston - PCI and Beyond: A Cost Effective Approach to Data ProtectionISSA Boston - PCI and Beyond: A Cost Effective Approach to Data Protection
ISSA Boston - PCI and Beyond: A Cost Effective Approach to Data Protection
 
Optim test data management for IMS 2011
Optim test data management for IMS 2011Optim test data management for IMS 2011
Optim test data management for IMS 2011
 
TUW-ASE- Summer 2004: Data marketplaces: core models and concepts
TUW-ASE- Summer 2004: Data marketplaces: core models and conceptsTUW-ASE- Summer 2004: Data marketplaces: core models and concepts
TUW-ASE- Summer 2004: Data marketplaces: core models and concepts
 
ADB NYSE Show JM
ADB NYSE Show JMADB NYSE Show JM
ADB NYSE Show JM
 
StorageWorks Business Continuity & Availability Solutions-Hp-8sept2010
StorageWorks Business Continuity & Availability Solutions-Hp-8sept2010StorageWorks Business Continuity & Availability Solutions-Hp-8sept2010
StorageWorks Business Continuity & Availability Solutions-Hp-8sept2010
 
Smarter Data Protection And Storage Management Solutions
Smarter Data Protection And Storage Management SolutionsSmarter Data Protection And Storage Management Solutions
Smarter Data Protection And Storage Management Solutions
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
TUW-ASE Summer 2015: Data marketplaces:  core models and conceptsTUW-ASE Summer 2015: Data marketplaces:  core models and concepts
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
 
Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
How to Secure your Data Lake
How to Secure your Data LakeHow to Secure your Data Lake
How to Secure your Data Lake
 
How to secure your data lake
How to secure your data lakeHow to secure your data lake
How to secure your data lake
 
Optim Insync10 Paul Griffin presentation
Optim Insync10 Paul Griffin presentationOptim Insync10 Paul Griffin presentation
Optim Insync10 Paul Griffin presentation
 
70a1bee5afaa499bff2de1419845-2545809091213.ppt
70a1bee5afaa499bff2de1419845-2545809091213.ppt70a1bee5afaa499bff2de1419845-2545809091213.ppt
70a1bee5afaa499bff2de1419845-2545809091213.ppt
 
SBDS
SBDSSBDS
SBDS
 
Data Protection Presentation
Data Protection PresentationData Protection Presentation
Data Protection Presentation
 
Stephen Kennett
Stephen KennettStephen Kennett
Stephen Kennett
 
Stephen Kennett presentation
Stephen Kennett   presentationStephen Kennett   presentation
Stephen Kennett presentation
 

Mehr von Insight Technology, Inc.

コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
Insight Technology, Inc.
 

Mehr von Insight Technology, Inc. (20)

グラフデータベースは如何に自然言語を理解するか?
グラフデータベースは如何に自然言語を理解するか?グラフデータベースは如何に自然言語を理解するか?
グラフデータベースは如何に自然言語を理解するか?
 
Docker and the Oracle Database
Docker and the Oracle DatabaseDocker and the Oracle Database
Docker and the Oracle Database
 
Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~
Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~
Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~
 
事例を通じて機械学習とは何かを説明する
事例を通じて機械学習とは何かを説明する事例を通じて機械学習とは何かを説明する
事例を通じて機械学習とは何かを説明する
 
仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン
仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン
仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン
 
MBAAで覚えるDBREの大事なおしごと
MBAAで覚えるDBREの大事なおしごとMBAAで覚えるDBREの大事なおしごと
MBAAで覚えるDBREの大事なおしごと
 
グラフデータベースは如何に自然言語を理解するか?
グラフデータベースは如何に自然言語を理解するか?グラフデータベースは如何に自然言語を理解するか?
グラフデータベースは如何に自然言語を理解するか?
 
DBREから始めるデータベースプラットフォーム
DBREから始めるデータベースプラットフォームDBREから始めるデータベースプラットフォーム
DBREから始めるデータベースプラットフォーム
 
SQL Server エンジニアのためのコンテナ入門
SQL Server エンジニアのためのコンテナ入門SQL Server エンジニアのためのコンテナ入門
SQL Server エンジニアのためのコンテナ入門
 
Lunch & Learn, AWS NoSQL Services
Lunch & Learn, AWS NoSQL ServicesLunch & Learn, AWS NoSQL Services
Lunch & Learn, AWS NoSQL Services
 
db tech showcase2019オープニングセッション @ 森田 俊哉
db tech showcase2019オープニングセッション @ 森田 俊哉 db tech showcase2019オープニングセッション @ 森田 俊哉
db tech showcase2019オープニングセッション @ 森田 俊哉
 
db tech showcase2019 オープニングセッション @ 石川 雅也
db tech showcase2019 オープニングセッション @ 石川 雅也db tech showcase2019 オープニングセッション @ 石川 雅也
db tech showcase2019 オープニングセッション @ 石川 雅也
 
db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー
db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー
db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー
 
難しいアプリケーション移行、手軽に試してみませんか?
難しいアプリケーション移行、手軽に試してみませんか?難しいアプリケーション移行、手軽に試してみませんか?
難しいアプリケーション移行、手軽に試してみませんか?
 
Attunityのソリューションと異種データベース・クラウド移行事例のご紹介
Attunityのソリューションと異種データベース・クラウド移行事例のご紹介Attunityのソリューションと異種データベース・クラウド移行事例のご紹介
Attunityのソリューションと異種データベース・クラウド移行事例のご紹介
 
そのデータベース、クラウドで使ってみませんか?
そのデータベース、クラウドで使ってみませんか?そのデータベース、クラウドで使ってみませんか?
そのデータベース、クラウドで使ってみませんか?
 
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
 
複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。
複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。 複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。
複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。
 
Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...
Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...
Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...
 
レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]
レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]
レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Kürzlich hochgeladen (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

[db tech showcase Tokyo 2015] DATA WAREHOUSE BASICS by Wiliiam Inmon

  • 1. Forest Rim Technology Copyright Inmon Consulting Services, 2008C DATA WAREHOUSE BASICS a presentation by W H Inmon
  • 2. The data warehouse - a definition A subject oriented, non volatile, integrated, time variant collection of data for the support of management’s decisions Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 3. Granular, detailed data and lots of it Data that can be shaped and reshaped A foundation of reconcilability A basis for new, unknown analysis Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 4. key time primary data secondary data What a typical record of the data warehouse looks like Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 5. key An identifier Unique or non unique Often a compound key May be natural or blind Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 6. time Time variancy - continuous - from date/to date - periodic discrete Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 7. Name Address Phone Zip Email …….. A continuous time span record from date to date Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 9. No overlap Discontinuity is a possibility 999000 From the beginning of time to the end of time Continuous time span data Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 10. Periodic discrete structure Jan 1 Expenses Revenues No of employees Stock price Price per share …………………. Feb 1 Expenses Revenues No of employees Stock price Price per share …………………. Mar 1 Expenses Revenues No of employees Stock price Price per share …………………. Apr 1 Expenses Revenues No of employees Stock price Price per share …………………. The notion of taking a snapshot as of some one moment in time Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 11. Periodic discrete structure Jan 1 Expenses Revenues No of employees Stock price Price per share …………………. Feb 1 Expenses Revenues No of employees Stock price Price per share …………………. Mar 1 Expenses Revenues No of employees Stock price Price per share …………………. Apr 1 Expenses Revenues No of employees Stock price Price per share …………………. The structure says nothing about values as of any other date Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 12. Periodic discrete structure For few variables For slow changing variables Continuous time span data For many variables For quickly changing variables Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 13. Primary data Primary data relates directly to the key Example – key – ssno - primary data – name, date of birth Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 14. Secondary data Secondary data relates directly to the primary data Example – key – ssno - primary data – name, date of birth - secondary data – address, zip, phone Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 15. The granular data in the data warehouse – - serves as a basis for many other forms of DSS - is instantly available - forms a foundation of reconcilability Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 16. Relational structures Star joins requirements The data warehouse is shaped by the data model; The star join world is shaped by requirements Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 17. Often called Multi dimensional data Often called Atomic data Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 18. applications Legacy data Operational data Transactional data Atomic data Data warehouse The source of data warehouse data is the operational environment Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 19. m/f 1/0 x/y male/ female gender m/f integration of data in the data warehouse Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 20. inches cms feet miles unit of measure cms units of measurement need to be integrated Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 21. ETL Extract/transform/load The integration and conversion of data is the most difficult part of the data warehouse process Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 22. Transformation code can be generated manually or automatically. Automatically is always preferred Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 23. The functions performed by the ETL process are not trivial - Convert Reformat Add time element Restructure New key Add default values Change dbms Change operating system Summarize Break into multiple records Convert key structure Merge records Collect metadata Conform to data model Select data/reject data Add indexes Change encoding Change hardware environments Resequence data Ascii to ebcdic;ebcdic to ascii Partition data Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 24. ETL performed in host environment ETL performed in source environment ETL processing can be performed in different places Forest Rim Technology Copyright Inmon Consulting Services, 2008C
  • 25. data warehouse – at the center of the decision making of the corporation Forest Rim Technology Copyright Inmon Consulting Services, 2008C