SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
Tech Notes



Why Data Warehouse Projects Fail
Using Schema Examination Tools to Ensure Information Quality,
Schema Compliance, and Project Success

Embarcadero Technologies

January 2008




Corporate Headquarters        EMEA Headquarters         Asia-Pacific Headquarters
100 California Street, 12th   York House                L7. 313 La Trobe Street
Floor                         18 York Road              Melbourne VIC 3000
San Francisco, California     Maidenhead, Berkshire     Australia
94111                         SL6 1SF, United Kingdom
Why Data Warehouse Projects Fail



According to a 2003 Gartner report, more than 50 percent of data warehouse projects
failed, and the ones that survived were delivered very late with extremely high costs. In
a 2007 study, Gartner predicted once more that 50 percent of data warehouse
projects would have limited acceptance or be outright failures as a result of lack of
attention to data quality issues.


DATA QUALITY IS NOT ENOUGH
Data Quality is one of the hottest topics in any IT shop. Although very important, Data
Quality is far from being enough because decisions are based on information, not on
data. Having quality data does not assure quality information. To have quality
information, it is necessary to have quality data, but this is not sufficient on its own. We
need more.


IT IS ALL ABOUT THE DATABASE SCHEMA
Information is produced by an application program that accesses data in a database,
usually a relational database such as Oracle, DB2, Sybase, SQL Server, etc. The core of
the database is the database schema, wherein are stored all the data definitions, the
relationships between the data, and the business rules.

The quality of the information depends on 3 things: (1) the quality of the data itself, (2)
the quality of the application programs and (3) the quality of the database schema.

Joe Celko (www.celko.com), a very well known expert and consultant in relational
technology, states that without a quality database schema, it is very difficult to:
• Achieve good program performance and
• Deliver quality information
When developing any database application, we must always ensure the database
schema has integrity and consistency – and no flaws. This must be done when the
schema is created, and every time it is changed. If the database schema has flaws, the
information will be flawed and the Data Warehouse projects will fail.


MODELING TOOLS ARE NOT ENOUGH
Database schemas are normally created using a modeling tool such as ERwin®,
ER/Studio®, or PowerDesigner®. These tools validate the data model for completeness of
the model, but they do not have the intelligence to “debug” the data model.


QUALITY OF FEEDER SYSTEMS
Data Warehouse projects depend on feeder systems. If the database schemas of the
feeder systems have flaws, the information produced by the data warehouse will not
have quality. This is the major reason why data warehouse projects fail. The database



Embarcadero Technologies                                                                -1-
Why Data Warehouse Projects Fail


schemas of the feeder systems must be validated for consistency, integrity and
compliance to the rules of the relational technology before a data warehouse project
is initiated. This is where Embarcadero® Schema Examiner™ comes in.


SCHEMA EXAMINER
Schema Examiner was created to fill this gap, providing a means to “debug” the
schema. Schema Examiner provides over 50 diagnostics to assure the schema adheres
to the relational model, is consistent and has integrity. Schema Examiner can validate
the data model, a set of SQL/DDL scripts or the database schema directly. Schema
Examiner can also compare schemas, indicating the differences.

Manual validation is impossible due to the size and complexity of today’s database
schemas.




A SUCCESSFUL DATA WAREHOUSE PROJECT
A corporation in the telecom business contracted with one of the major consulting
companies to develop a large data warehouse project. The cost of the project was $10


Embarcadero Technologies                                                          -2-
Why Data Warehouse Projects Fail


million. After the project was in production, they discovered that the quality of the
information was not good; many answers were inconsistent. They considered to re-do
the entire project or even scrap it.

Their committee suggested the hiring of a consultant. The consultant used Schema
Examiner and after a couple of weeks of analyzing the feeder systems, he made
several suggestions based upon the findings of Schema Examiner. Once the
recommendations were adopted, the results improved dramatically and the project
was a total success.

The client has stated that the success of the project was due to the use of Schema
Examiner.

They immediately purchased an enterprise license and made it mandatory to use
Schema Examiner in all their IT projects, internal or external.

The project used 4 Oracle feeder system that were “debugged” using Schema
Examiner; they were also compared to each other to discover inconsistencies. Below is
a graphical representation of the project.


SUMMARY
•   About half of all Data Warehouse projects fail due to poor data quality (Gartner
    Group)
•   Data Quality is not enough - decisions are based on information quality, not on data
    quality
•   A flawed schema impacts negatively on information quality
•   Database schemas must be validated for compliance with the rules of relational
    technology
•    Modeling tools validate data models / schemas for completeness, not for
    compliance
•   Data Warehouse feeder systems schemas must be validated for compliance
•   Schema Examiner validates schemas from feeder systems and compares them to
    verify inconsistencies




Embarcadero Technologies                                                            -3-
Embarcadero Technologies, Inc. is a leading provider of award-winning tools for
application developers and database professionals so they can design systems right,
build them faster and run them better, regardless of their platform or programming
language. Ninety of the Fortune 100 and an active community of more than three
million users worldwide rely on Embarcadero products to increase productivity, reduce
costs, simplify change management and compliance and accelerate innovation. The
company’s flagship tools include: Embarcadero® Change Manager™, CodeGear™
RAD Studio, DBArtisan®, Delphi®, ER/Studio®, JBuilder® and Rapid SQL®. Founded in 1993,
Embarcadero is headquartered in San Francisco, with offices located around the world.
Embarcadero is online at www.embarcadero.com.

Weitere ähnliche Inhalte

Mehr von Embarcadero Technologies

Getting Started Building Mobile Applications for iOS and Android
Getting Started Building Mobile Applications for iOS and AndroidGetting Started Building Mobile Applications for iOS and Android
Getting Started Building Mobile Applications for iOS and Android
Embarcadero Technologies
 

Mehr von Embarcadero Technologies (20)

Python for Delphi Developers - Part 2
Python for Delphi Developers - Part 2Python for Delphi Developers - Part 2
Python for Delphi Developers - Part 2
 
Python for Delphi Developers - Part 1 Introduction
Python for Delphi Developers - Part 1 IntroductionPython for Delphi Developers - Part 1 Introduction
Python for Delphi Developers - Part 1 Introduction
 
RAD Industrial Automation, Labs, and Instrumentation
RAD Industrial Automation, Labs, and InstrumentationRAD Industrial Automation, Labs, and Instrumentation
RAD Industrial Automation, Labs, and Instrumentation
 
Embeddable Databases for Mobile Apps: Stress-Free Solutions with InterBase
Embeddable Databases for Mobile Apps: Stress-Free Solutions with InterBaseEmbeddable Databases for Mobile Apps: Stress-Free Solutions with InterBase
Embeddable Databases for Mobile Apps: Stress-Free Solutions with InterBase
 
Rad Server Industry Template - Connected Nurses Station - Setup Document
Rad Server Industry Template - Connected Nurses Station - Setup DocumentRad Server Industry Template - Connected Nurses Station - Setup Document
Rad Server Industry Template - Connected Nurses Station - Setup Document
 
TMS Google Mapping Components
TMS Google Mapping ComponentsTMS Google Mapping Components
TMS Google Mapping Components
 
Move Desktop Apps to the Cloud - RollApp & Embarcadero webinar
Move Desktop Apps to the Cloud - RollApp & Embarcadero webinarMove Desktop Apps to the Cloud - RollApp & Embarcadero webinar
Move Desktop Apps to the Cloud - RollApp & Embarcadero webinar
 
Useful C++ Features You Should be Using
Useful C++ Features You Should be UsingUseful C++ Features You Should be Using
Useful C++ Features You Should be Using
 
Getting Started Building Mobile Applications for iOS and Android
Getting Started Building Mobile Applications for iOS and AndroidGetting Started Building Mobile Applications for iOS and Android
Getting Started Building Mobile Applications for iOS and Android
 
Embarcadero RAD server Launch Webinar
Embarcadero RAD server Launch WebinarEmbarcadero RAD server Launch Webinar
Embarcadero RAD server Launch Webinar
 
ER/Studio 2016: Build a Business-Driven Data Architecture
ER/Studio 2016: Build a Business-Driven Data ArchitectureER/Studio 2016: Build a Business-Driven Data Architecture
ER/Studio 2016: Build a Business-Driven Data Architecture
 
The Secrets of SQL Server: Database Worst Practices
The Secrets of SQL Server: Database Worst PracticesThe Secrets of SQL Server: Database Worst Practices
The Secrets of SQL Server: Database Worst Practices
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data Assets
 
Troubleshooting Plan Changes with Query Store in SQL Server 2016
Troubleshooting Plan Changes with Query Store in SQL Server 2016Troubleshooting Plan Changes with Query Store in SQL Server 2016
Troubleshooting Plan Changes with Query Store in SQL Server 2016
 
Great Scott! Dealing with New Datatypes
Great Scott! Dealing with New DatatypesGreat Scott! Dealing with New Datatypes
Great Scott! Dealing with New Datatypes
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
What's New in DBArtisan and Rapid SQL 2016
What's New in DBArtisan and Rapid SQL 2016What's New in DBArtisan and Rapid SQL 2016
What's New in DBArtisan and Rapid SQL 2016
 
Is This Really a SAN Problem? Understanding the Performance of Your IO Subsy...
Is This Really a SAN Problem? Understanding the Performance of  Your IO Subsy...Is This Really a SAN Problem? Understanding the Performance of  Your IO Subsy...
Is This Really a SAN Problem? Understanding the Performance of Your IO Subsy...
 
RAD Studio, Delphi and C++Builder 10 Feature Matrix
RAD Studio, Delphi and C++Builder 10 Feature MatrixRAD Studio, Delphi and C++Builder 10 Feature Matrix
RAD Studio, Delphi and C++Builder 10 Feature Matrix
 
7 Dangerous Myths DBAs Believe about Data Modeling
7 Dangerous Myths DBAs Believe about Data Modeling7 Dangerous Myths DBAs Believe about Data Modeling
7 Dangerous Myths DBAs Believe about Data Modeling
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Why Data Warehouse Projects Fail

  • 1. Tech Notes Why Data Warehouse Projects Fail Using Schema Examination Tools to Ensure Information Quality, Schema Compliance, and Project Success Embarcadero Technologies January 2008 Corporate Headquarters EMEA Headquarters Asia-Pacific Headquarters 100 California Street, 12th York House L7. 313 La Trobe Street Floor 18 York Road Melbourne VIC 3000 San Francisco, California Maidenhead, Berkshire Australia 94111 SL6 1SF, United Kingdom
  • 2. Why Data Warehouse Projects Fail According to a 2003 Gartner report, more than 50 percent of data warehouse projects failed, and the ones that survived were delivered very late with extremely high costs. In a 2007 study, Gartner predicted once more that 50 percent of data warehouse projects would have limited acceptance or be outright failures as a result of lack of attention to data quality issues. DATA QUALITY IS NOT ENOUGH Data Quality is one of the hottest topics in any IT shop. Although very important, Data Quality is far from being enough because decisions are based on information, not on data. Having quality data does not assure quality information. To have quality information, it is necessary to have quality data, but this is not sufficient on its own. We need more. IT IS ALL ABOUT THE DATABASE SCHEMA Information is produced by an application program that accesses data in a database, usually a relational database such as Oracle, DB2, Sybase, SQL Server, etc. The core of the database is the database schema, wherein are stored all the data definitions, the relationships between the data, and the business rules. The quality of the information depends on 3 things: (1) the quality of the data itself, (2) the quality of the application programs and (3) the quality of the database schema. Joe Celko (www.celko.com), a very well known expert and consultant in relational technology, states that without a quality database schema, it is very difficult to: • Achieve good program performance and • Deliver quality information When developing any database application, we must always ensure the database schema has integrity and consistency – and no flaws. This must be done when the schema is created, and every time it is changed. If the database schema has flaws, the information will be flawed and the Data Warehouse projects will fail. MODELING TOOLS ARE NOT ENOUGH Database schemas are normally created using a modeling tool such as ERwin®, ER/Studio®, or PowerDesigner®. These tools validate the data model for completeness of the model, but they do not have the intelligence to “debug” the data model. QUALITY OF FEEDER SYSTEMS Data Warehouse projects depend on feeder systems. If the database schemas of the feeder systems have flaws, the information produced by the data warehouse will not have quality. This is the major reason why data warehouse projects fail. The database Embarcadero Technologies -1-
  • 3. Why Data Warehouse Projects Fail schemas of the feeder systems must be validated for consistency, integrity and compliance to the rules of the relational technology before a data warehouse project is initiated. This is where Embarcadero® Schema Examiner™ comes in. SCHEMA EXAMINER Schema Examiner was created to fill this gap, providing a means to “debug” the schema. Schema Examiner provides over 50 diagnostics to assure the schema adheres to the relational model, is consistent and has integrity. Schema Examiner can validate the data model, a set of SQL/DDL scripts or the database schema directly. Schema Examiner can also compare schemas, indicating the differences. Manual validation is impossible due to the size and complexity of today’s database schemas. A SUCCESSFUL DATA WAREHOUSE PROJECT A corporation in the telecom business contracted with one of the major consulting companies to develop a large data warehouse project. The cost of the project was $10 Embarcadero Technologies -2-
  • 4. Why Data Warehouse Projects Fail million. After the project was in production, they discovered that the quality of the information was not good; many answers were inconsistent. They considered to re-do the entire project or even scrap it. Their committee suggested the hiring of a consultant. The consultant used Schema Examiner and after a couple of weeks of analyzing the feeder systems, he made several suggestions based upon the findings of Schema Examiner. Once the recommendations were adopted, the results improved dramatically and the project was a total success. The client has stated that the success of the project was due to the use of Schema Examiner. They immediately purchased an enterprise license and made it mandatory to use Schema Examiner in all their IT projects, internal or external. The project used 4 Oracle feeder system that were “debugged” using Schema Examiner; they were also compared to each other to discover inconsistencies. Below is a graphical representation of the project. SUMMARY • About half of all Data Warehouse projects fail due to poor data quality (Gartner Group) • Data Quality is not enough - decisions are based on information quality, not on data quality • A flawed schema impacts negatively on information quality • Database schemas must be validated for compliance with the rules of relational technology • Modeling tools validate data models / schemas for completeness, not for compliance • Data Warehouse feeder systems schemas must be validated for compliance • Schema Examiner validates schemas from feeder systems and compares them to verify inconsistencies Embarcadero Technologies -3-
  • 5. Embarcadero Technologies, Inc. is a leading provider of award-winning tools for application developers and database professionals so they can design systems right, build them faster and run them better, regardless of their platform or programming language. Ninety of the Fortune 100 and an active community of more than three million users worldwide rely on Embarcadero products to increase productivity, reduce costs, simplify change management and compliance and accelerate innovation. The company’s flagship tools include: Embarcadero® Change Manager™, CodeGear™ RAD Studio, DBArtisan®, Delphi®, ER/Studio®, JBuilder® and Rapid SQL®. Founded in 1993, Embarcadero is headquartered in San Francisco, with offices located around the world. Embarcadero is online at www.embarcadero.com.