SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Data Quality – “Are We There Yet?”

         August 17, 2011
                  Presented
                  By
                  Arvind Mattoo, CBIP
Data Quality

• Data Quality – Explained
• Data Quality – CEO’s Concern
• Data Quality – CIO’s Nightmare
• Data Quality – PM’s Approach
• Data Quality – IT’s Deliverable


                                    2
Data Quality – Dimensions
    Process Dimension                        Business Dimension
•     Accessible                         •     Relevant
•     Consistent                         •     Existent
•     Complete                           •     Reliable
•     Lineage                            •     Reportable
•     Controllable                       •     Compliant
•     Secure                             •     Measurable



                          Data Quality
                             FACT

    Technical Dimension                       Time Dimension
•     Accurate
•     Integral                           •     Currency
•     Unique                             •     Timeliness
•     Valid                              •     Historical
•     Secure



                                                            3
Dimension – Business

Relevant:         Does it Map to our Requirements?
Existent:         Do we Own it?
Reliable:         Can we Trust it?
Reportable:       Can we Visualize it?
Compliance:       Is it Mandated?
Measurable:       Can we Baseline it?




                                                     4
Dimension – Process

Accessible:        Can I Get it?
Consistent:        Can I Standardize it?
Complete:          Does it Encompass Usability?
Lineage:           Can we Trace it?
Controllable:      Can we Discipline it?
Secure:            Can we Trust it?




                                                  5
Dimension – Technical

Accurate:       To what Degree does it Jive?
Integral:       Does it Comply Structurally?
Unique:         To what extent is it De-Duped?
Valid:          Does it Conform by the Rules?
Secure:         To what Level is it Secured?




                                                 6
Dimension – Time

Currency:       To what Degree is it Current?
Timeliness:     How Readily is it Available?
Historical:     How far back can we Audit?




                                                7
Data Quality – CEO’s Concern

•   Lack of Strategic Information Capabilities
•   Quality of Decision Making
•   Lack of Visibility
•   Loss of Opportunities
•   Increasing IT Expenditures
•   Diminishing Rate of Return
•   Lack of Collaboration

                                                 8
Data Quality – CIO’s Nightmare

•   How did we get into this mess?
•   How does it impact our business?
•   Are we the only one?
•   How do we get out of this?
•   How do we sustain it?
•   Are we there yet?



                                       9
Data Quality – As We Speak!

• Data Misused: Not Authorized

• Data Abused:     Not Qualified

• Data Confused: Not Clarified

• Data Refused:    Not Ratified

• Data Diffused:   Not Archived

                                    10
How did we get into this mess?
 Business                          Technical
   • Mergers                          • Conversion

   • Acquisitions                     • Manual Data Feeds

   • Expansions                       • Lack of Automation

   • Diversification                  • System Upgrades

   • Regulatory                       • Consolidation

   • Lack of Ownership                • Insufficient DQ Rules

   • Business Process Changes         • System Errors

   • Lack of Executive Awareness      • Source System Changes

   • Lack of Training                 • Lack of Expertise


                                                                11
How does it impact our business?

              CEO                              CIO
•   Reputation at Stake         •   Time to Reconcile Data
•   Lower Quality of Service    •   Delay in New System Deployment
•   Customer dissatisfaction    •   Poor System Performance
•   Loss of Motivation          •   Loss of Credibility
•   Compliance Issues           •   Downstream System Data Issues
•   Expectations not met        •   No Single Version of Truth




                          Surging Cost

                                                                12
Are we the only one?




                       13
How Bad is it?




                 14
Who is Controlling Whom?




                           15
How do we get out of this?



• Data Quality – PM’s Approach


• Data Quality – IT’s Deliverables




                                     16
Data Quality – PM’s Approach
              Methodology
               • Assess/Profile Data
               • Define Baseline
               • Define Metrics and Targets
               • Define and Build Data Quality Rules
               • Enforce Data Standards across Board
               • Monitor Data Quality against Targets
               • Review Exceptions and Gaps
               • Cataloguing Errors
               • Refine Data Quality Rules
               • Manage Data Quality against Targets
               • Automate Data Quality Process
               • Fine Tuning Data Quality Rules

                                                   17
Data Quality – PM’s Approach

                 Governance Team

                 • Governance Committee
                 • Data Stewards
                 • Business SME
                 • Business Analysts
                 • Technology SME
                 • Process SME



                                       18
Data Quality – PM’s Approach
               Technology
               • Data Profiler
               • CRM
               • Data Warehouse
               • Master Data Management
               • ETL/ELT
               • CASE
               • Custom Data Integration
               • Master Data Integration

                                           19
Data Quality – IT’s Deliverables
                   Establish Data Quality Rules
                      •   Referential Integrity Rules
                      •   Attribute Rules
                      •   Attribute Domain Rules
                      •   Attribute Dependency Rules
                      •   Historical Data Rules
                      •   State-Dependent Rules
                   Cataloguing Errors
                      • Error Tracking
                      • Error Notifications/Alerts
                   Score carding
                      • Record Level
                      • Domain Level


                                                     20
How do we Sustain over time?

• Follow Data Quality Framework
• Profile Data consistently
• Update Rule Based Engine Frequently
• Exploit Embedded DQ Functions/Solutions
• Adopt Proactive Approach
• Establish Stewardship
• Practice DQ Governance
                                            21
Data Quality – Are We There Yet?

• Accessible   • Accurate

• Relevant     • Consistent

• Reliable     • Complete

• Reportable   • Secured

• Compliant    • Integral




                                      22
Data Quality – Are We There Yet?



Not really!

Data Quality is an iterative process…




                                        23

Weitere ähnliche Inhalte

Ähnlich wie Data Quality - Are We There Yet?

Davide Hanan
Davide HananDavide Hanan
Davide Hananabneru
 
Defence IT 2012 - Data Quality and Financial Services - Solvency II
Defence IT 2012 - Data Quality and Financial Services - Solvency IIDefence IT 2012 - Data Quality and Financial Services - Solvency II
Defence IT 2012 - Data Quality and Financial Services - Solvency IIDavid Twaddell
 
Establishing a Strategy for Data Quality
Establishing a Strategy for Data QualityEstablishing a Strategy for Data Quality
Establishing a Strategy for Data QualityDatabase Answers Ltd.
 
Akili Oil & Gas Data Practice - PPDM
Akili Oil & Gas Data Practice - PPDMAkili Oil & Gas Data Practice - PPDM
Akili Oil & Gas Data Practice - PPDMrnaramore
 
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...Ray Mcglew
 
Data Governance And Technology Enablement First San Francisco Partners 2009
Data Governance And Technology Enablement   First San Francisco Partners  2009Data Governance And Technology Enablement   First San Francisco Partners  2009
Data Governance And Technology Enablement First San Francisco Partners 2009First San Francisco Partners
 
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAOAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAAlex Fiteni
 
Retail and Wholesale Consumer Centric Merchandising
Retail and Wholesale Consumer Centric MerchandisingRetail and Wholesale Consumer Centric Merchandising
Retail and Wholesale Consumer Centric MerchandisingDave DeBonis
 
Pragmatics Driven Issues in Data and Process Integrity in Enterprises
Pragmatics Driven Issues in Data and Process Integrity in EnterprisesPragmatics Driven Issues in Data and Process Integrity in Enterprises
Pragmatics Driven Issues in Data and Process Integrity in EnterprisesAmit Sheth
 
The New Age Data Quality
The New Age Data QualityThe New Age Data Quality
The New Age Data QualityRanjeet202050
 
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)Solving the Credit Union 'Tower of Babel' (Conference Session Slides)
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)NAFCU Services Corporation
 
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationIOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationDavid Walker
 
Agile Base Camp - Agile metrics
Agile Base Camp - Agile metricsAgile Base Camp - Agile metrics
Agile Base Camp - Agile metricsSerge Kovaleff
 
Ladies Be Architects - Study Group II: Data Governance
Ladies Be Architects - Study Group II: Data GovernanceLadies Be Architects - Study Group II: Data Governance
Ladies Be Architects - Study Group II: Data Governancegemziebeth
 
Miesterdisplay Technologies
Miesterdisplay TechnologiesMiesterdisplay Technologies
Miesterdisplay TechnologiesSrijeet Mishra
 
Miesterdisplay Technologies
Miesterdisplay TechnologiesMiesterdisplay Technologies
Miesterdisplay TechnologiesSrijeet Mishra
 

Ähnlich wie Data Quality - Are We There Yet? (20)

Davide Hanan
Davide HananDavide Hanan
Davide Hanan
 
Defence IT 2012 - Data Quality and Financial Services - Solvency II
Defence IT 2012 - Data Quality and Financial Services - Solvency IIDefence IT 2012 - Data Quality and Financial Services - Solvency II
Defence IT 2012 - Data Quality and Financial Services - Solvency II
 
Establishing a Strategy for Data Quality
Establishing a Strategy for Data QualityEstablishing a Strategy for Data Quality
Establishing a Strategy for Data Quality
 
Akili Oil & Gas Data Practice - PPDM
Akili Oil & Gas Data Practice - PPDMAkili Oil & Gas Data Practice - PPDM
Akili Oil & Gas Data Practice - PPDM
 
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
 
Data Governance And Technology Enablement First San Francisco Partners 2009
Data Governance And Technology Enablement   First San Francisco Partners  2009Data Governance And Technology Enablement   First San Francisco Partners  2009
Data Governance And Technology Enablement First San Francisco Partners 2009
 
Tatiana Stebakova
Tatiana StebakovaTatiana Stebakova
Tatiana Stebakova
 
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAOAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
 
Retail and Wholesale Consumer Centric Merchandising
Retail and Wholesale Consumer Centric MerchandisingRetail and Wholesale Consumer Centric Merchandising
Retail and Wholesale Consumer Centric Merchandising
 
Pragmatics Driven Issues in Data and Process Integrity in Enterprises
Pragmatics Driven Issues in Data and Process Integrity in EnterprisesPragmatics Driven Issues in Data and Process Integrity in Enterprises
Pragmatics Driven Issues in Data and Process Integrity in Enterprises
 
The New Age Data Quality
The New Age Data QualityThe New Age Data Quality
The New Age Data Quality
 
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)Solving the Credit Union 'Tower of Babel' (Conference Session Slides)
Solving the Credit Union 'Tower of Babel' (Conference Session Slides)
 
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationIOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
 
Agile Base Camp - Agile metrics
Agile Base Camp - Agile metricsAgile Base Camp - Agile metrics
Agile Base Camp - Agile metrics
 
Ladies Be Architects - Study Group II: Data Governance
Ladies Be Architects - Study Group II: Data GovernanceLadies Be Architects - Study Group II: Data Governance
Ladies Be Architects - Study Group II: Data Governance
 
Security audit
Security auditSecurity audit
Security audit
 
Security Audit
Security AuditSecurity Audit
Security Audit
 
32 cc 3_a_l-drumheller
32 cc 3_a_l-drumheller32 cc 3_a_l-drumheller
32 cc 3_a_l-drumheller
 
Miesterdisplay Technologies
Miesterdisplay TechnologiesMiesterdisplay Technologies
Miesterdisplay Technologies
 
Miesterdisplay Technologies
Miesterdisplay TechnologiesMiesterdisplay Technologies
Miesterdisplay Technologies
 

Mehr von dmurph4

Insurance Data & Analytics Summit
Insurance Data & Analytics SummitInsurance Data & Analytics Summit
Insurance Data & Analytics Summitdmurph4
 
Metadata Use Cases
Metadata Use CasesMetadata Use Cases
Metadata Use Casesdmurph4
 
UML and Data Modeling - A Reconciliation
UML and Data Modeling - A ReconciliationUML and Data Modeling - A Reconciliation
UML and Data Modeling - A Reconciliationdmurph4
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Usedmurph4
 
Dama Chicago June 2012 Newsletter
Dama Chicago June 2012 NewsletterDama Chicago June 2012 Newsletter
Dama Chicago June 2012 Newsletterdmurph4
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analyticsdmurph4
 
Mergers & Acquisitions
Mergers & AcquisitionsMergers & Acquisitions
Mergers & Acquisitionsdmurph4
 
Dama chicago newsletter_2012_issue_1
Dama chicago newsletter_2012_issue_1Dama chicago newsletter_2012_issue_1
Dama chicago newsletter_2012_issue_1dmurph4
 
2012 February dama chicago
2012 February dama chicago2012 February dama chicago
2012 February dama chicagodmurph4
 
Sample Dama Newsletter
Sample Dama NewsletterSample Dama Newsletter
Sample Dama Newsletterdmurph4
 
Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratchdmurph4
 

Mehr von dmurph4 (11)

Insurance Data & Analytics Summit
Insurance Data & Analytics SummitInsurance Data & Analytics Summit
Insurance Data & Analytics Summit
 
Metadata Use Cases
Metadata Use CasesMetadata Use Cases
Metadata Use Cases
 
UML and Data Modeling - A Reconciliation
UML and Data Modeling - A ReconciliationUML and Data Modeling - A Reconciliation
UML and Data Modeling - A Reconciliation
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Use
 
Dama Chicago June 2012 Newsletter
Dama Chicago June 2012 NewsletterDama Chicago June 2012 Newsletter
Dama Chicago June 2012 Newsletter
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
 
Mergers & Acquisitions
Mergers & AcquisitionsMergers & Acquisitions
Mergers & Acquisitions
 
Dama chicago newsletter_2012_issue_1
Dama chicago newsletter_2012_issue_1Dama chicago newsletter_2012_issue_1
Dama chicago newsletter_2012_issue_1
 
2012 February dama chicago
2012 February dama chicago2012 February dama chicago
2012 February dama chicago
 
Sample Dama Newsletter
Sample Dama NewsletterSample Dama Newsletter
Sample Dama Newsletter
 
Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratch
 

Kürzlich hochgeladen

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 

Kürzlich hochgeladen (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 

Data Quality - Are We There Yet?

  • 1. Data Quality – “Are We There Yet?” August 17, 2011 Presented By Arvind Mattoo, CBIP
  • 2. Data Quality • Data Quality – Explained • Data Quality – CEO’s Concern • Data Quality – CIO’s Nightmare • Data Quality – PM’s Approach • Data Quality – IT’s Deliverable 2
  • 3. Data Quality – Dimensions Process Dimension Business Dimension • Accessible • Relevant • Consistent • Existent • Complete • Reliable • Lineage • Reportable • Controllable • Compliant • Secure • Measurable Data Quality FACT Technical Dimension Time Dimension • Accurate • Integral • Currency • Unique • Timeliness • Valid • Historical • Secure 3
  • 4. Dimension – Business Relevant: Does it Map to our Requirements? Existent: Do we Own it? Reliable: Can we Trust it? Reportable: Can we Visualize it? Compliance: Is it Mandated? Measurable: Can we Baseline it? 4
  • 5. Dimension – Process Accessible: Can I Get it? Consistent: Can I Standardize it? Complete: Does it Encompass Usability? Lineage: Can we Trace it? Controllable: Can we Discipline it? Secure: Can we Trust it? 5
  • 6. Dimension – Technical Accurate: To what Degree does it Jive? Integral: Does it Comply Structurally? Unique: To what extent is it De-Duped? Valid: Does it Conform by the Rules? Secure: To what Level is it Secured? 6
  • 7. Dimension – Time Currency: To what Degree is it Current? Timeliness: How Readily is it Available? Historical: How far back can we Audit? 7
  • 8. Data Quality – CEO’s Concern • Lack of Strategic Information Capabilities • Quality of Decision Making • Lack of Visibility • Loss of Opportunities • Increasing IT Expenditures • Diminishing Rate of Return • Lack of Collaboration 8
  • 9. Data Quality – CIO’s Nightmare • How did we get into this mess? • How does it impact our business? • Are we the only one? • How do we get out of this? • How do we sustain it? • Are we there yet? 9
  • 10. Data Quality – As We Speak! • Data Misused: Not Authorized • Data Abused: Not Qualified • Data Confused: Not Clarified • Data Refused: Not Ratified • Data Diffused: Not Archived 10
  • 11. How did we get into this mess?  Business  Technical • Mergers • Conversion • Acquisitions • Manual Data Feeds • Expansions • Lack of Automation • Diversification • System Upgrades • Regulatory • Consolidation • Lack of Ownership • Insufficient DQ Rules • Business Process Changes • System Errors • Lack of Executive Awareness • Source System Changes • Lack of Training • Lack of Expertise 11
  • 12. How does it impact our business? CEO CIO • Reputation at Stake • Time to Reconcile Data • Lower Quality of Service • Delay in New System Deployment • Customer dissatisfaction • Poor System Performance • Loss of Motivation • Loss of Credibility • Compliance Issues • Downstream System Data Issues • Expectations not met • No Single Version of Truth Surging Cost 12
  • 13. Are we the only one? 13
  • 14. How Bad is it? 14
  • 15. Who is Controlling Whom? 15
  • 16. How do we get out of this? • Data Quality – PM’s Approach • Data Quality – IT’s Deliverables 16
  • 17. Data Quality – PM’s Approach Methodology • Assess/Profile Data • Define Baseline • Define Metrics and Targets • Define and Build Data Quality Rules • Enforce Data Standards across Board • Monitor Data Quality against Targets • Review Exceptions and Gaps • Cataloguing Errors • Refine Data Quality Rules • Manage Data Quality against Targets • Automate Data Quality Process • Fine Tuning Data Quality Rules 17
  • 18. Data Quality – PM’s Approach Governance Team • Governance Committee • Data Stewards • Business SME • Business Analysts • Technology SME • Process SME 18
  • 19. Data Quality – PM’s Approach Technology • Data Profiler • CRM • Data Warehouse • Master Data Management • ETL/ELT • CASE • Custom Data Integration • Master Data Integration 19
  • 20. Data Quality – IT’s Deliverables Establish Data Quality Rules • Referential Integrity Rules • Attribute Rules • Attribute Domain Rules • Attribute Dependency Rules • Historical Data Rules • State-Dependent Rules Cataloguing Errors • Error Tracking • Error Notifications/Alerts Score carding • Record Level • Domain Level 20
  • 21. How do we Sustain over time? • Follow Data Quality Framework • Profile Data consistently • Update Rule Based Engine Frequently • Exploit Embedded DQ Functions/Solutions • Adopt Proactive Approach • Establish Stewardship • Practice DQ Governance 21
  • 22. Data Quality – Are We There Yet? • Accessible • Accurate • Relevant • Consistent • Reliable • Complete • Reportable • Secured • Compliant • Integral 22
  • 23. Data Quality – Are We There Yet? Not really! Data Quality is an iterative process… 23