SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
Copyright Global Data Strategy, Ltd. 2022
Data Quality Best Practices
Donna Burbank and Nigel Turner
Global Data Strategy, Ltd.
August 25th, 2022
Follow on Twitter @donnaburbank, @nigelturner8
@GlobalDataStrat
Twitter Event hashtag: #DAStrategies
1
Global Data Strategy, Ltd. 2022
Donna Burbank
2
• Recognized industry expert in information
management with over 25 years of
experience in data strategy, information
management, data modeling, metadata
management, and enterprise architecture
• Managing Director at Global Data Strategy,
Ltd., an international information
management consulting company that
specializes in the alignment of business
drivers with data-centric technology
• Worked with dozens of Fortune 500
companies worldwide in the Americas,
Europe, Asia, and Africa and speaks
regularly at industry conferences
• Excellence in Data Management Award
from DAMA International
• Past President and Advisor to the DAMA
Rocky Mountain chapter
• Co-author of several books on data
management
• Regular contributor to industry
publications
• She can be reached at
donna.burbank@globaldatastrategy.com
Donna is based in Boulder, Colorado, US
Follow on Twitter @donnaburbank
@GlobalDataStrat
Global Data Strategy, Ltd. 2022
• Worked in Information Management
(IM) and related areas for over 30
years. Experience has embraced Data
Governance, Information Strategy,
Data Quality, Data Governance, Master
Data Management & Business
Intelligence.
• Principal Consultant, EMEA, for Global
Data Strategy, Ltd.
• Spent much of his career in British
Telecommunications Group (BT)
where he led a series of enterprise-
wide IM & data governance initiatives.
• Also been VP of Information
Management Strategy at Harte Hanks
Trillium Software, and Principal
Consultant at FromHereOn and IPL.
• Nigel is very active in professional Data
Management organizations and is an
elected Data Management Association
(DAMA) UK Committee member.
• He was the joint winner of DAMA
International’s 2015 Community Award
for the work he initiated and led in
setting up a mentoring scheme in the
UK where experienced DAMA
professionals coach and support newer
data management professionals.
• Nigel is based in Cardiff, Wales, UK.
Follow on Twitter @NigelTurner8
Today’s hashtag: # DAStrategies
Nigel Turner
3
Global Data Strategy, Ltd. 2022
DATAVERSITY Data Architecture Strategies
• January Emerging Trends in Data Architecture – What’s the Next Big Thing?
• February Building a Data Strategy - Practical Steps for Aligning with Business Goals
• March Master Data Management – Aligning Data, Process, and Governance
• April Data Governance & Data Architecture: Alignment & Synergies
• May Improving Data Literacy Around Data Architecture
• June Business Intelligence & Data Analytics: An Architected Approach
• July Best Practices in Metadata Management
• August Data Quality Best Practices – with special guest Nigel Turner
• September Business-centric Data Modeling
• October Graph Databases: Benefits & Risks
• December Enterprise Architecture vs. Data Architecture
4
This Year’s Lineup
Global Data Strategy, Ltd. 2022 5
What We’ll Cover Today – Our Agenda
• Explore the Data Management Lifecycle and why data
quality management is a ‘must have’ across all stages
of the lifecycle
• Outline specific data quality emphases and approaches
that support data quality in specific stages of the Data
Management Lifecycle
Global Data Strategy, Ltd. 2022 6
A Successful Data Strategy links Business Goals with Technology Solutions
“Top-Down” alignment with
business priorities
“Bottom-Up” management &
inventory of data sources
Managing the people, process,
policies & culture around data
Coordinating & integrating
disparate data sources
Leveraging & managing data for
strategic advantage
Data Quality is Part of a Wider Data Strategy
www.globaldatastrategy.com
Global Data Strategy, Ltd. 2022
Data Quality Remains a Growing Priority
7
0% 10% 20% 30% 40% 50% 60% 70%
Data Strategy
Data Quality
Data Architecture
Data Governance
Data Security
Business Intelligence & Reporting
Data Warehouse
0% 5% 10% 15% 20% 25% 30% 35% 40% 45%
Data Architecture
Metadata Management
Master Data Management
Data Quality
Self-Service Reporting & Analytics
Data Strategy
Data Governance
Data Priorities in
Place for 2021
Future Data
Priorities
in 2022-2023
Data Quality is #6
Data Quality is #4
From the 2020 DATAVERSITY survey on “Trends in Data Management”, by Donna Burbank & Michelle Knight
Available for download at: https://globaldatastrategy.com/resources/white-papers/
Global Data Strategy, Ltd. 2022
Primary Data Quality Challenges
8
Headline Findings:
• Data quality challenges remain holistic, embracing
people, process and technology issues
• Business challenges include culture change, skills,
training, senior management buy in etc.
• Technical challenges include scaling, handling new
data types, and managing data quality across
multiple platforms and sources
• Challenges embrace all stages of the data lifecycle,
from data creation to data usage and publication
Source: 2022 State of Data Quality, TDWI, James Kobielus
Available for download at https://globaldatastrategy.com/resources/white-papers/
Global Data Strategy, Ltd. 2022
Data Quality: the 1:10:100 Principle
9
$1
$10
$100
Cost of Prevention
Cost of Failure
Cost of Correction
The 1:10:100 Principle
It costs exponentially more
to identify and correct data
quality problems and data
errors the later they are
identified and addressed
Global Data Strategy, Ltd. 2022
Customer touch points
Wrong address
X
The 1:10:100 Principle – A Use Case from British
Telecommunications (BT) • Incorrect or incomplete addresses input on
data entry led to huge costs of failure,
impacting:
• Network connection
• Customer equipment installation
• Customer billing
• Fault repair
• Customer marketing
• Electronic directory enquiries and paper
directories
• Emergency services (999 / 911)
• All of these resulted in failures at different
customer touch points
• Conclusion:
• Minimize data entry errors!
• Any costs incurred in enhancing address
data entry (e.g. business process
changes, drop down menus, MDM etc.)
insignificant when compared to
downstream failure costs
10
Global Data Strategy, Ltd. 2022
How This Led to Bigger Data Quality Successes in BT
11
• Address improvement became the first major project in
a data quality improvement program
• The program eventually ran for 10 years – adapting and
evolving to reflect BT’s transforming business
• Over 75 Data Quality improvement projects completed
• Projects ranged from tactical data cleanses to strategic
enterprise wide projects, e.g. Customer MDM (including
address)
• Benefits derived from process efficiencies, capital
expenditure avoidance, better asset utilization, revenue
recovery etc.
• Total audited bottom line benefits exceeded $1 billion +
• Praised by Gartner, Forrester, OVUM, Tom Redman and
others for its business alignment & rapid delivery cycles
Global Data Strategy, Ltd. 2022
Data Management Lifecycle: What Is It?
• Most living or manufactured objects have a natural lifecycle; as
an asset, data is no different
• Data is:
• Created
• Has a lifespan
• Ultimately reduces in value as its currency declines
• Is eventually discarded or disposed of
• A Data Management Lifecycle can be defined as:
A way of describing the different stages data will go through from
design and collection to dissemination and archival / destruction’
(Source: UK Government Data Quality Framework 2020)
• Organizations creating, storing, using and publishing data need to
be aware of the data lifecycle and plan to manage data
throughout its lifecycle
• How data is managed will vary according to its lifecycle stage
12
Global Data Strategy, Ltd. 2022
Data Management Lifecycle
13
PLAN
COLLECT /
ACQUIRE /
INGEST
PREPARE /
STORE /
MAINTAIN
USE AND
PROCESS
SHARE
AND
PUBLISH
ARCHIVE
OR
DESTROY
Sources:
• UK Government Data
Quality Framework
• DAMA Data
Management Book of
Knowledge (DMBoK)
Data
Management
Lifecycle
Stage 1
Stage 5
Stage 4 Stage 3
Stage 2
Stage 6
Global Data Strategy, Ltd. 2022
Data Management Lifecycle and Data Lineage
14
Data Management Lifecycle
Data Lineage
‘Data lineage… helps to determine the
data provenance for your organization. It
can provide an ongoing and continuously
updated record of where a data asset
originates, how it moves through the
organization, how it gets transformed,
where it’s stored, who accesses it and
other key metadata.’
Source: Informatica
‘A way of describing the different
stages data will go through from
design and collection to dissemination
and archival / destruction.’
Source: UK Government Data Quality
Framework 2020
The concepts of Data
Management Lifecycle and Data
Lineage overlap in that both:
• Emphasize that data is
managed through a sequence
of processes and activities
• Describe the way data changes
as it passes through the
sequence
But differ in that:
• Lifecycle Management is more
business focused as it
describes the overall maturity
of and use of data from its
creation to disposal
• Data Lineage is more
technically focused as it
describes the physical flow of
data as it moves from its point
of origin to its point of usage
Global Data Strategy, Ltd. 2022
Data Lifecycle and the 1:10:100 Principle
15
PLAN
COLLECT /
ACQUIRE
/ INGEST
PREPARE
/ STORE /
MAINTAIN
USE AND
PROCESS
SHARE
AND
PUBLISH
ARCHIVE
OR
DESTROY
1 10 100
Cost to correct data errors and problems
Data Management Lifecycle emphasis
Prevention Correction Cost of Failure Remediation
Global Data Strategy, Ltd. 2022
Stage 1: PLAN
16
PLAN
PURPOSE OF STAGE
KEY STAGE ACTIVITIES
DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
• Unclear ownership and
accountability for new data
• Potential data duplication
• Data fitness for purpose
• Lack of data definitions and
standards (format & content)
• Impact on downstream data
processing
• Poor design of data collection /
acquisition / ingestion
• Identify business needs &
objectives
• Investigate if data already exists
• Develop a business case
• Produce requirements
specification, including business
rules
• Determine data provider(s) –
internal / external (Open &
Propriety)
• Design data creation / ingestion
processes
• Conduct impact analysis
• Appoint business data owner and
data steward to ensure early
accountability
• Engage key data stakeholders to
establish ‘ fitness for purpose’
• Investigate current data sources to
ensure existing sources not available
• Profile potential data sources to
quantify baseline data quality
• Define required data definitions &
standards & agree supplier SLA
• Consider data validation methods
and rules, e.g. drop down lists
Make the business case for and
create a plan for a new data
source (internal and / or
external)
Global Data Strategy, Ltd. 2022
Stage 2: COLLECT / ACQUIRE / INGEST
17
PURPOSE OF STAGE
KEY STAGE ACTIVITIES
DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
COLLECT /
ACQUIRE /
INGEST
Design & implement data
entry / acquisition processes
and templates
• Specify and design optimum
data entry types
▪ Users
▪ Devices
▪ Applications
▪ IoT etc.
• Determine target platform(s)
(existing or new)
• Design and implement source
to target mapping
• Lack of data source metadata
• Errors in manual data entry
• Inconsistent data formats &
content
• Incomplete or inaccurate data
• Create data model(s)
• Populate business glossary / data
catalog
• Design detailed DQ rules and
enforce on data entry / ingestion
• Implement regular source data
monitoring
• Establish data quality measures
and dashboard
Global Data Strategy, Ltd. 2022
Stage 3: PREPARE / STORE /MAINTAIN
18
PURPOSE OF STAGE
KEY STAGE ACTIVITIES
DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
PREPARE /
STORE /
MAINTAIN
Implement a stable policy
driven technical environment
to store and maintain data
• Define and implement
storage policies (e.g.
encryption, compression
etc.)
• Publish data security & data
privacy policies
• Design & implement ETL /
ELT processes
• Integrate data into existing
data stores or new data
stores
• Inappropriate data access and
uses
• Multiple data consumer needs
and requirements
• Data transformation (source to
target): too often DQ is first
considered here
• Metadata management
• Ongoing data quality validation
& reporting
• Implement role based access and
usage to data
• Embed predefined DQ rules in
ETL / ELT routines – Implement
DQ rules engine
• Implement DQ dashboards
• Actively steward the data to
ensure all data consumer needs
met
• Automatic update of data catalog
Global Data Strategy, Ltd. 2022
Stage 4: USE & PROCESS
19
PURPOSE OF STAGE
KEY STAGE ACTIVITIES DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
USE AND
PROCESS
Data is stored and
processed in a stable
environment for use by
data consumers
• Schedule regular (real time
/ event or time driven)
data ingestion and storage
processes
• Actively maintain the data
within the technical
environment
• Monitor use and
adherence to data policies
• Lack of data quality error
monitoring and reporting
• Absence of data consumer
feedback channels to report
DQ problems
• Inaction on tackling data
quality issues
• Active data stewardship
• Generate data consumer DQ
reports & deploy feedback
workflows
• Consider MDM workflows
• Deploy DQ rules engine to
enforce DQ rules throughout
the lifespan of the data
Global Data Strategy, Ltd. 2022
Automating Data Quality Business Rules via a DQ Rules Engine
DATA
INPUT
DATA
WAREHOUSE
STAGING / ETL
LAYER
SOURCE
SYSTEMS
REPORTING
LAYER
DATA
MARTS
Real Time Data Validation
Batch
Validation
DATA QUALITY
RULES ENGINE
20
Global Data Strategy, Ltd. 2022
Use of Tools & Technology in Data Quality Management
21
Source: 2022 State of Data Quality, TDWI, James Kobielus
Available for download at https://globaldatastrategy.com/resources/white-papers/
Headline Findings:
• Tools and technologies are
increasingly being acquired
and used to support data
quality management
• Strong focus on Data
Preparation & Transformation
reinforce finding that
analytics and intelligence are
key drivers for better data
quality management
• Although only 16% currently
have an enterprise wide data
catalog, 45% intend
implementing one within the
next 12 months
Tools enable:
• Profiling
• Monitoring
• Parsing
• Standardizing
• Matching
• Merging
• Cleansing
• Correcting
• Enhancing
• Management of data
quality rules
• Data quality workflows
• Data quality reporting
Global Data Strategy, Ltd. 2022
Stage 5: SHARE & PUBLISH
22
PURPOSE OF STAGE
KEY STAGE ACTIVITIES
DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
SHARE
AND
PUBLISH
• Make data available to all
authorized consumers and
users
• Develop pre-canned reports
and / or self-service reporting
capabilities
• Generate and maintain user
metadata
Data consumers access and
modify the data to use in BI,
analytics, visualisation etc.
• Continuing data quality
issues – accuracy,
completeness, uniqueness,
consistency etc.
• Active management and
update of metadata
• Publish report catalog &
associated metadata
• Publish user focused DQ
dashboards
• Ensure DQ workflows and
feedback mechanisms are
actively managed and acted on
• Apply automated data catalog
update processes
Global Data Strategy, Ltd. 2022
Stage 6: ARCHIVE & DESTROY
23
PURPOSE OF STAGE
KEY STAGE ACTIVITIES
DATA QUALITY FOCUS
DATA QUALITY CHALLENGES
ARCHIVE
OR
DESTROY
To manage data that is no
longer required for current
operational or reporting
purposes
• Establish and operate data
archiving and / or
destruction policies and
processes where data is no
longer actively used because:
▪ It is time expired
▪ Legal or regulatory
constraints demand
action
▪ A specific project ends
(e.g. analytics or data
science sandpit)
• Difficulty in identifying data
to be archived or destroyed
• Ensuring data archived is
secure, tagged and
accessible in case of future
need
• Loss of potential knowledge
/ expertise of archived data
• Ensure data retention
policies and processes are in
place and enforced
• Apply data archive security &
privacy policies
• Ensure archived data is
actively stewarded and
maintained, including
metadata
Global Data Strategy, Ltd. 2022
Effective Data Lifecycle Management:
Data Quality Implications
• In order to minimize costs of failure in the data management lifecycle, the
most critical stages are Stage 1 (Plan) and Stage 2 (Collect / Ingest / Acquire)
• In these stages Data Quality by Design should be the key objective
• This will ensure:
• From the outset the data is fit for purpose throughout its lifecycle
• Potential data quality issues and problems can be identified early and
preventative actions taken before usage
• Delaying fixing issues to later stages will:
• Lead to operational inefficiencies and poor decision making
• Require more effort to re-engineer the data
• Increase the costs of remediation
• Each stage of the data management lifecycle will require different KPIs and
measures to ensure effective data quality management (e.g. Data Collection v
Data Sharing & Publication)
24
Global Data Strategy, Ltd. 2022
Effective Data Lifecycle Management:
Data Governance Implications
• Data Quality is most effectively sustained and enforced through
Data Governance policies and practices
• Data Governance is a critical enabler to ensure end to end data
lifecycle management, including data quality management
• Data accountability needs to be assigned throughout the lifecycle
by:
• Appointing Data Owners and Data Stewards during the Plan stage of
the data management lifecycle to ensure new data creation is
required by the business and is aligned with existing data policies and
standards
• Ensuring policies and processes exist to ensure key data is governed
throughout its lifecycle
• Accountability may change as the data moves through the lifecycle
(e.g. from processing to sharing, and usage to archiving)
25
Global Data Strategy, Ltd. 2022
Summary
• Data quality management remains a holistic challenge
involving People, Process & Technology
• Data quality management must embrace the entire data
lifecycle from data creation to data disposal
• Even when data is at the later stages of the lifecycle
thinking about data quality from a data lifecycle
management perspective helps to focus on the root
causes of data quality problems
• Putting focus and effort on the early stages of the data
lifecycle reduces cost of failure impacts and remediation
activities and costs
• Data Governance is the key enabler for sustaining
improved data quality and needs to underpin the entire
data management lifecycle
26
Global Data Strategy, Ltd. 2022
Dataversity Data Architecture Strategies Series:
Related Previous Data Quality Webinars
27
https://www.dataversity.net/das-webinar-data-quality-best-practices-3/
https://www.dataversity.net/das-slides-data-quality-best-practices-2/
August 2020:
The A2E Methodology for Tackling Data Quality Problems
August 2021:
Designing and applying Business Rules to Support
Data Quality Improvement
Global Data Strategy, Ltd. 2022
DATAVERSITY Data Architecture Strategies
• January Emerging Trends in Data Architecture – What’s the Next Big Thing?
• February Building a Data Strategy - Practical Steps for Aligning with Business Goals
• March Master Data Management – Aligning Data, Process, and Governance
• April Data Governance & Data Architecture: Alignment & Synergies
• May Improving Data Literacy Around Data Architecture
• June Business Intelligence & Data Analytics: An Architected Approach
• July Best Practices in Metadata Management
• August Data Quality Best Practices – with special guest Nigel Turner
• September Business-centric Data Modeling
• October Graph Databases: Benefits & Risks
• December Enterprise Architecture vs. Data Architecture
28
This Year’s Lineup
Global Data Strategy, Ltd. 2022
Who We Are: Business-Focused Data Strategy
Maximize the Organizational Value of Your Data Investment
In today’s business environment, showing rapid time to value for
any technical investment is critical.
But technology and data can be complex. At Global Data Strategy,
we help demystify technical complexity to help you:
• Demonstrate the ROI and business value of data to your
management
• Build a data strategy at your pace to match your unique culture
and organizational style.
• Create an actionable roadmap for “quick wins”, which building
towards a long-term scalable architecture.
Global Data Strategy’s shares experience from some of the largest
international organizations scaled to the pace of your unique team.
www.globaldatastrategy.com
Global Data Strategy has worked with organizations globally in the
following industries:
Finance · Retail · Social Services · Health Care · Education · Manufacturing
· Government · Public Utilities · Construction · Media & Entertainment ·
Insurance …. and more
29
Global Data Strategy, Ltd. 2022
Questions?
Thoughts? Ideas?
30

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected Approach
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...
 
How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)
 
DMBOK and Data Governance
DMBOK and Data GovernanceDMBOK and Data Governance
DMBOK and Data Governance
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Introduction to Data Management Maturity Models
Introduction to Data Management Maturity ModelsIntroduction to Data Management Maturity Models
Introduction to Data Management Maturity Models
 
The Role of Data Governance in a Data Strategy
The Role of Data Governance in a Data StrategyThe Role of Data Governance in a Data Strategy
The Role of Data Governance in a Data Strategy
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 

Ähnlich wie Data Quality Best Practices

Ähnlich wie Data Quality Best Practices (20)

DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
 
Data Governance & Data Architecture - Alignment and Synergies
Data Governance & Data Architecture - Alignment and SynergiesData Governance & Data Architecture - Alignment and Synergies
Data Governance & Data Architecture - Alignment and Synergies
 
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
 
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
DAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best Practices
 
dataversitydatagovernanceorgchangeapril2019-190429155809.pdf
dataversitydatagovernanceorgchangeapril2019-190429155809.pdfdataversitydatagovernanceorgchangeapril2019-190429155809.pdf
dataversitydatagovernanceorgchangeapril2019-190429155809.pdf
 
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and SynergiesDAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
 
Master Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and GovernanceMaster Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and Governance
 
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & RisksDAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
 
Key Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramKey Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance Program
 
DAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata ManagementDAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata Management
 
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data LandscapeData Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
DAS Slides: Data Virtualization – Separating Myth from Reality
DAS Slides: Data Virtualization – Separating Myth from RealityDAS Slides: Data Virtualization – Separating Myth from Reality
DAS Slides: Data Virtualization – Separating Myth from Reality
 
Data Governance — Aligning Technical and Business Approaches
Data Governance — Aligning Technical and Business ApproachesData Governance — Aligning Technical and Business Approaches
Data Governance — Aligning Technical and Business Approaches
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 

Mehr von DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 
Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsData Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and Roadmaps
 
Including All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and AnalyticsIncluding All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and Analytics
 

Kürzlich hochgeladen

Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 

Kürzlich hochgeladen (20)

Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 

Data Quality Best Practices

  • 1. Copyright Global Data Strategy, Ltd. 2022 Data Quality Best Practices Donna Burbank and Nigel Turner Global Data Strategy, Ltd. August 25th, 2022 Follow on Twitter @donnaburbank, @nigelturner8 @GlobalDataStrat Twitter Event hashtag: #DAStrategies 1
  • 2. Global Data Strategy, Ltd. 2022 Donna Burbank 2 • Recognized industry expert in information management with over 25 years of experience in data strategy, information management, data modeling, metadata management, and enterprise architecture • Managing Director at Global Data Strategy, Ltd., an international information management consulting company that specializes in the alignment of business drivers with data-centric technology • Worked with dozens of Fortune 500 companies worldwide in the Americas, Europe, Asia, and Africa and speaks regularly at industry conferences • Excellence in Data Management Award from DAMA International • Past President and Advisor to the DAMA Rocky Mountain chapter • Co-author of several books on data management • Regular contributor to industry publications • She can be reached at donna.burbank@globaldatastrategy.com Donna is based in Boulder, Colorado, US Follow on Twitter @donnaburbank @GlobalDataStrat
  • 3. Global Data Strategy, Ltd. 2022 • Worked in Information Management (IM) and related areas for over 30 years. Experience has embraced Data Governance, Information Strategy, Data Quality, Data Governance, Master Data Management & Business Intelligence. • Principal Consultant, EMEA, for Global Data Strategy, Ltd. • Spent much of his career in British Telecommunications Group (BT) where he led a series of enterprise- wide IM & data governance initiatives. • Also been VP of Information Management Strategy at Harte Hanks Trillium Software, and Principal Consultant at FromHereOn and IPL. • Nigel is very active in professional Data Management organizations and is an elected Data Management Association (DAMA) UK Committee member. • He was the joint winner of DAMA International’s 2015 Community Award for the work he initiated and led in setting up a mentoring scheme in the UK where experienced DAMA professionals coach and support newer data management professionals. • Nigel is based in Cardiff, Wales, UK. Follow on Twitter @NigelTurner8 Today’s hashtag: # DAStrategies Nigel Turner 3
  • 4. Global Data Strategy, Ltd. 2022 DATAVERSITY Data Architecture Strategies • January Emerging Trends in Data Architecture – What’s the Next Big Thing? • February Building a Data Strategy - Practical Steps for Aligning with Business Goals • March Master Data Management – Aligning Data, Process, and Governance • April Data Governance & Data Architecture: Alignment & Synergies • May Improving Data Literacy Around Data Architecture • June Business Intelligence & Data Analytics: An Architected Approach • July Best Practices in Metadata Management • August Data Quality Best Practices – with special guest Nigel Turner • September Business-centric Data Modeling • October Graph Databases: Benefits & Risks • December Enterprise Architecture vs. Data Architecture 4 This Year’s Lineup
  • 5. Global Data Strategy, Ltd. 2022 5 What We’ll Cover Today – Our Agenda • Explore the Data Management Lifecycle and why data quality management is a ‘must have’ across all stages of the lifecycle • Outline specific data quality emphases and approaches that support data quality in specific stages of the Data Management Lifecycle
  • 6. Global Data Strategy, Ltd. 2022 6 A Successful Data Strategy links Business Goals with Technology Solutions “Top-Down” alignment with business priorities “Bottom-Up” management & inventory of data sources Managing the people, process, policies & culture around data Coordinating & integrating disparate data sources Leveraging & managing data for strategic advantage Data Quality is Part of a Wider Data Strategy www.globaldatastrategy.com
  • 7. Global Data Strategy, Ltd. 2022 Data Quality Remains a Growing Priority 7 0% 10% 20% 30% 40% 50% 60% 70% Data Strategy Data Quality Data Architecture Data Governance Data Security Business Intelligence & Reporting Data Warehouse 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% Data Architecture Metadata Management Master Data Management Data Quality Self-Service Reporting & Analytics Data Strategy Data Governance Data Priorities in Place for 2021 Future Data Priorities in 2022-2023 Data Quality is #6 Data Quality is #4 From the 2020 DATAVERSITY survey on “Trends in Data Management”, by Donna Burbank & Michelle Knight Available for download at: https://globaldatastrategy.com/resources/white-papers/
  • 8. Global Data Strategy, Ltd. 2022 Primary Data Quality Challenges 8 Headline Findings: • Data quality challenges remain holistic, embracing people, process and technology issues • Business challenges include culture change, skills, training, senior management buy in etc. • Technical challenges include scaling, handling new data types, and managing data quality across multiple platforms and sources • Challenges embrace all stages of the data lifecycle, from data creation to data usage and publication Source: 2022 State of Data Quality, TDWI, James Kobielus Available for download at https://globaldatastrategy.com/resources/white-papers/
  • 9. Global Data Strategy, Ltd. 2022 Data Quality: the 1:10:100 Principle 9 $1 $10 $100 Cost of Prevention Cost of Failure Cost of Correction The 1:10:100 Principle It costs exponentially more to identify and correct data quality problems and data errors the later they are identified and addressed
  • 10. Global Data Strategy, Ltd. 2022 Customer touch points Wrong address X The 1:10:100 Principle – A Use Case from British Telecommunications (BT) • Incorrect or incomplete addresses input on data entry led to huge costs of failure, impacting: • Network connection • Customer equipment installation • Customer billing • Fault repair • Customer marketing • Electronic directory enquiries and paper directories • Emergency services (999 / 911) • All of these resulted in failures at different customer touch points • Conclusion: • Minimize data entry errors! • Any costs incurred in enhancing address data entry (e.g. business process changes, drop down menus, MDM etc.) insignificant when compared to downstream failure costs 10
  • 11. Global Data Strategy, Ltd. 2022 How This Led to Bigger Data Quality Successes in BT 11 • Address improvement became the first major project in a data quality improvement program • The program eventually ran for 10 years – adapting and evolving to reflect BT’s transforming business • Over 75 Data Quality improvement projects completed • Projects ranged from tactical data cleanses to strategic enterprise wide projects, e.g. Customer MDM (including address) • Benefits derived from process efficiencies, capital expenditure avoidance, better asset utilization, revenue recovery etc. • Total audited bottom line benefits exceeded $1 billion + • Praised by Gartner, Forrester, OVUM, Tom Redman and others for its business alignment & rapid delivery cycles
  • 12. Global Data Strategy, Ltd. 2022 Data Management Lifecycle: What Is It? • Most living or manufactured objects have a natural lifecycle; as an asset, data is no different • Data is: • Created • Has a lifespan • Ultimately reduces in value as its currency declines • Is eventually discarded or disposed of • A Data Management Lifecycle can be defined as: A way of describing the different stages data will go through from design and collection to dissemination and archival / destruction’ (Source: UK Government Data Quality Framework 2020) • Organizations creating, storing, using and publishing data need to be aware of the data lifecycle and plan to manage data throughout its lifecycle • How data is managed will vary according to its lifecycle stage 12
  • 13. Global Data Strategy, Ltd. 2022 Data Management Lifecycle 13 PLAN COLLECT / ACQUIRE / INGEST PREPARE / STORE / MAINTAIN USE AND PROCESS SHARE AND PUBLISH ARCHIVE OR DESTROY Sources: • UK Government Data Quality Framework • DAMA Data Management Book of Knowledge (DMBoK) Data Management Lifecycle Stage 1 Stage 5 Stage 4 Stage 3 Stage 2 Stage 6
  • 14. Global Data Strategy, Ltd. 2022 Data Management Lifecycle and Data Lineage 14 Data Management Lifecycle Data Lineage ‘Data lineage… helps to determine the data provenance for your organization. It can provide an ongoing and continuously updated record of where a data asset originates, how it moves through the organization, how it gets transformed, where it’s stored, who accesses it and other key metadata.’ Source: Informatica ‘A way of describing the different stages data will go through from design and collection to dissemination and archival / destruction.’ Source: UK Government Data Quality Framework 2020 The concepts of Data Management Lifecycle and Data Lineage overlap in that both: • Emphasize that data is managed through a sequence of processes and activities • Describe the way data changes as it passes through the sequence But differ in that: • Lifecycle Management is more business focused as it describes the overall maturity of and use of data from its creation to disposal • Data Lineage is more technically focused as it describes the physical flow of data as it moves from its point of origin to its point of usage
  • 15. Global Data Strategy, Ltd. 2022 Data Lifecycle and the 1:10:100 Principle 15 PLAN COLLECT / ACQUIRE / INGEST PREPARE / STORE / MAINTAIN USE AND PROCESS SHARE AND PUBLISH ARCHIVE OR DESTROY 1 10 100 Cost to correct data errors and problems Data Management Lifecycle emphasis Prevention Correction Cost of Failure Remediation
  • 16. Global Data Strategy, Ltd. 2022 Stage 1: PLAN 16 PLAN PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES • Unclear ownership and accountability for new data • Potential data duplication • Data fitness for purpose • Lack of data definitions and standards (format & content) • Impact on downstream data processing • Poor design of data collection / acquisition / ingestion • Identify business needs & objectives • Investigate if data already exists • Develop a business case • Produce requirements specification, including business rules • Determine data provider(s) – internal / external (Open & Propriety) • Design data creation / ingestion processes • Conduct impact analysis • Appoint business data owner and data steward to ensure early accountability • Engage key data stakeholders to establish ‘ fitness for purpose’ • Investigate current data sources to ensure existing sources not available • Profile potential data sources to quantify baseline data quality • Define required data definitions & standards & agree supplier SLA • Consider data validation methods and rules, e.g. drop down lists Make the business case for and create a plan for a new data source (internal and / or external)
  • 17. Global Data Strategy, Ltd. 2022 Stage 2: COLLECT / ACQUIRE / INGEST 17 PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES COLLECT / ACQUIRE / INGEST Design & implement data entry / acquisition processes and templates • Specify and design optimum data entry types ▪ Users ▪ Devices ▪ Applications ▪ IoT etc. • Determine target platform(s) (existing or new) • Design and implement source to target mapping • Lack of data source metadata • Errors in manual data entry • Inconsistent data formats & content • Incomplete or inaccurate data • Create data model(s) • Populate business glossary / data catalog • Design detailed DQ rules and enforce on data entry / ingestion • Implement regular source data monitoring • Establish data quality measures and dashboard
  • 18. Global Data Strategy, Ltd. 2022 Stage 3: PREPARE / STORE /MAINTAIN 18 PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES PREPARE / STORE / MAINTAIN Implement a stable policy driven technical environment to store and maintain data • Define and implement storage policies (e.g. encryption, compression etc.) • Publish data security & data privacy policies • Design & implement ETL / ELT processes • Integrate data into existing data stores or new data stores • Inappropriate data access and uses • Multiple data consumer needs and requirements • Data transformation (source to target): too often DQ is first considered here • Metadata management • Ongoing data quality validation & reporting • Implement role based access and usage to data • Embed predefined DQ rules in ETL / ELT routines – Implement DQ rules engine • Implement DQ dashboards • Actively steward the data to ensure all data consumer needs met • Automatic update of data catalog
  • 19. Global Data Strategy, Ltd. 2022 Stage 4: USE & PROCESS 19 PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES USE AND PROCESS Data is stored and processed in a stable environment for use by data consumers • Schedule regular (real time / event or time driven) data ingestion and storage processes • Actively maintain the data within the technical environment • Monitor use and adherence to data policies • Lack of data quality error monitoring and reporting • Absence of data consumer feedback channels to report DQ problems • Inaction on tackling data quality issues • Active data stewardship • Generate data consumer DQ reports & deploy feedback workflows • Consider MDM workflows • Deploy DQ rules engine to enforce DQ rules throughout the lifespan of the data
  • 20. Global Data Strategy, Ltd. 2022 Automating Data Quality Business Rules via a DQ Rules Engine DATA INPUT DATA WAREHOUSE STAGING / ETL LAYER SOURCE SYSTEMS REPORTING LAYER DATA MARTS Real Time Data Validation Batch Validation DATA QUALITY RULES ENGINE 20
  • 21. Global Data Strategy, Ltd. 2022 Use of Tools & Technology in Data Quality Management 21 Source: 2022 State of Data Quality, TDWI, James Kobielus Available for download at https://globaldatastrategy.com/resources/white-papers/ Headline Findings: • Tools and technologies are increasingly being acquired and used to support data quality management • Strong focus on Data Preparation & Transformation reinforce finding that analytics and intelligence are key drivers for better data quality management • Although only 16% currently have an enterprise wide data catalog, 45% intend implementing one within the next 12 months Tools enable: • Profiling • Monitoring • Parsing • Standardizing • Matching • Merging • Cleansing • Correcting • Enhancing • Management of data quality rules • Data quality workflows • Data quality reporting
  • 22. Global Data Strategy, Ltd. 2022 Stage 5: SHARE & PUBLISH 22 PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES SHARE AND PUBLISH • Make data available to all authorized consumers and users • Develop pre-canned reports and / or self-service reporting capabilities • Generate and maintain user metadata Data consumers access and modify the data to use in BI, analytics, visualisation etc. • Continuing data quality issues – accuracy, completeness, uniqueness, consistency etc. • Active management and update of metadata • Publish report catalog & associated metadata • Publish user focused DQ dashboards • Ensure DQ workflows and feedback mechanisms are actively managed and acted on • Apply automated data catalog update processes
  • 23. Global Data Strategy, Ltd. 2022 Stage 6: ARCHIVE & DESTROY 23 PURPOSE OF STAGE KEY STAGE ACTIVITIES DATA QUALITY FOCUS DATA QUALITY CHALLENGES ARCHIVE OR DESTROY To manage data that is no longer required for current operational or reporting purposes • Establish and operate data archiving and / or destruction policies and processes where data is no longer actively used because: ▪ It is time expired ▪ Legal or regulatory constraints demand action ▪ A specific project ends (e.g. analytics or data science sandpit) • Difficulty in identifying data to be archived or destroyed • Ensuring data archived is secure, tagged and accessible in case of future need • Loss of potential knowledge / expertise of archived data • Ensure data retention policies and processes are in place and enforced • Apply data archive security & privacy policies • Ensure archived data is actively stewarded and maintained, including metadata
  • 24. Global Data Strategy, Ltd. 2022 Effective Data Lifecycle Management: Data Quality Implications • In order to minimize costs of failure in the data management lifecycle, the most critical stages are Stage 1 (Plan) and Stage 2 (Collect / Ingest / Acquire) • In these stages Data Quality by Design should be the key objective • This will ensure: • From the outset the data is fit for purpose throughout its lifecycle • Potential data quality issues and problems can be identified early and preventative actions taken before usage • Delaying fixing issues to later stages will: • Lead to operational inefficiencies and poor decision making • Require more effort to re-engineer the data • Increase the costs of remediation • Each stage of the data management lifecycle will require different KPIs and measures to ensure effective data quality management (e.g. Data Collection v Data Sharing & Publication) 24
  • 25. Global Data Strategy, Ltd. 2022 Effective Data Lifecycle Management: Data Governance Implications • Data Quality is most effectively sustained and enforced through Data Governance policies and practices • Data Governance is a critical enabler to ensure end to end data lifecycle management, including data quality management • Data accountability needs to be assigned throughout the lifecycle by: • Appointing Data Owners and Data Stewards during the Plan stage of the data management lifecycle to ensure new data creation is required by the business and is aligned with existing data policies and standards • Ensuring policies and processes exist to ensure key data is governed throughout its lifecycle • Accountability may change as the data moves through the lifecycle (e.g. from processing to sharing, and usage to archiving) 25
  • 26. Global Data Strategy, Ltd. 2022 Summary • Data quality management remains a holistic challenge involving People, Process & Technology • Data quality management must embrace the entire data lifecycle from data creation to data disposal • Even when data is at the later stages of the lifecycle thinking about data quality from a data lifecycle management perspective helps to focus on the root causes of data quality problems • Putting focus and effort on the early stages of the data lifecycle reduces cost of failure impacts and remediation activities and costs • Data Governance is the key enabler for sustaining improved data quality and needs to underpin the entire data management lifecycle 26
  • 27. Global Data Strategy, Ltd. 2022 Dataversity Data Architecture Strategies Series: Related Previous Data Quality Webinars 27 https://www.dataversity.net/das-webinar-data-quality-best-practices-3/ https://www.dataversity.net/das-slides-data-quality-best-practices-2/ August 2020: The A2E Methodology for Tackling Data Quality Problems August 2021: Designing and applying Business Rules to Support Data Quality Improvement
  • 28. Global Data Strategy, Ltd. 2022 DATAVERSITY Data Architecture Strategies • January Emerging Trends in Data Architecture – What’s the Next Big Thing? • February Building a Data Strategy - Practical Steps for Aligning with Business Goals • March Master Data Management – Aligning Data, Process, and Governance • April Data Governance & Data Architecture: Alignment & Synergies • May Improving Data Literacy Around Data Architecture • June Business Intelligence & Data Analytics: An Architected Approach • July Best Practices in Metadata Management • August Data Quality Best Practices – with special guest Nigel Turner • September Business-centric Data Modeling • October Graph Databases: Benefits & Risks • December Enterprise Architecture vs. Data Architecture 28 This Year’s Lineup
  • 29. Global Data Strategy, Ltd. 2022 Who We Are: Business-Focused Data Strategy Maximize the Organizational Value of Your Data Investment In today’s business environment, showing rapid time to value for any technical investment is critical. But technology and data can be complex. At Global Data Strategy, we help demystify technical complexity to help you: • Demonstrate the ROI and business value of data to your management • Build a data strategy at your pace to match your unique culture and organizational style. • Create an actionable roadmap for “quick wins”, which building towards a long-term scalable architecture. Global Data Strategy’s shares experience from some of the largest international organizations scaled to the pace of your unique team. www.globaldatastrategy.com Global Data Strategy has worked with organizations globally in the following industries: Finance · Retail · Social Services · Health Care · Education · Manufacturing · Government · Public Utilities · Construction · Media & Entertainment · Insurance …. and more 29
  • 30. Global Data Strategy, Ltd. 2022 Questions? Thoughts? Ideas? 30