SlideShare ist ein Scribd-Unternehmen logo
1 von 57
Downloaden Sie, um offline zu lesen
Getting (Re)Started
with Data Stewardship
© Copyright 2021 by Peter Aiken Slide # 1
paiken@plusanythingawesome.com+1.804.382.5957 Peter Aiken, PhD
Integrating the role of Data Debt fighting
Peter Aiken, Ph.D.
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Institute for Defense Analyses (ida.org)
• DAMA International (dama.org)
• MIT CDO Society (iscdo.org)
• Anything Awesome (plusanythingawesome.com)
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart …
• 12 books and dozens
of articles
© Copyright 2021 by Peter Aiken Slide # 2
https://plusanythingawesome.com
+
• DAMA International President 2009-2013/2018/2020
• DAMA International Achievement Award 2001
(with Dr. E. F. "Ted" Codd
• DAMA International Community Award 2005
3
Program
© Copyright 2021 by Peter Aiken Slide #
• Why do we need data stewards?
– Definitions: Data steward/stewardship, data debit
– Architectural context (cannot just clean up)
– The role of strategy
• How are stewards supposed to
eliminate data debit while
facing compliance/regulation/initiatives?
– Resolve prerequisite challenges
– Relationship with governance/organization
– Stewardship role (in context of data governance)
– Fire station model (Reactive & proactive foci)
• When (Organizational improvement)
– Differing cadence (Need for different structural approach)
– Foundational prerequisites
– Need for simplicity, agility, practice
• Take aways ➜ Q&A
Getting (Re)Started with
Data Stewardship
Integrating the role of Data Debit fighting
© Copyright 2021 by Peter Aiken Slide # 4
https://plusanythingawesome.com
Question?
• How many starting versus how many re-starting?
How old is your profession?
© Copyright 2021 by Peter Aiken Slide # 5
https://plusanythingawesome.com
Augusta Ada King
Countess of Lovelace
(1815-52)
• 8,000+ years
• formalize practices
• GAAP
What do we teach knowledge workers about data?
© Copyright 2021 by Peter Aiken Slide # 6
https://plusanythingawesome.com
What percentage of the deal with it daily?
What do we teach IT professionals about data?
© Copyright 2021 by Peter Aiken Slide # 7
https://plusanythingawesome.com
• 1 course
– How to build a
new database
• What
impressions do IT
professionals get
from this
education?
– Data is a technical
skill that is needed
when developing
new databases
Confusion
• IT thinks data is a business problem
– "If they can connect to the server, then my job is done!"
• The business thinks IT is managing data adequately
– "Who else would be taking care of it?"
© Copyright 2021 by Peter Aiken Slide # 8
https://plusanythingawesome.com
Bad Data Decisions/Illiteracy Spiral
© Copyright 2021 by Peter Aiken Slide # 9
https://plusanythingawesome.com
Bad data decisions
Technical
decision makers
are not data literate
Business decision
makers are not
data literate
Poor organizational outcomes
Poor treatment of
organizational data
assets
Poor
quality
data
© Copyright 2021 by Peter Aiken Slide # 10
https://plusanythingawesome.com
?
Definitions
Steward
• 1. a person who looks after the passengers on a ship, aircraft, or train and brings them meals.
– synonyms: flight attendant, cabin attendant, air hostess, purser "an air steward"
– a person responsible for supplies of food to a college, club, or other institution.
• 2. an official appointed to supervise arrangements or keep order at a large public event, for ex. sporting
event.
– synonyms: official, marshal, organizer "the race stewards"
– short for shop steward.
• 3. a person employed to manage another's property, especially a large house or estate.
– synonyms: (estate) manager, agent, overseer, custodian, caretaker; historical "the steward of the estate"
– a person whose responsibility it is to take care of something."farmers pride themselves on being stewards of the countryside"
Stewarding
• 1. (of an official) supervise arrangements or keep order at (a large public event).
"the event was organized and stewarded properly"
• 2. manage or look after (another's property).
Data Steward
• Manage data assets on behalf of all stake holders and in the best interests of the organization
(McGilvray, 2008)
• Represent the interests of all stakeholders and take an enterprise perspective
• Have dedicated time enough to be accountable and responsible
Trust
• Firm belief in the reliability, truth, ability, or strength of someone or something (google.com)
Fiduciary
• Involving trust, especially with regard to the relationship between a trustee and a beneficiary (google.com)
© Copyright 2021 by Peter Aiken Slide # 11
https://plusanythingawesome.com
one who actively directs the use of
organizational data assets in support
of specific mission objectives
Steward
• one who actively directs
© Copyright 2021 by Peter Aiken Slide # 12
https://plusanythingawesome.com
, Data
Data
Steward
© Copyright 2021 by Peter Aiken Slide # 13
https://plusanythingawesome.com
• What do data stewards do in our organization?
- Improve the organization's data assets value, and
- Advocate/evangelize for increasing the scope/rigor of
data-centric practices
- Ensure efficient/effective data management practices
- Improve data's use achieving organizational objectives
Source: Unknown
© Copyright 2021 by Peter Aiken Slide # 14
https://plusanythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
Data Debt
• The time and effort it will take to
return your data to a governed
state from its likely current state of
ungoverned
• Getting back to zero
– Involves undoing existing stuff
– Likely new skills are required
• At zero-must start from scratch
– Typically requires annual proof of value
• Now you need to get good at both
– Almost all data challenges involve
interoperability
– Little guidance at optimizing data
management practices
– Very little guidance as to how to get
back to zero
© Copyright 2021 by Peter Aiken Slide # 15
https://plusanythingawesome.com
Poor data manifests as multifaceted organizational challenges
© Copyright 2021 by Peter Aiken Slide # 16
https://plusanythingawesome.com
Root cause analysis is part of data stewardship
© Copyright 2021 by Peter Aiken Slide # 17
https://plusanythingawesome.com
IT
System
Business
Challenge
Business
Process
Business
Challenge
IT
Process
Business
Challenge
Business
System
Business
Challenge
IT
Process
Business
Challenge
IT
System
Business
Challenge
Business
Process
Business
Challenge
Poor results
Consistency Encourages Quality Analysis
© Copyright 2021 by Peter Aiken Slide # 18
https://plusanythingawesome.com
IT
System
Business
Challenge
Business
Process
Business
Challenge
IT
Process
Business
Challenge
Business
System
Business
Challenge
IT
Process
Business
Challenge
IT
System
Business
Challenge
Business
Process
Business
Challenge
Eliminating data debt
requires a team with
specialized skills
deployed to create a
repeatable process
and develop sustained
organizational
skillsets
Governance and
Architecture
© Copyright 2021 by Peter Aiken Slide # 19
https://plusanythingawesome.com
Example from: https://www.slideshare.net/AnthonyDehnashi/architecture-governance
https://plusanythingawesome.com
Corporate Governance
• "Corporate governance - which
can be defined narrowly as the
relationship of a company to its
shareholders or, more broadly,
as its relationship to society….",
Financial Times, 1997.
• "Corporate governance is about
promoting corporate fairness,
transparency and accountability"
James Wolfensohn, World Bank, President
Financial Times, June 1999.
• “Corporate governance deals
with the ways in which suppliers
of finance to corporations assure
themselves of getting a return on
their investment”,
The Journal of Finance, Shleifer and Vishny,
1997.
© Copyright 2021 by Peter Aiken Slide # 20
https://plusanythingawesome.com
• "Corporate governance - which
can be defined narrowly as the
relationship of a company to its
shareholders or, more broadly,
as its relationship to society….",
Financial Times, 1997.
• "Corporate governance is about
promoting corporate fairness,
transparency and accountability"
James Wolfensohn, World Bank, President
Financial Times, June 1999.
• “Corporate governance deals
with the ways in which suppliers
of finance to corporations assure
themselves of getting a return on
their investment”,
The Journal of Finance, Shleifer and Vishny,
1997.
© Copyright 2021 by Peter Aiken Slide # 21
https://plusanythingawesome.com
https://plusanythingawesome.com
• "Putting structure around how organizations align IT strategy with
business strategy, ensuring that companies stay on track to achieve
their strategies and goals, and implementing good ways to measure
IT’s performance.
• It makes sure that all stakeholders’ interests
are taken into account and that
processes provide measurable results.
• Framework should answer some key
questions, such as how the IT department
is functioning overall, what key metrics
management needs and what return IT
is giving back to the business from the
investment it’s making." CIO Magazine (May 2007)
IT Governance Institute, 5 areas of focus:
• Strategic Alignment
• Value Delivery
• Resource Management
• Risk Management
• Performance Measures
• "Putting structure around how organizations align IT strategy with
business strategy, ensuring that companies stay on track to achieve
their strategies and goals, and implementing good ways to measure
IT’s performance.
• It makes sure that all stakeholders’ interests
are taken into account and that
processes provide measurable results.
• Framework should answer some key
questions, such as how the IT department
is functioning overall, what key metrics
management needs and what return IT
is giving back to the business from the
investment it’s making." CIO Magazine (May 2007)
IT Governance Institute, 5 areas of focus:
• Strategic Alignment
• Value Delivery
• Resource Management
• Risk Management
• Performance Measures
IT Governance
© Copyright 2021 by Peter Aiken Slide # 22
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide # 23
https://plusanythingawesome.com
https://plusanythingawesome.com
Elevator Pitch
7 Data Governance Definitions
24
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• The formal orchestration of people, process, and technology to enable
an organization to leverage data as an enterprise asset – The MDM Institute
• A convergence of data quality, data management, business process
management, and risk management surrounding the handling of data in an
organization – Wikipedia
• A system of decision rights and accountabilities for information-related
processes, executed according to agreed-upon models which describe who
can take what actions with what information, and when, under what
circumstances, using what methods – Data Governance Institute
• The execution and enforcement of authority over the management of data
assets and the performance of data functions – KiK Consulting
• A quality control discipline for assessing, managing, using, improving,
monitoring, maintaining, and protecting organizational
information – IBM Data Governance Council
• Data governance is the formulation of policy to optimize,
secure, and leverage information as an enterprise asset
by aligning multiple functional objectives – Sunil Soares
• The exercise of authority and control over the
management of data assets – DM BoK
What is Data Governance?
© Copyright 2021 by Peter Aiken Slide # 25
https://plusanythingawesome.com
Managing
Data
with
Guidance
Would
you
want
your
sole,
non-
depletable,
non-
degrading,
durable,
strategic
asset
managed
without
guidance?
https://plusanythingawesome.com
What is Data Governance?
© Copyright 2021 by Peter Aiken Slide # 26
https://plusanythingawesome.com
Managing
Data
Decisions with
Guidance
Would
you
want
your
sole,
non-
depletable,
non-
degrading,
durable,
strategic
asset
managed
without
guidance?
https://plusanythingawesome.com
Architecture
• Things
– (components)
data structures
• The functions of the things
– (individually)
sources and uses of data
• How the things interact
– (as a system, towards a goal)
Efficiencies/effectiveness
© Copyright 2021 by Peter Aiken Slide # 27
https://plusanythingawesome.com
Organizational
Architectures
• Amazon
– Traditional
structure
• Google
– Team of 3
• Facebook
– Do you really
have a structure?
• Microsoft
– Eliminate their
own products
• Apple
– Everything
revolves around
one individual
• Oracle
– Buys one
company after
another
© Copyright 2021 by Peter Aiken Slide # 28
https://plusanythingawesome.com
Data/Information Architectures – Useful Definition
• Common vocabulary expressing
integrated requirements ensuring that
data assets are stored, arranged,
managed, and used in systems in support
of organizational strategy [Aiken 2010]
© Copyright 2021 by Peter Aiken Slide # 29
https://plusanythingawesome.com
• A specific definition
– 'Understanding an architecture'
– Documented and articulated as a (digital) blueprint illustrating the
commonalities and interconnections among the
architectural components
– Ideally the understanding
is shared by systems and
humans
Understanding = Interoperability
© Copyright 2021 by Peter Aiken Slide # 30
https://plusanythingawesome.com
Data Architectures: here, whether you like it or not
© Copyright 2021 by Peter Aiken Slide # 31
https://plusanythingawesome.com
deviantart.com
• All organizations
have data
architectures
– Some are better
understood and
documented (and
therefore more
useful to the
organization) than
others
Typically Managed Organizational Architectures
• Process Architecture
– Arrangement of inputs -> transformations = value -> outputs
– Typical elements: Functions, activities, workflow, events, cycles, products,
procedures
• Systems Architecture
– Applications, software components, interfaces, projects
• Business Architecture
– Goals, strategies, roles, organizational structure, location(s)
• Security Architecture
– Arrangement of security controls in relation to IT Architecture
• Technical Architecture/Tarchitecture
– Relation of software capabilities/technology stack
– Typical elements: Networks, hardware, software platforms, standards/protocols
• Data/Information Architecture
– Arrangement of data assets supporting organizational strategy
– Typical elements: specifications expressed as entities, relationships, attributes,
definitions, values, vocabularies
© Copyright 2021 by Peter Aiken Slide # 32
https://plusanythingawesome.com
The Role of Strategy
© Copyright 2021 by Peter Aiken Slide # 33
https://plusanythingawesome.com
Example from: https://slideplayer.com/slide/5082003/
https://plusanythingawesome.com
What is Strategy?
• Current use derived from military
- a pattern in a stream of decisions
[Henry Mintzberg]
© Copyright 2021 by Peter Aiken Slide # 34
https://plusanythingawesome.com
A thing
Every Day
Low Price
Former Walmart Business Strategy
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com 35
Wayne Gretzky’s
Definition of Strategy
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
He skates to where he
thinks the puck will be ...
36
Strategy in Action: Napoleon faces a larger enemy
• Question?
– How do I defeat the competition when their forces
are bigger than mine?
• Answer:
– Divide
and
conquer!
– “a pattern
in a stream
of decisions”
© Copyright 2021 by Peter Aiken Slide # 37
https://plusanythingawesome.com
Supply Line Metadata
(as part of a divide and conquer strategy)
© Copyright 2021 by Peter Aiken Slide # 38
https://plusanythingawesome.com
First Divide
© Copyright 2021 by Peter Aiken Slide # 39
https://plusanythingawesome.com
Then Conquer
© Copyright 2021 by Peter Aiken Slide # 40
https://plusanythingawesome.com
Complex Strategy
• First
– Hit both armies hard
at just the right spot
• Then
– Turn right and defeat
the Prussians
• Then
– Turn left and defeat
the British
© Copyright 2021 by Peter Aiken Slide #
Whilesomeoneisshootingatyou!
41
https://plusanythingawesome.com
Data Governance Environment
Complex Data Governance Environment
Complex
Data Governance Council
• Liaise between agency operations & CDO
• Advise CDO on technology, policy, and governance
strategies
• Administer data governance policies set by the board
• implement data sharing & analytics projects
• Review open data assets
• Report progress & compliance to the Board
Advise CDO
© Copyright 2021 by Peter Aiken Slide # 42
https://plusanythingawesome.com
Data sharing and analytics
• Identify goals and objectives
• Prioritize initiatives
• Study & report
• Recommend changes to budget and code
DATA COMMISSION
LEGISLATIVE
Region representatives
EXECUTIVE
JUDICIAL
Advise Governor
TRUSTEE
execute
• Define, approve, and communicate data strategies, policies,
standards, rules, guidelines, & best practices
• Provide a governance, policy, and technology framework
• Define agency data governance responsibilities
• Encourage & facilitate data sharing
• Facilitate coordination to prevent duplication
• Coordinate policy and technology proposals and recommendations
• Administer and manage the commonwealth data trust
• Track and enforce compliance and conformance
• Oversee dissemination of open data
Executive data board
Oversee council
• Translate commonwealth goals to agency performance
targets
• provide resources
• Remove organizational obstacles
• appoint data governance council members
• Oversee the data governance council
• Oversee data sharing & analytics projects
Strategy Guides Workgroup Activities
© Copyright 2021 by Peter Aiken Slide #
A pattern
in a stream
of decisions
43
https://plusanythingawesome.com
Strategic Approach - Master/Reference Data
• Reference
– Controls accessible data values
• Master
– Controls access to system capabilities
• Transaction
– Instances of values
© Copyright 2021 by Peter Aiken Slide # 44
https://plusanythingawesome.com
Countries where we do business?
Types of accounts available?
Controlled vocabulary items
Are you a member of our premium club?
Authorizing uses/users?
Common/standard data structures
$5
Authorized
Like
Example from: Dr. Christopher Bradley of DMAdvisors–he has more, ping him at chris.bradley@dmadvisors.co.uk
Data Strategy provides focus for stewardship efforts
Note: Reducing ROT increases data leverage
© Copyright 2021 by Peter Aiken Slide # 45
https://plusanythingawesome.com
Organizational Data
Data Stewards
Technologies
Process
People
Less Data ROT ->
Data Stewardship Considerations
• Why?
– Stewardship terminology is not widely known
– We do not have agreed upon definitions
– It has become a de-facto widely used, ill defined term
– Stewards work effectively with architectural components
– Strategy focuses steward leveraging activities
© Copyright 2021 by Peter Aiken Slide # 46
https://plusanythingawesome.com
http://williamnava.com/philosophy-shaves-barber-21/
© Copyright 2021 by Peter Aiken Slide # 47
https://plusanythingawesome.com
All
inputs
are
data
All
outputs
are
data
All
inputs
are
data
All
inputs
are
data
All
inputs
are
data
All
outputs
are
data
All
outputs
are
data
All
outputs
are
data
All
inputs
are
data
All
outputs
are
data
All
inputs
are
data
All
outputs
are
data
All
inputs
are
data
All
outputs
are
data
The Data Machine
How to determine what to manage formally?
Too much requires expensive and slow bureaucracy ← → Too little misses opportunities
Interoperability is the primary value determinant
48
Program
© Copyright 2021 by Peter Aiken Slide #
• Why do we need data stewards?
– Definitions: Data steward/stewardship, data debit
– Architectural context (cannot just clean up)
– The role of strategy
• How are stewards supposed to
eliminate data debit while
facing compliance/regulation/initiatives?
– Resolve prerequisite challenges
– Relationship with governance/organization
– Stewardship role (in context of data governance)
– Fire station model (Reactive & proactive foci)
• When (Organizational improvement)
– Differing cadence (Need for different structural approach)
– Foundational prerequisites
– Need for simplicity, agility, practice
• Take aways ➜ Q&A
Getting (Re)Started with
Data Stewardship
Integrating the role of Data Debit fighting
• Overly dependent upon:
– Human-beings
– Wetwear
– Knowledge workers
– Informal communications
– Often described
as the weakest link
Data / Information Gap
Information
© Copyright 2021 by Peter Aiken Slide # 49
https://plusanythingawesome.com
Data
© Copyright 2021 by Peter Aiken Slide #
• Have little idea what data they have
• Do not know where it is (and)
• Do not know what their knowledge workers do with it
Put simply, organizations:
50
https://plusanythingawesome.com
Data Knowledge is too little and too informal
• Data stewardship happens 'pretty well' at
the workgroup level
– Defining characteristic of a workgroup (Workgroups get work done!)
– Without guidance, what are the chances that all
workgroups are pulling toward the same objectives?
– Consider the time spent attempting informal practices
• Data chaff becomes sand in the machinery
– Preventing smooth interoperation and exchanges
– Death by 1,000 cuts that have been difficult to account for
• Organizations and individuals lack
– Knowledge
– Skills
– Maturity
© Copyright 2021 by Peter Aiken Slide # 51
https://plusanythingawesome.com
Separating the Wheat from the Chaff
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com 52
https://plusanythingawesome.com
Is well organized data worth more?
Pre-Information Age Metadata
• Examples of information architecture achievements that
happened well before the information age:
– Page numbering
– Alphabetical order
– Table of contents
– Indexes
– Lexicons
– Maps
– Diagrams
© Copyright 2021 by Peter Aiken Slide # 53
https://plusanythingawesome.com
Example from: How to make sense of any mess
by Abby Covert (2014) ISBN: 1500615994
"While we can arrange things
with the intent to communicate
certain information, we can't
actually make information. Our
users do that for us."
https://www.youtube.com/watch?v=60oD1TDzAXQ&feature=emb_logo
https://www.youtube.com/watch?v=r10Sod44rME&t=1s
https://www.youtube.com/watch?v=XD2OkDPAl6s
https://plusanythingawesome.com
https://plusanythingawesome.com
Remove the structure and things fall apart rapidly
• Better organized data increases in value
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com 54
https://plusanythingawesome.com
https://plusanythingawesome.com
Separating the Wheat from the Chaff
• Data that is better organized increases in value
• Poor data management practices are costing organizations
money/time/effort
• 80% of organizational data is ROT
– Redundant
– Obsolete
– Trivial
• The question is which
data to eliminate?
– Most enterprise data is never analyzed
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com 55
https://plusanythingawesome.com
Data Assets Win!
• Today, data is the most powerful, yet underutilized and poorly managed
organizational asset
• Data is your
– Sole
– Non-depletable
– Non-degrading
– Durable
– Strategic
• Asset
– Data is the new oil!
– Data is the new (s)oil!
– Data is the new bacon!
• As such, data deserves:
– It's own strategy
– Attention on par with similar
organizational assets
– Professional ministration
to make up for past neglect
© Copyright 2021 by Peter Aiken Slide #
2020 American Airlines market value ~ $6b
AAdvantage valued between $19.5-$31.5
United market value ~ 9$b
MileagePlus ~ $22b
https://www.forbes.com/sites/advisor/2020/07/15/how-airlines-make-billions-from-monetizing-frequent-flyer-programs/?sh=66da87a614e9
Data
Assets
Financial
Assets
Real
Estate Assets
Inventory
Assets
Non-
depletable
Available for
subsequent
use
Can be
used up
Can be
used up
Non-
degrading √ √ Can degrade
over time
Can degrade
over time
Durable Non-taxed √ √
Strategic
Asset √ √ √ √
56
https://plusanythingawesome.com
Asset: A resource controlled by the organization as a result of past events or
transactions and from which future economic benefits are expected to flow [Wikipedia]
Data Assets Win!
Data Governance Frameworks
• A system of ideas for guiding
analyses
• A means of organizing
project data
• Priorities for data decision
making
• A means of assessing progress
– Don’t put up walls until foundation
inspection is passed
– Put the roof on ASAP
• Make it all dependent upon
continued funding
© Copyright 2021 by Peter Aiken Slide # 57
https://plusanythingawesome.com
A Framework For Stewardship
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
A Framework For Stewardship
from https://www.trainingjournal.com/articles/feature/stewardship
Organizational Data
Challenges
Stewardship Engine
Regulation and Policy
A Framework for Data Stewardship
© Copyright 2021 by Peter Aiken Slide # 59
https://plusanythingawesome.com
Monetary
Proactive Reactive
Stewardship Activities
Address Some
Other Time
Strategic Consideration
Non-monetary
Value
© Copyright 2021 by Peter Aiken Slide # 60
https://plusanythingawesome.com
https://plusanythingawesome.com
Data and Duct Tape
© Copyright 2021 by Peter Aiken Slide # 61
https://plusanythingawesome.com
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide # 62
https://plusanythingawesome.com
https://plusanythingawesome.com
Leadership
(data decision makers)
Domain expertise is less ← | → Domain expertise is greater
Roles more formally defined ← | → Roles less formally defined
Encounter
governed
data
more
directly
←
|
→
Encounter
governed
data
less
directly
More
time
is
dedicated
←
|
→
Less
time
is
dedicated
IT/Systems Development
Stewards
(data trustees)
Guidance
Decisions
Participants/Experts
(data subject matter experts)
Other Sources/Uses
(data makers & consumers)
IT/Systems
Development
Data/feedback
Changes
Action
R
e
s
o
u
r
c
e
s
Ideas
Data/Feedback
Components comprising the data community
© Copyright 2021 by Peter Aiken Slide # 63
https://plusanythingawesome.com
Leadership
(data decision makers)
Stewards
(data trustees)
Guidance
Decisions
Participants/Experts
(data subject matter experts)
Other Sources/Uses
(data makers & consumers)
Data/feedback
Changes
Action
R
e
s
o
u
r
c
e
s
Ideas
Data/Feedback
Basic Version
© Copyright 2021 by Peter Aiken Slide # 64
https://plusanythingawesome.com
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
≈
Data Governance Role: Produce systemic organizational changes that impact
data and work practices over time
© Copyright 2021 by Peter Aiken Slide # 65
https://plusanythingawesome.com
Data
Leadership
Feedback
Data
Governance
Data
Improvement
Data
Stewards
Data
Community
Participants
Data
Generators/Data
Users
Data
Things
Happen
Organizational
Things
Happen
DIPs
Data
Improves
Over
Time
Data
Improves As
A Result of
Focus
Feedback
X
$
X
$
X
$
X
$
X
$
X
$
X
$
X
$
X $ $
Data Strategy in Context
© Copyright 2021 by Peter Aiken Slide # 66
https://plusanythingawesome.com
Organizational
Strategy
Data Strategy
IT Projects
Organizational Operations
Data
Governance
Data
asset support for
organizational
strategy
What the data
assets need to do to
support strategy
How well data is
supporting strategy
Operational
feedback
How IT
supports strategy
Other
aspects of
organizational
strategy
Data Governance & Data Stewards
© Copyright 2021 by Peter Aiken Slide # 67
https://plusanythingawesome.com
Data Strategy
Data
Governance
What the data
assets do to support
strategy
How well the data
strategy is working
(Business Goals)
(Metadata)
Data Stewards
What is the
most effective
use of steward
investments?
(Metadata)
Progress,
plans,
problems
Getting Started with Data Stewardship
How?
• Transform tribal knowledge-based processes to data asset
leveraging
• Understand stewards transform governance into by strategy
focused action
• Apply a framework to your tasks
• Understand and get good at both reactive and proactive activities
• Attempt to incorporate leadership outside of traditional channels
• Know that you cannot accomplish everything
© Copyright 2021 by Peter Aiken Slide # 68
https://plusanythingawesome.com
https://hatrabbits.com/en/how-how-diagram/
https://plusanythingawesome.com
Keep the proper focus
• Wrong question:
– How should we mange this data?
• Right question:
– Should we include this
data item within the
scope of our current
stewardship
practices?
• Either way document why!
© Copyright 2021 by Peter Aiken Slide # 69
https://plusanythingawesome.com
https://plusanythingawesome.com
70
Program
© Copyright 2021 by Peter Aiken Slide #
• Why do we need data stewards?
– Definitions: Data steward/stewardship, data debit
– Architectural context (cannot just clean up)
– The role of strategy
• How are stewards supposed to
eliminate data debit while
facing compliance/regulation/initiatives?
– Resolve prerequisite challenges
– Relationship with governance/organization
– Stewardship role (in context of data governance)
– Fire station model (Reactive & proactive foci)
• When (Organizational improvement)
– Differing cadence (Need for different structural approach)
– Foundational prerequisites
– Need for simplicity, agility, practice
• Take aways ➜ Q&A
Getting (Re)Started with
Data Stewardship
Integrating the role of Data Debit fighting
Data Footprints
• SQL Server
– 47,000,000,000,000 bytes
– Largest table 34 billion records 3.5 TBs
• Informix
– 1,800,000,000 queries/day
– 65,000,000 tables / 517,000 databases
• Teradata
– 117 billion records
– 23 TBs for one table
• DB2
– 29,838,518,078 daily queries
© Copyright 2021 by Peter Aiken Slide # 71
https://plusanythingawesome.com
Standard data
Data supply
Data literacy
Making a Better Data Governance Sandwich
© Copyright 2021 by Peter Aiken Slide #
Data literacy
Standard data
Data supply
72
https://plusanythingawesome.com
Making a Better Data Governance Sandwich
© Copyright 2021 by Peter Aiken Slide #
Standard data
Data supply
Data literacy
73
https://plusanythingawesome.com
Making a Better Data Sandwich
© Copyright 2021 by Peter Aiken Slide #
Standard data
Data supply
Data literacy
This cannot happen without data engineering and architecture!
Quality data engineering/
architecture work products
do not happen accidentally!
74
https://plusanythingawesome.com
• Durable asset
- An asset that has a usable
life more than one year
• Reasonable project
deliverables
- 90 day increments
- Data evolution is measured in years
• Data
- Evolves - it is not created
- Significantly more stable
• Readymade data architectural components
- Prerequisite to agile development
• Only alternative is to create additional data siloes!
Data is not a Project
© Copyright 2021 by Peter Aiken Slide # 75
https://plusanythingawesome.com
Differences between Programs and Projects
• Programs are Ongoing, Projects End
– Managing a program involves long term strategic planning and
continuous process improvement is not required of a project
• Programs are Tied to the Financial Calendar
– Program managers are often responsible for delivering
results tied to the organization's financial calendar
• Program Management is Governance Intensive
– Programs are governed by a senior board that provides direction,
oversight, and control while projects tend to be less governance-intensive
• Programs Have Greater Scope of Financial Management
– Projects typically have a straight-forward budget and project financial management
is focused on spending to budget while program planning, management and control
is significantly more complex
• Program Change Management is an Executive Leadership Capability
– Projects employ a formal change management process while at the program level,
change management requires executive leadership skills and program change is
driven more by an organization's strategy and is subject to market conditions and
changing business goals
© Copyright 2021 by Peter Aiken Slide #
Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management
76
https://plusanythingawesome.com
Your data program
must last at least as
long as your HR
program!
© Copyright 2021 by Peter Aiken Slide # 77
https://plusanythingawesome.com
“Your Organization is
all about Data,
until it’s not about just Data”
What Business
are you in?
• A management paradigm that views any
manageable system as being limited in
achieving more of its goals by a small
number of constraints
• There is always at least one constraint, and
TOC uses a focusing process to identify the constraint and
restructure the rest of the organization to address it
• TOC adopts the common idiom "a
chain is no stronger than its weakest
link," processes, organizations, etc.,
are vulnerable because the weakest
component can damage or break
them or at least adversely affect the
outcome
© Copyright 2021 by Peter Aiken Slide # 78
https://plusanythingawesome.com https://en.wikipedia.org/wiki/Theory_of_constraints
(TOC)
https://plusanythingawesome.com
Organizational Data Usage Practices
© Copyright 2021 by Peter Aiken Slide # 79
https://plusanythingawesome.com
Data
Management
Practices
Duplicated but ETLed Data
(quality & transformations applied)
"Warehoused" Data
Learning/
Feedback
Marts
Analytics Practices
T1
Organizations
without
a formalized
data stewards
T3
Data Steward: Use data
to create strategic
opportunities
T4
Data Steward: both
Improve Operations
Innovation
The focus of data stewards should be sequenced
© Copyright 2021 by Peter Aiken Slide # 80
https://plusanythingawesome.com
Only 1 is 10 organizations has a board
approved data strategy!
T2
Data Steward: Increase
organizational efficiencies/
effectiveness
X
X
Data Strategy in Context – THIS IS WRONG!
© Copyright 2021 by Peter Aiken Slide #
Organizational Strategy
IT Strategy
Data Strategy
x 81
https://plusanythingawesome.com
Organizational Strategy
IT Strategy
This is correct …
© Copyright 2021 by Peter Aiken Slide #
Data Strategy
https://plusanythingawesome.com
https://plusanythingawesome.com
KISS (Keep it simple, stupid!)
© Copyright 2021 by Peter Aiken Slide # 83
https://plusanythingawesome.com
Data –
Driven?
Centric?
Focused?
First?
…
the Data Doctrine® (V2)
© Copyright 2021 by Peter Aiken Slide # 84
https://plusanythingawesome.com Source: theagiledoctrine.org
We are uncovering better ways of developing
IT systems by doing it and helping others do it.
Through this work we have come to value:
data programs driving IT programs
informed information investing over technology acquisition activities
stable, shared organizational data over IT component evolution
data reuse over the acquisition of new data sources
the Data Doctrine® (V2)
We are uncovering better ways of developing
IT systems by doing it and helping others do it.
Through this work we have come to value:
data programs driving IT programs
informed information investing over technology acquisition activities
stable, shared organizational data over IT component evolution
data reuse over the acquisition of new data sources
© Copyright 2021 by Peter Aiken Slide # 85
https://plusanythingawesome.com
That is, while there is value in the items on
the right, we value the items on the left more.
Source: theagiledoctrine.org
Data Decisions Variety
© Copyright 2021 by Peter Aiken Slide # 86
https://plusanythingawesome.com
https://www.healthcatalyst.com/why-are-data-stewards-so-important-for-healthcare
Data Source Steward
Data Steward
• Business data steward
– Manage from the perspective of business elements (i.e. business definitions
and data quality)
• Technical data steward
– Focus on the use of data by systems and models (i.e. code operation)
• Project data steward
– Gather definitions, quality rules and issues for referral to business/technical stewards
• Domain data steward
– Manage data/metadata required across multiple business areas (i.e. customer data)
• Operational data steward
– Directly input data or instruct those who do; aid business
stewards identifying root cause and addressing issues
• Metadata Data Steward
– Manage metadata as an asset
• Legacy Data Steward
– Manage legacy data as an asset
• Data steward auditor
– Ensures compliance with data guidance
• Data steward manager
– Planning, organizing, leading and controlling
© Copyright 2021 by Peter Aiken Slide # 87
https://plusanythingawesome.com
(list adapted from Plotkin, 2014)
Strategy Example 1
© Copyright 2021 by Peter Aiken Slide #
Good Guys
(Us)
Bad Guys
(Them)
88
https://plusanythingawesome.com
Strategy Example 2
© Copyright 2021 by Peter Aiken Slide #
Good Guys
(Us)
Bad Guys
(Them)
89
https://plusanythingawesome.com
Strategy Example 3
© Copyright 2021 by Peter Aiken Slide #
Good Guys
(Us)
Bad Guys
(Them)
90
https://plusanythingawesome.com
• Why do we need data stewards?
– Definitions: Data steward/stewardship, data debt
– Architectural context (cannot just clean up)
– The role of strategy
• How are stewards supposed to
eliminate data debt while
facing compliance/regulation/initiatives?
– Resolve prerequisite challenges
– Relationship with governance/organization
– Stewardship role (in context of data governance)
– Fire station model (Reactive & proactive foci)
• When (Organizational improvement)
– Differing cadence (Need for different structural approach)
– Foundational prerequisites
– Need for simplicity, agility, practice
• Take aways ➜ Q&A
© Copyright 2021 by Peter Aiken Slide # 91
https://plusanythingawesome.com
Program
Getting (Re)Started with
Data Stewardship
Integrating the role of Data Debt fighting
Take Aways
© Copyright 2021 by Peter Aiken Slide # 92
https://plusanythingawesome.com
• Since stewards own the results,
they control the remediation process
• Need for professional data stewards is
increasing
– Increase in data volume
– Lack of practice improvement
• Data stewardship is a relatively new discipline
– Must conform to organizational constraints
– No one best way
• Data stewards must be driven by a data
strategy complimenting organizational
strategy
• Data stewards direct data management
efforts
• The language of data stewards is metadata
• Process improvement can improve data
stewardship and
organizational practices
• Data Governance: How to
Design, Deploy, and Sustain
an Effective Data
Governance Program
• John Ladley
• Amazon Best Sellers Rank:
#641,937 in Books (See Top
100 in Books)
– #242 in Management Information
Systems
– #209 in Library Management
– #380 in Database Storage & Design
© Copyright 2021 by Peter Aiken Slide # 93
https://plusanythingawesome.com
Upcoming Events
Approaching Data Quality–Engineering Success Stories
14 September 2021
Essential Metadata Strategies
12 October 2021
Necessary Prerequisites to Data Success:
Exorcising the Seven Deadly Data Sins
9 November 2021
© Copyright 2021 by Peter Aiken Slide # 94
https://plusanythingawesome.com
Brought to you by:
Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD
Note: In this .pdf, clicking any webinar title opens the registration link
Event Pricing
© Copyright 2021 by Peter Aiken Slide # 95
https://plusanythingawesome.com
• 20% off
directly from the publisher on
select titles
• My Book Store @
http://plusanythingawesome.com
• Enter the code
"anythingawesome" at the
Technics bookstore checkout
where it says to
"Apply Coupon"
peter@plusanythingawesome.com +1.804.382.5957
Questions?
Thank You!
© Copyright 2021 by Peter Aiken Slide # 96
+ =
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 1 of 9
https://locallyoptimistic.com/post/data_dictionaries/
Home About Getting Involved LO Events
In Tools Tags Analytics, Business Intelligence, Data Dictionary, Data Engineering June 17, 2019 Michael Kaminsky & Alex J
You Probably Don’t Need a Data Dictionary
While efforts to build a data dictionary are often undertaken out of a zeal for documentation
applaud, in practice data dictionaries and data catalogs end up being a large maintenance
value, and tend to very quickly become out of date.
Instead of investing in building out traditional data dictionaries, we recommend a few differe
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 2 of 9
https://locallyoptimistic.com/post/data_dictionaries/
achieving the same goals in ways that are less burdensome to maintain and better serve th
well.
What is a data dictionary?
In this post we’ll use the term “data dictionary” to describe a range of products that include
and “data catalogs”. In general, we’re referring to documents that contain metadata about
warehouse – most commonly, they have one section per table with one row per column inc
column-name, the column-type, any relevant foreign-keys, and a brief description of the co
will be additional metadata about the source of the table (a source system or potentially a p
A screenshot from an unhelpful data dictionary.
Why are data dictionaries created?
Listen. There’s no activity the two of us like better for enjoying a beautiful spring Saturday t
desk, cracking open a cold Kombucha, and writing some damn documentation. While we d
desire to document-all-the-things, we also believe that it is important to find the right docum
the goals that you want to achieve. People tend to embark on the data dictionary journey w
vague goals in mind. They want to 1) enable more data exploration, 2) codify a single sourc
definitions, and 3) document the provenance of particular pieces of business logic.
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 3 of 9
https://locallyoptimistic.com/post/data_dictionaries/
These are all great goals! However, we do not believe that data dictionaries are the best too
these objectives. We will talk a bit about why we do not believe that data dictionaries, as tra
work very well before diving into alternative ways to achieve those goals.
What goes wrong with data dictionaries?
With your traditional data dictionary, you start small and simple. Maybe just documenting th
tables, or the definitions and business logic behind your top three KPIs.
But new use cases are always coming up, new ways of looking at data and measuring succ
to standardize) – and so the list grows.
If your KPIs aren’t just pulling from a single table or a single system, now you need to start
different sources join together in a wild mess of foreign key relationships. Maybe you even c
pictures speak a thousand words after all.
And everything is great…until you run into the issue of different audiences:
Non-technical business users: Just want the data, or at most, a high-level summary
they’re important.
Business analysts: Want the data, but also the business logic behind it, and probably
populate the source and/or the relevant context.
Data scientists / Engineers: Just want the column-level metadata, relationships, and
raw data correctly – “I can write my own SQL/Python, thanks.”
Which means that realistically, to solve everyone’s use case, you’ll need to write three versi
one, very large version) for any given piece of data. Not fun.
But even if you’re willing to go down that path, eventually something will change. Maybe yo
Conversion just shifted a little bit (as we all know, business logic can be extremely slippery)
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 4 of 9
https://locallyoptimistic.com/post/data_dictionaries/
have to update not only your data model and dashboards + reports, but also 3 documents
The maintenance burden very quickly grows into a scalability issue, and a serious bloc
iteration speed of the entire data team. Not only does maintaining documentation slow you
documentation starts to slip (as it inevitably will) the consumers of the data dictionary will b
you will be investing time in maintaining this document that people do not even trust (and th
As a result, the heroes who start down the path of the perfect data dictionary often end up
1. Is hard to maintain
2. Is hard to use, and doesn’t answer everyone’s questions
3. Is generally out-of-date (in one way or another)
4. Ends up getting deprecated
Fortunately, we have better ways of achieving the same goals with less maintenance burde
What are the other options?
We believe the best way to approach the problem is by using the right tools for the right job
solutions for users at each level of the data literacy curve.
For the non-technical business user:
They’re really looking for an actionable, easy-to-use, single source of truth. Data – signed, s
our opinion, the best solution involves showing rather than telling: a well-architected and w
general, we believe that non-technical business users should not be writing SQL and theref
dictionary as traditionally constituted.
We do not want to go into a full-fledged comparison between BI platforms here, but whatev
here are the key features you’ll want to keep an eye out for:
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 5 of 9
https://locallyoptimistic.com/post/data_dictionaries/
1. Single Source of Truth
Data used to drive business decisions comes from a single system/platform
Business logic is centrally defined, curated, and easily reproducible
KPIs are clearly defined, well-organized, and shared across the organization
2. Ad-hoc Exploration
High-level KPIs are rarely actionable – so allowing users to take the next step in s
discovering, and drilling down into the data is critical
3. Right Level of Abstraction
Naively exposing every piece of data will create a difficult, confusing, and intimida
Develop a curated, properly labeled, easy-to-navigate dataset (with complex joins
abstracted away from the end user)
4. Balance between governance and democratization
Do you allow people to save their own reports/dashboards? Define their own dim
transformations?
5. Self Documenting Code
In a good BI tool, it should be easy for business users to find existing reports/dashboards t
modify (or leverage as a starting point for additional exploration). The tool should provide g
avoid making common mistakes, and provide enough metadata (either through tool-tips or
business users both find what they are looking for and avoid common pitfalls. The differenc
and a data dictionary is that the tool tips live with the data as they are actually used by bus
the post-business-logic description of the entity which is generally much more useful to end
If your BI tool has these qualities, then your business users will not need (or wouldn’t get an
dictionary. Since the BI tool should help business users answer business questions, your bu
have to know anything about the underlying data structures in order to achieve their goals.
users asking for a data dictionary, you should be thinking about building them better tools r
documenting the current suboptimal system.
Depending on the size of your team, the structure of your organization, and the complexity
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 6 of 9
https://locallyoptimistic.com/post/data_dictionaries/
may need to augment your BI tool with some additional tooling in the form of a wiki or know
below). For your non-technical users this is best used as a reference for “who is the expert”
to if they want to get started in a new domain area.
For the business / data analyst: For the people in your organization who are rolling up the
the data, there should be two sources of documentation that are most relevant:
A self-documenting and searchable code-base
A knowledge-base or wiki of high-level descriptions
The first place an analyst should look when trying to understand a table or column is in the
and manages that table or column. If, for example, you are using Looker as your BI tool on
analysts should be able to quickly and easily search those code bases to find the correct se
code base is well-architected and you follow best practices for programming style and read
straightforward for the analyst to review and comprehend the relative logic. If your analyst c
comprehend the logic, you should consider refactoring your code base or implementing mo
practices.
For the types of knowledge that are not easy to see in the code itself, it can be helpful to ha
that people can review for high-level information about a given concept. This documentatio
catalog of “what is in the data warehouse” but rather should include information about the
behind broad concepts in the business:
What is (NPS, ARR, Conversion, insert your concept here), how do we define it, how is
business logic), why is it important, and what are the next steps?
Important context and history (“this is the story of why it takes 5 joins to connect our h
coupon code system”).
Any potential “gotchas” (“we know that this logic is incorrect for some number of user
checked it impacted less than 1% of users and so we ignored it”).
Between these two sources of information, analysts should have everything they need to un
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 7 of 9
https://locallyoptimistic.com/post/data_dictionaries/
of the data they have.
For the data scientist / engineer:
For the most technical folks on your team, you can add some additional tooling that can he
that will be most useful to them. In addition to the self-documenting code-base (which is
concept in this blog post), you can think about adding:
Knowledge-repo: This tool is used to keep track of analyses that have happened in the
resource for both analysts and data scientists to see 1) example uses of certain data s
are related to the topic they are working on. Seeing how someone else used a given ta
for someone to understand its intricacies.
Programmatically-generated ERDs: rather than maintaining these types of documenta
programmatically-generated ERDs or data-heritage DAGs (from a tool like dbt or airflo
ensure that your documentation actually matches your code.
Architecture Design Documentation: this is often an artifact of the development proces
depth description of requirements, technical decisions made, architecture, etc.
Other important things to keep in mind:
You need a strong data organization to maintain these different avenues of documenta
You need a data-driven culture of cross-functional collaboration to make all of the piec
What are data dictionaries useful for?
There are two use-cases where having something like a data dictionary can be valuable:
1. If you are a data-as-a-service (DaaS) provider, then your data dictionary is effectively d
will be critical to your clients
2. For compliance / security reasons, you have complex restrictions on who can access
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 8 of 9
https://locallyoptimistic.com/post/data_dictionaries/
!
warehouse
These two use-cases are pretty niche, and we do not believe they apply to most internal-fa
second use-case is a bit tricky because what you want to build is not a “data dictionary” as
“helpful” metadata about tables and columns) but rather an access-spec which can hierarc
different parts of a warehouse (at the schema, table, column, and row level). We have some
build an effective tool for this, so if you’re working on something like that, please reach out!
Conclusion
Overall, we think that data dictionaries tend not to be very useful, and a data team’s time w
maintaining a well-organized code-base that is self-documenting and providing well-design
that do not require much documentation. Where possible, teams should make use of tools
generating documentation directly from code in order to prevent concept drift.
Michael Kaminsky & Alex Jia
Check out the Author pages for Alex and Michael
Related Posts:
" # $
Previous Post:
KPI Principles
%
%
8/10/21, 3:31 PM
You Probably Don't Need a Data Dictionary - Locally Optimistic
Page 9 of 9
https://locallyoptimistic.com/post/data_dictionaries/
The Attribution Dilemma Adding Context to Your Analysis with
Annotations
The Data S

Weitere ähnliche Inhalte

Was ist angesagt?

Data-Ed: Show Me the Money: The Business Value of Data and ROI
Data-Ed: Show Me the Money: The Business Value of Data and ROIData-Ed: Show Me the Money: The Business Value of Data and ROI
Data-Ed: Show Me the Money: The Business Value of Data and ROIData Blueprint
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllDATAVERSITY
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata StrategiesDATAVERSITY
 
DataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDATAVERSITY
 
DataEd Slides: Getting Data Quality Right – Success Stories
DataEd Slides: Getting Data Quality Right – Success StoriesDataEd Slides: Getting Data Quality Right – Success Stories
DataEd Slides: Getting Data Quality Right – Success StoriesDATAVERSITY
 
Data Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityData Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityDATAVERSITY
 
DataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDATAVERSITY
 
DataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDATAVERSITY
 
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)DATAVERSITY
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMDATAVERSITY
 
DataEd Slides: Data Strategy Best Practices
DataEd Slides:  Data Strategy Best PracticesDataEd Slides:  Data Strategy Best Practices
DataEd Slides: Data Strategy Best PracticesDATAVERSITY
 
Helping HR to Cross the Big Data Chasm
Helping HR to Cross the Big Data ChasmHelping HR to Cross the Big Data Chasm
Helping HR to Cross the Big Data ChasmDATAVERSITY
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Blueprint
 
Data governance, Information security strategy
Data governance, Information security strategyData governance, Information security strategy
Data governance, Information security strategyvasanthi4ever
 
The Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 SuccessThe Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 SuccessDATAVERSITY
 
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...DATAVERSITY
 
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is Invaluable
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is InvaluableDataEd Slides: Data Strategy – Plans Are Useless but Planning Is Invaluable
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is InvaluableDATAVERSITY
 
DataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDATAVERSITY
 
Building a Team for Managing Unstructured Data
Building a Team for Managing Unstructured DataBuilding a Team for Managing Unstructured Data
Building a Team for Managing Unstructured DataDATAVERSITY
 

Was ist angesagt? (20)

Data-Ed: Show Me the Money: The Business Value of Data and ROI
Data-Ed: Show Me the Money: The Business Value of Data and ROIData-Ed: Show Me the Money: The Business Value of Data and ROI
Data-Ed: Show Me the Money: The Business Value of Data and ROI
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
DataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business Outcomes
 
DataEd Slides: Getting Data Quality Right – Success Stories
DataEd Slides: Getting Data Quality Right – Success StoriesDataEd Slides: Getting Data Quality Right – Success Stories
DataEd Slides: Getting Data Quality Right – Success Stories
 
Data Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityData Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great Accountability
 
DataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data Strategy
 
DataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data Sins
 
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
DataEd Slides: Data Strategy Best Practices
DataEd Slides:  Data Strategy Best PracticesDataEd Slides:  Data Strategy Best Practices
DataEd Slides: Data Strategy Best Practices
 
Helping HR to Cross the Big Data Chasm
Helping HR to Cross the Big Data ChasmHelping HR to Cross the Big Data Chasm
Helping HR to Cross the Big Data Chasm
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing
 
Data governance, Information security strategy
Data governance, Information security strategyData governance, Information security strategy
Data governance, Information security strategy
 
The Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 SuccessThe Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 Success
 
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
 
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is Invaluable
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is InvaluableDataEd Slides: Data Strategy – Plans Are Useless but Planning Is Invaluable
DataEd Slides: Data Strategy – Plans Are Useless but Planning Is Invaluable
 
DataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data SinsDataEd Slides: Exorcising the Seven Deadly Data Sins
DataEd Slides: Exorcising the Seven Deadly Data Sins
 
Building a Team for Managing Unstructured Data
Building a Team for Managing Unstructured DataBuilding a Team for Managing Unstructured Data
Building a Team for Managing Unstructured Data
 

Ähnlich wie Getting (Re)Started with Data Stewardship

DataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = InteroperabilityDataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = InteroperabilityDATAVERSITY
 
Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data QualityDATAVERSITY
 
Data Management vs. Data Governance Program
Data Management vs. Data Governance ProgramData Management vs. Data Governance Program
Data Management vs. Data Governance ProgramDATAVERSITY
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDATAVERSITY
 
Necessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessNecessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessDATAVERSITY
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDATAVERSITY
 
Key Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramKey Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramDATAVERSITY
 
DataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data StewardshipDataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data StewardshipDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
DataEd Slides: Getting Started with Data Stewardship
DataEd Slides:  Getting Started with Data StewardshipDataEd Slides:  Getting Started with Data Stewardship
DataEd Slides: Getting Started with Data StewardshipDATAVERSITY
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality RightDATAVERSITY
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data StewardshipDATAVERSITY
 
DataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDATAVERSITY
 
Where Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideWhere Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideDATAVERSITY
 
Data-Ed Webinar: Monetizing Data Management - Show Me the Money
Data-Ed Webinar: Monetizing Data Management - Show Me the MoneyData-Ed Webinar: Monetizing Data Management - Show Me the Money
Data-Ed Webinar: Monetizing Data Management - Show Me the MoneyDATAVERSITY
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation FundamentalsDATAVERSITY
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesDATAVERSITY
 
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...DATAVERSITY
 
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management Purgatory
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management PurgatoryData-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management Purgatory
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management PurgatoryDATAVERSITY
 
Why Data Modeling Is Fundamental
Why Data Modeling Is FundamentalWhy Data Modeling Is Fundamental
Why Data Modeling Is FundamentalDATAVERSITY
 

Ähnlich wie Getting (Re)Started with Data Stewardship (20)

DataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = InteroperabilityDataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = Interoperability
 
Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data Quality
 
Data Management vs. Data Governance Program
Data Management vs. Data Governance ProgramData Management vs. Data Governance Program
Data Management vs. Data Governance Program
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best Practices
 
Necessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessNecessary Prerequisites to Data Success
Necessary Prerequisites to Data Success
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance Strategically
 
Key Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramKey Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance Program
 
DataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data StewardshipDataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data Stewardship
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
DataEd Slides: Getting Started with Data Stewardship
DataEd Slides:  Getting Started with Data StewardshipDataEd Slides:  Getting Started with Data Stewardship
DataEd Slides: Getting Started with Data Stewardship
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data Stewardship
 
DataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance Programs
 
Where Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideWhere Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance Collide
 
Data-Ed Webinar: Monetizing Data Management - Show Me the Money
Data-Ed Webinar: Monetizing Data Management - Show Me the MoneyData-Ed Webinar: Monetizing Data Management - Show Me the Money
Data-Ed Webinar: Monetizing Data Management - Show Me the Money
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation Fundamentals
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...
DataEd Webinar: Implementing Successful Data Strategies - Developing Organiza...
 
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management Purgatory
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management PurgatoryData-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management Purgatory
Data-Ed Webinar: The Seven Deadly Data Sins - Emerging from Management Purgatory
 
Why Data Modeling Is Fundamental
Why Data Modeling Is FundamentalWhy Data Modeling Is Fundamental
Why Data Modeling Is Fundamental
 

Mehr von DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Kürzlich hochgeladen

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx9to5mart
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 

Kürzlich hochgeladen (20)

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 

Getting (Re)Started with Data Stewardship

  • 1. Getting (Re)Started with Data Stewardship © Copyright 2021 by Peter Aiken Slide # 1 paiken@plusanythingawesome.com+1.804.382.5957 Peter Aiken, PhD Integrating the role of Data Debt fighting Peter Aiken, Ph.D. • I've been doing this a long time • My work is recognized as useful • Associate Professor of IS (vcu.edu) • Institute for Defense Analyses (ida.org) • DAMA International (dama.org) • MIT CDO Society (iscdo.org) • Anything Awesome (plusanythingawesome.com) • Experienced w/ 500+ data management practices worldwide • Multi-year immersions – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart … • 12 books and dozens of articles © Copyright 2021 by Peter Aiken Slide # 2 https://plusanythingawesome.com + • DAMA International President 2009-2013/2018/2020 • DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd • DAMA International Community Award 2005
  • 2. 3 Program © Copyright 2021 by Peter Aiken Slide # • Why do we need data stewards? – Definitions: Data steward/stewardship, data debit – Architectural context (cannot just clean up) – The role of strategy • How are stewards supposed to eliminate data debit while facing compliance/regulation/initiatives? – Resolve prerequisite challenges – Relationship with governance/organization – Stewardship role (in context of data governance) – Fire station model (Reactive & proactive foci) • When (Organizational improvement) – Differing cadence (Need for different structural approach) – Foundational prerequisites – Need for simplicity, agility, practice • Take aways ➜ Q&A Getting (Re)Started with Data Stewardship Integrating the role of Data Debit fighting © Copyright 2021 by Peter Aiken Slide # 4 https://plusanythingawesome.com Question? • How many starting versus how many re-starting?
  • 3. How old is your profession? © Copyright 2021 by Peter Aiken Slide # 5 https://plusanythingawesome.com Augusta Ada King Countess of Lovelace (1815-52) • 8,000+ years • formalize practices • GAAP What do we teach knowledge workers about data? © Copyright 2021 by Peter Aiken Slide # 6 https://plusanythingawesome.com What percentage of the deal with it daily?
  • 4. What do we teach IT professionals about data? © Copyright 2021 by Peter Aiken Slide # 7 https://plusanythingawesome.com • 1 course – How to build a new database • What impressions do IT professionals get from this education? – Data is a technical skill that is needed when developing new databases Confusion • IT thinks data is a business problem – "If they can connect to the server, then my job is done!" • The business thinks IT is managing data adequately – "Who else would be taking care of it?" © Copyright 2021 by Peter Aiken Slide # 8 https://plusanythingawesome.com
  • 5. Bad Data Decisions/Illiteracy Spiral © Copyright 2021 by Peter Aiken Slide # 9 https://plusanythingawesome.com Bad data decisions Technical decision makers are not data literate Business decision makers are not data literate Poor organizational outcomes Poor treatment of organizational data assets Poor quality data © Copyright 2021 by Peter Aiken Slide # 10 https://plusanythingawesome.com ?
  • 6. Definitions Steward • 1. a person who looks after the passengers on a ship, aircraft, or train and brings them meals. – synonyms: flight attendant, cabin attendant, air hostess, purser "an air steward" – a person responsible for supplies of food to a college, club, or other institution. • 2. an official appointed to supervise arrangements or keep order at a large public event, for ex. sporting event. – synonyms: official, marshal, organizer "the race stewards" – short for shop steward. • 3. a person employed to manage another's property, especially a large house or estate. – synonyms: (estate) manager, agent, overseer, custodian, caretaker; historical "the steward of the estate" – a person whose responsibility it is to take care of something."farmers pride themselves on being stewards of the countryside" Stewarding • 1. (of an official) supervise arrangements or keep order at (a large public event). "the event was organized and stewarded properly" • 2. manage or look after (another's property). Data Steward • Manage data assets on behalf of all stake holders and in the best interests of the organization (McGilvray, 2008) • Represent the interests of all stakeholders and take an enterprise perspective • Have dedicated time enough to be accountable and responsible Trust • Firm belief in the reliability, truth, ability, or strength of someone or something (google.com) Fiduciary • Involving trust, especially with regard to the relationship between a trustee and a beneficiary (google.com) © Copyright 2021 by Peter Aiken Slide # 11 https://plusanythingawesome.com one who actively directs the use of organizational data assets in support of specific mission objectives Steward • one who actively directs © Copyright 2021 by Peter Aiken Slide # 12 https://plusanythingawesome.com , Data
  • 7. Data Steward © Copyright 2021 by Peter Aiken Slide # 13 https://plusanythingawesome.com • What do data stewards do in our organization? - Improve the organization's data assets value, and - Advocate/evangelize for increasing the scope/rigor of data-centric practices - Ensure efficient/effective data management practices - Improve data's use achieving organizational objectives Source: Unknown © Copyright 2021 by Peter Aiken Slide # 14 https://plusanythingawesome.com Data Management Body of Knowledge (DM BoK V2) Practice Areas from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
  • 8. Data Debt • The time and effort it will take to return your data to a governed state from its likely current state of ungoverned • Getting back to zero – Involves undoing existing stuff – Likely new skills are required • At zero-must start from scratch – Typically requires annual proof of value • Now you need to get good at both – Almost all data challenges involve interoperability – Little guidance at optimizing data management practices – Very little guidance as to how to get back to zero © Copyright 2021 by Peter Aiken Slide # 15 https://plusanythingawesome.com Poor data manifests as multifaceted organizational challenges © Copyright 2021 by Peter Aiken Slide # 16 https://plusanythingawesome.com
  • 9. Root cause analysis is part of data stewardship © Copyright 2021 by Peter Aiken Slide # 17 https://plusanythingawesome.com IT System Business Challenge Business Process Business Challenge IT Process Business Challenge Business System Business Challenge IT Process Business Challenge IT System Business Challenge Business Process Business Challenge Poor results Consistency Encourages Quality Analysis © Copyright 2021 by Peter Aiken Slide # 18 https://plusanythingawesome.com IT System Business Challenge Business Process Business Challenge IT Process Business Challenge Business System Business Challenge IT Process Business Challenge IT System Business Challenge Business Process Business Challenge Eliminating data debt requires a team with specialized skills deployed to create a repeatable process and develop sustained organizational skillsets
  • 10. Governance and Architecture © Copyright 2021 by Peter Aiken Slide # 19 https://plusanythingawesome.com Example from: https://www.slideshare.net/AnthonyDehnashi/architecture-governance https://plusanythingawesome.com Corporate Governance • "Corporate governance - which can be defined narrowly as the relationship of a company to its shareholders or, more broadly, as its relationship to society….", Financial Times, 1997. • "Corporate governance is about promoting corporate fairness, transparency and accountability" James Wolfensohn, World Bank, President Financial Times, June 1999. • “Corporate governance deals with the ways in which suppliers of finance to corporations assure themselves of getting a return on their investment”, The Journal of Finance, Shleifer and Vishny, 1997. © Copyright 2021 by Peter Aiken Slide # 20 https://plusanythingawesome.com • "Corporate governance - which can be defined narrowly as the relationship of a company to its shareholders or, more broadly, as its relationship to society….", Financial Times, 1997. • "Corporate governance is about promoting corporate fairness, transparency and accountability" James Wolfensohn, World Bank, President Financial Times, June 1999. • “Corporate governance deals with the ways in which suppliers of finance to corporations assure themselves of getting a return on their investment”, The Journal of Finance, Shleifer and Vishny, 1997.
  • 11. © Copyright 2021 by Peter Aiken Slide # 21 https://plusanythingawesome.com https://plusanythingawesome.com • "Putting structure around how organizations align IT strategy with business strategy, ensuring that companies stay on track to achieve their strategies and goals, and implementing good ways to measure IT’s performance. • It makes sure that all stakeholders’ interests are taken into account and that processes provide measurable results. • Framework should answer some key questions, such as how the IT department is functioning overall, what key metrics management needs and what return IT is giving back to the business from the investment it’s making." CIO Magazine (May 2007) IT Governance Institute, 5 areas of focus: • Strategic Alignment • Value Delivery • Resource Management • Risk Management • Performance Measures • "Putting structure around how organizations align IT strategy with business strategy, ensuring that companies stay on track to achieve their strategies and goals, and implementing good ways to measure IT’s performance. • It makes sure that all stakeholders’ interests are taken into account and that processes provide measurable results. • Framework should answer some key questions, such as how the IT department is functioning overall, what key metrics management needs and what return IT is giving back to the business from the investment it’s making." CIO Magazine (May 2007) IT Governance Institute, 5 areas of focus: • Strategic Alignment • Value Delivery • Resource Management • Risk Management • Performance Measures IT Governance © Copyright 2021 by Peter Aiken Slide # 22 https://plusanythingawesome.com
  • 12. © Copyright 2021 by Peter Aiken Slide # 23 https://plusanythingawesome.com https://plusanythingawesome.com Elevator Pitch 7 Data Governance Definitions 24 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • The formal orchestration of people, process, and technology to enable an organization to leverage data as an enterprise asset – The MDM Institute • A convergence of data quality, data management, business process management, and risk management surrounding the handling of data in an organization – Wikipedia • A system of decision rights and accountabilities for information-related processes, executed according to agreed-upon models which describe who can take what actions with what information, and when, under what circumstances, using what methods – Data Governance Institute • The execution and enforcement of authority over the management of data assets and the performance of data functions – KiK Consulting • A quality control discipline for assessing, managing, using, improving, monitoring, maintaining, and protecting organizational information – IBM Data Governance Council • Data governance is the formulation of policy to optimize, secure, and leverage information as an enterprise asset by aligning multiple functional objectives – Sunil Soares • The exercise of authority and control over the management of data assets – DM BoK
  • 13. What is Data Governance? © Copyright 2021 by Peter Aiken Slide # 25 https://plusanythingawesome.com Managing Data with Guidance Would you want your sole, non- depletable, non- degrading, durable, strategic asset managed without guidance? https://plusanythingawesome.com What is Data Governance? © Copyright 2021 by Peter Aiken Slide # 26 https://plusanythingawesome.com Managing Data Decisions with Guidance Would you want your sole, non- depletable, non- degrading, durable, strategic asset managed without guidance? https://plusanythingawesome.com
  • 14. Architecture • Things – (components) data structures • The functions of the things – (individually) sources and uses of data • How the things interact – (as a system, towards a goal) Efficiencies/effectiveness © Copyright 2021 by Peter Aiken Slide # 27 https://plusanythingawesome.com Organizational Architectures • Amazon – Traditional structure • Google – Team of 3 • Facebook – Do you really have a structure? • Microsoft – Eliminate their own products • Apple – Everything revolves around one individual • Oracle – Buys one company after another © Copyright 2021 by Peter Aiken Slide # 28 https://plusanythingawesome.com
  • 15. Data/Information Architectures – Useful Definition • Common vocabulary expressing integrated requirements ensuring that data assets are stored, arranged, managed, and used in systems in support of organizational strategy [Aiken 2010] © Copyright 2021 by Peter Aiken Slide # 29 https://plusanythingawesome.com • A specific definition – 'Understanding an architecture' – Documented and articulated as a (digital) blueprint illustrating the commonalities and interconnections among the architectural components – Ideally the understanding is shared by systems and humans Understanding = Interoperability © Copyright 2021 by Peter Aiken Slide # 30 https://plusanythingawesome.com
  • 16. Data Architectures: here, whether you like it or not © Copyright 2021 by Peter Aiken Slide # 31 https://plusanythingawesome.com deviantart.com • All organizations have data architectures – Some are better understood and documented (and therefore more useful to the organization) than others Typically Managed Organizational Architectures • Process Architecture – Arrangement of inputs -> transformations = value -> outputs – Typical elements: Functions, activities, workflow, events, cycles, products, procedures • Systems Architecture – Applications, software components, interfaces, projects • Business Architecture – Goals, strategies, roles, organizational structure, location(s) • Security Architecture – Arrangement of security controls in relation to IT Architecture • Technical Architecture/Tarchitecture – Relation of software capabilities/technology stack – Typical elements: Networks, hardware, software platforms, standards/protocols • Data/Information Architecture – Arrangement of data assets supporting organizational strategy – Typical elements: specifications expressed as entities, relationships, attributes, definitions, values, vocabularies © Copyright 2021 by Peter Aiken Slide # 32 https://plusanythingawesome.com
  • 17. The Role of Strategy © Copyright 2021 by Peter Aiken Slide # 33 https://plusanythingawesome.com Example from: https://slideplayer.com/slide/5082003/ https://plusanythingawesome.com What is Strategy? • Current use derived from military - a pattern in a stream of decisions [Henry Mintzberg] © Copyright 2021 by Peter Aiken Slide # 34 https://plusanythingawesome.com A thing
  • 18. Every Day Low Price Former Walmart Business Strategy © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com 35 Wayne Gretzky’s Definition of Strategy © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com He skates to where he thinks the puck will be ... 36
  • 19. Strategy in Action: Napoleon faces a larger enemy • Question? – How do I defeat the competition when their forces are bigger than mine? • Answer: – Divide and conquer! – “a pattern in a stream of decisions” © Copyright 2021 by Peter Aiken Slide # 37 https://plusanythingawesome.com Supply Line Metadata (as part of a divide and conquer strategy) © Copyright 2021 by Peter Aiken Slide # 38 https://plusanythingawesome.com
  • 20. First Divide © Copyright 2021 by Peter Aiken Slide # 39 https://plusanythingawesome.com Then Conquer © Copyright 2021 by Peter Aiken Slide # 40 https://plusanythingawesome.com
  • 21. Complex Strategy • First – Hit both armies hard at just the right spot • Then – Turn right and defeat the Prussians • Then – Turn left and defeat the British © Copyright 2021 by Peter Aiken Slide # Whilesomeoneisshootingatyou! 41 https://plusanythingawesome.com Data Governance Environment Complex Data Governance Environment Complex Data Governance Council • Liaise between agency operations & CDO • Advise CDO on technology, policy, and governance strategies • Administer data governance policies set by the board • implement data sharing & analytics projects • Review open data assets • Report progress & compliance to the Board Advise CDO © Copyright 2021 by Peter Aiken Slide # 42 https://plusanythingawesome.com Data sharing and analytics • Identify goals and objectives • Prioritize initiatives • Study & report • Recommend changes to budget and code DATA COMMISSION LEGISLATIVE Region representatives EXECUTIVE JUDICIAL Advise Governor TRUSTEE execute • Define, approve, and communicate data strategies, policies, standards, rules, guidelines, & best practices • Provide a governance, policy, and technology framework • Define agency data governance responsibilities • Encourage & facilitate data sharing • Facilitate coordination to prevent duplication • Coordinate policy and technology proposals and recommendations • Administer and manage the commonwealth data trust • Track and enforce compliance and conformance • Oversee dissemination of open data Executive data board Oversee council • Translate commonwealth goals to agency performance targets • provide resources • Remove organizational obstacles • appoint data governance council members • Oversee the data governance council • Oversee data sharing & analytics projects
  • 22. Strategy Guides Workgroup Activities © Copyright 2021 by Peter Aiken Slide # A pattern in a stream of decisions 43 https://plusanythingawesome.com Strategic Approach - Master/Reference Data • Reference – Controls accessible data values • Master – Controls access to system capabilities • Transaction – Instances of values © Copyright 2021 by Peter Aiken Slide # 44 https://plusanythingawesome.com Countries where we do business? Types of accounts available? Controlled vocabulary items Are you a member of our premium club? Authorizing uses/users? Common/standard data structures $5 Authorized Like Example from: Dr. Christopher Bradley of DMAdvisors–he has more, ping him at chris.bradley@dmadvisors.co.uk
  • 23. Data Strategy provides focus for stewardship efforts Note: Reducing ROT increases data leverage © Copyright 2021 by Peter Aiken Slide # 45 https://plusanythingawesome.com Organizational Data Data Stewards Technologies Process People Less Data ROT -> Data Stewardship Considerations • Why? – Stewardship terminology is not widely known – We do not have agreed upon definitions – It has become a de-facto widely used, ill defined term – Stewards work effectively with architectural components – Strategy focuses steward leveraging activities © Copyright 2021 by Peter Aiken Slide # 46 https://plusanythingawesome.com http://williamnava.com/philosophy-shaves-barber-21/
  • 24. © Copyright 2021 by Peter Aiken Slide # 47 https://plusanythingawesome.com All inputs are data All outputs are data All inputs are data All inputs are data All inputs are data All outputs are data All outputs are data All outputs are data All inputs are data All outputs are data All inputs are data All outputs are data All inputs are data All outputs are data The Data Machine How to determine what to manage formally? Too much requires expensive and slow bureaucracy ← → Too little misses opportunities Interoperability is the primary value determinant 48 Program © Copyright 2021 by Peter Aiken Slide # • Why do we need data stewards? – Definitions: Data steward/stewardship, data debit – Architectural context (cannot just clean up) – The role of strategy • How are stewards supposed to eliminate data debit while facing compliance/regulation/initiatives? – Resolve prerequisite challenges – Relationship with governance/organization – Stewardship role (in context of data governance) – Fire station model (Reactive & proactive foci) • When (Organizational improvement) – Differing cadence (Need for different structural approach) – Foundational prerequisites – Need for simplicity, agility, practice • Take aways ➜ Q&A Getting (Re)Started with Data Stewardship Integrating the role of Data Debit fighting
  • 25. • Overly dependent upon: – Human-beings – Wetwear – Knowledge workers – Informal communications – Often described as the weakest link Data / Information Gap Information © Copyright 2021 by Peter Aiken Slide # 49 https://plusanythingawesome.com Data © Copyright 2021 by Peter Aiken Slide # • Have little idea what data they have • Do not know where it is (and) • Do not know what their knowledge workers do with it Put simply, organizations: 50 https://plusanythingawesome.com
  • 26. Data Knowledge is too little and too informal • Data stewardship happens 'pretty well' at the workgroup level – Defining characteristic of a workgroup (Workgroups get work done!) – Without guidance, what are the chances that all workgroups are pulling toward the same objectives? – Consider the time spent attempting informal practices • Data chaff becomes sand in the machinery – Preventing smooth interoperation and exchanges – Death by 1,000 cuts that have been difficult to account for • Organizations and individuals lack – Knowledge – Skills – Maturity © Copyright 2021 by Peter Aiken Slide # 51 https://plusanythingawesome.com Separating the Wheat from the Chaff © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com 52 https://plusanythingawesome.com Is well organized data worth more?
  • 27. Pre-Information Age Metadata • Examples of information architecture achievements that happened well before the information age: – Page numbering – Alphabetical order – Table of contents – Indexes – Lexicons – Maps – Diagrams © Copyright 2021 by Peter Aiken Slide # 53 https://plusanythingawesome.com Example from: How to make sense of any mess by Abby Covert (2014) ISBN: 1500615994 "While we can arrange things with the intent to communicate certain information, we can't actually make information. Our users do that for us." https://www.youtube.com/watch?v=60oD1TDzAXQ&feature=emb_logo https://www.youtube.com/watch?v=r10Sod44rME&t=1s https://www.youtube.com/watch?v=XD2OkDPAl6s https://plusanythingawesome.com https://plusanythingawesome.com Remove the structure and things fall apart rapidly • Better organized data increases in value © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com 54 https://plusanythingawesome.com https://plusanythingawesome.com
  • 28. Separating the Wheat from the Chaff • Data that is better organized increases in value • Poor data management practices are costing organizations money/time/effort • 80% of organizational data is ROT – Redundant – Obsolete – Trivial • The question is which data to eliminate? – Most enterprise data is never analyzed © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com 55 https://plusanythingawesome.com Data Assets Win! • Today, data is the most powerful, yet underutilized and poorly managed organizational asset • Data is your – Sole – Non-depletable – Non-degrading – Durable – Strategic • Asset – Data is the new oil! – Data is the new (s)oil! – Data is the new bacon! • As such, data deserves: – It's own strategy – Attention on par with similar organizational assets – Professional ministration to make up for past neglect © Copyright 2021 by Peter Aiken Slide # 2020 American Airlines market value ~ $6b AAdvantage valued between $19.5-$31.5 United market value ~ 9$b MileagePlus ~ $22b https://www.forbes.com/sites/advisor/2020/07/15/how-airlines-make-billions-from-monetizing-frequent-flyer-programs/?sh=66da87a614e9 Data Assets Financial Assets Real Estate Assets Inventory Assets Non- depletable Available for subsequent use Can be used up Can be used up Non- degrading √ √ Can degrade over time Can degrade over time Durable Non-taxed √ √ Strategic Asset √ √ √ √ 56 https://plusanythingawesome.com Asset: A resource controlled by the organization as a result of past events or transactions and from which future economic benefits are expected to flow [Wikipedia] Data Assets Win!
  • 29. Data Governance Frameworks • A system of ideas for guiding analyses • A means of organizing project data • Priorities for data decision making • A means of assessing progress – Don’t put up walls until foundation inspection is passed – Put the roof on ASAP • Make it all dependent upon continued funding © Copyright 2021 by Peter Aiken Slide # 57 https://plusanythingawesome.com A Framework For Stewardship © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com A Framework For Stewardship from https://www.trainingjournal.com/articles/feature/stewardship
  • 30. Organizational Data Challenges Stewardship Engine Regulation and Policy A Framework for Data Stewardship © Copyright 2021 by Peter Aiken Slide # 59 https://plusanythingawesome.com Monetary Proactive Reactive Stewardship Activities Address Some Other Time Strategic Consideration Non-monetary Value © Copyright 2021 by Peter Aiken Slide # 60 https://plusanythingawesome.com https://plusanythingawesome.com
  • 31. Data and Duct Tape © Copyright 2021 by Peter Aiken Slide # 61 https://plusanythingawesome.com https://plusanythingawesome.com © Copyright 2021 by Peter Aiken Slide # 62 https://plusanythingawesome.com https://plusanythingawesome.com
  • 32. Leadership (data decision makers) Domain expertise is less ← | → Domain expertise is greater Roles more formally defined ← | → Roles less formally defined Encounter governed data more directly ← | → Encounter governed data less directly More time is dedicated ← | → Less time is dedicated IT/Systems Development Stewards (data trustees) Guidance Decisions Participants/Experts (data subject matter experts) Other Sources/Uses (data makers & consumers) IT/Systems Development Data/feedback Changes Action R e s o u r c e s Ideas Data/Feedback Components comprising the data community © Copyright 2021 by Peter Aiken Slide # 63 https://plusanythingawesome.com Leadership (data decision makers) Stewards (data trustees) Guidance Decisions Participants/Experts (data subject matter experts) Other Sources/Uses (data makers & consumers) Data/feedback Changes Action R e s o u r c e s Ideas Data/Feedback Basic Version © Copyright 2021 by Peter Aiken Slide # 64 https://plusanythingawesome.com
  • 33. ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ ≈ Data Governance Role: Produce systemic organizational changes that impact data and work practices over time © Copyright 2021 by Peter Aiken Slide # 65 https://plusanythingawesome.com Data Leadership Feedback Data Governance Data Improvement Data Stewards Data Community Participants Data Generators/Data Users Data Things Happen Organizational Things Happen DIPs Data Improves Over Time Data Improves As A Result of Focus Feedback X $ X $ X $ X $ X $ X $ X $ X $ X $ $ Data Strategy in Context © Copyright 2021 by Peter Aiken Slide # 66 https://plusanythingawesome.com Organizational Strategy Data Strategy IT Projects Organizational Operations Data Governance Data asset support for organizational strategy What the data assets need to do to support strategy How well data is supporting strategy Operational feedback How IT supports strategy Other aspects of organizational strategy
  • 34. Data Governance & Data Stewards © Copyright 2021 by Peter Aiken Slide # 67 https://plusanythingawesome.com Data Strategy Data Governance What the data assets do to support strategy How well the data strategy is working (Business Goals) (Metadata) Data Stewards What is the most effective use of steward investments? (Metadata) Progress, plans, problems Getting Started with Data Stewardship How? • Transform tribal knowledge-based processes to data asset leveraging • Understand stewards transform governance into by strategy focused action • Apply a framework to your tasks • Understand and get good at both reactive and proactive activities • Attempt to incorporate leadership outside of traditional channels • Know that you cannot accomplish everything © Copyright 2021 by Peter Aiken Slide # 68 https://plusanythingawesome.com https://hatrabbits.com/en/how-how-diagram/ https://plusanythingawesome.com
  • 35. Keep the proper focus • Wrong question: – How should we mange this data? • Right question: – Should we include this data item within the scope of our current stewardship practices? • Either way document why! © Copyright 2021 by Peter Aiken Slide # 69 https://plusanythingawesome.com https://plusanythingawesome.com 70 Program © Copyright 2021 by Peter Aiken Slide # • Why do we need data stewards? – Definitions: Data steward/stewardship, data debit – Architectural context (cannot just clean up) – The role of strategy • How are stewards supposed to eliminate data debit while facing compliance/regulation/initiatives? – Resolve prerequisite challenges – Relationship with governance/organization – Stewardship role (in context of data governance) – Fire station model (Reactive & proactive foci) • When (Organizational improvement) – Differing cadence (Need for different structural approach) – Foundational prerequisites – Need for simplicity, agility, practice • Take aways ➜ Q&A Getting (Re)Started with Data Stewardship Integrating the role of Data Debit fighting
  • 36. Data Footprints • SQL Server – 47,000,000,000,000 bytes – Largest table 34 billion records 3.5 TBs • Informix – 1,800,000,000 queries/day – 65,000,000 tables / 517,000 databases • Teradata – 117 billion records – 23 TBs for one table • DB2 – 29,838,518,078 daily queries © Copyright 2021 by Peter Aiken Slide # 71 https://plusanythingawesome.com Standard data Data supply Data literacy Making a Better Data Governance Sandwich © Copyright 2021 by Peter Aiken Slide # Data literacy Standard data Data supply 72 https://plusanythingawesome.com
  • 37. Making a Better Data Governance Sandwich © Copyright 2021 by Peter Aiken Slide # Standard data Data supply Data literacy 73 https://plusanythingawesome.com Making a Better Data Sandwich © Copyright 2021 by Peter Aiken Slide # Standard data Data supply Data literacy This cannot happen without data engineering and architecture! Quality data engineering/ architecture work products do not happen accidentally! 74 https://plusanythingawesome.com
  • 38. • Durable asset - An asset that has a usable life more than one year • Reasonable project deliverables - 90 day increments - Data evolution is measured in years • Data - Evolves - it is not created - Significantly more stable • Readymade data architectural components - Prerequisite to agile development • Only alternative is to create additional data siloes! Data is not a Project © Copyright 2021 by Peter Aiken Slide # 75 https://plusanythingawesome.com Differences between Programs and Projects • Programs are Ongoing, Projects End – Managing a program involves long term strategic planning and continuous process improvement is not required of a project • Programs are Tied to the Financial Calendar – Program managers are often responsible for delivering results tied to the organization's financial calendar • Program Management is Governance Intensive – Programs are governed by a senior board that provides direction, oversight, and control while projects tend to be less governance-intensive • Programs Have Greater Scope of Financial Management – Projects typically have a straight-forward budget and project financial management is focused on spending to budget while program planning, management and control is significantly more complex • Program Change Management is an Executive Leadership Capability – Projects employ a formal change management process while at the program level, change management requires executive leadership skills and program change is driven more by an organization's strategy and is subject to market conditions and changing business goals © Copyright 2021 by Peter Aiken Slide # Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management 76 https://plusanythingawesome.com Your data program must last at least as long as your HR program!
  • 39. © Copyright 2021 by Peter Aiken Slide # 77 https://plusanythingawesome.com “Your Organization is all about Data, until it’s not about just Data” What Business are you in? • A management paradigm that views any manageable system as being limited in achieving more of its goals by a small number of constraints • There is always at least one constraint, and TOC uses a focusing process to identify the constraint and restructure the rest of the organization to address it • TOC adopts the common idiom "a chain is no stronger than its weakest link," processes, organizations, etc., are vulnerable because the weakest component can damage or break them or at least adversely affect the outcome © Copyright 2021 by Peter Aiken Slide # 78 https://plusanythingawesome.com https://en.wikipedia.org/wiki/Theory_of_constraints (TOC) https://plusanythingawesome.com
  • 40. Organizational Data Usage Practices © Copyright 2021 by Peter Aiken Slide # 79 https://plusanythingawesome.com Data Management Practices Duplicated but ETLed Data (quality & transformations applied) "Warehoused" Data Learning/ Feedback Marts Analytics Practices T1 Organizations without a formalized data stewards T3 Data Steward: Use data to create strategic opportunities T4 Data Steward: both Improve Operations Innovation The focus of data stewards should be sequenced © Copyright 2021 by Peter Aiken Slide # 80 https://plusanythingawesome.com Only 1 is 10 organizations has a board approved data strategy! T2 Data Steward: Increase organizational efficiencies/ effectiveness X X
  • 41. Data Strategy in Context – THIS IS WRONG! © Copyright 2021 by Peter Aiken Slide # Organizational Strategy IT Strategy Data Strategy x 81 https://plusanythingawesome.com Organizational Strategy IT Strategy This is correct … © Copyright 2021 by Peter Aiken Slide # Data Strategy https://plusanythingawesome.com https://plusanythingawesome.com
  • 42. KISS (Keep it simple, stupid!) © Copyright 2021 by Peter Aiken Slide # 83 https://plusanythingawesome.com Data – Driven? Centric? Focused? First? … the Data Doctrine® (V2) © Copyright 2021 by Peter Aiken Slide # 84 https://plusanythingawesome.com Source: theagiledoctrine.org We are uncovering better ways of developing IT systems by doing it and helping others do it. Through this work we have come to value: data programs driving IT programs informed information investing over technology acquisition activities stable, shared organizational data over IT component evolution data reuse over the acquisition of new data sources
  • 43. the Data Doctrine® (V2) We are uncovering better ways of developing IT systems by doing it and helping others do it. Through this work we have come to value: data programs driving IT programs informed information investing over technology acquisition activities stable, shared organizational data over IT component evolution data reuse over the acquisition of new data sources © Copyright 2021 by Peter Aiken Slide # 85 https://plusanythingawesome.com That is, while there is value in the items on the right, we value the items on the left more. Source: theagiledoctrine.org Data Decisions Variety © Copyright 2021 by Peter Aiken Slide # 86 https://plusanythingawesome.com https://www.healthcatalyst.com/why-are-data-stewards-so-important-for-healthcare Data Source Steward
  • 44. Data Steward • Business data steward – Manage from the perspective of business elements (i.e. business definitions and data quality) • Technical data steward – Focus on the use of data by systems and models (i.e. code operation) • Project data steward – Gather definitions, quality rules and issues for referral to business/technical stewards • Domain data steward – Manage data/metadata required across multiple business areas (i.e. customer data) • Operational data steward – Directly input data or instruct those who do; aid business stewards identifying root cause and addressing issues • Metadata Data Steward – Manage metadata as an asset • Legacy Data Steward – Manage legacy data as an asset • Data steward auditor – Ensures compliance with data guidance • Data steward manager – Planning, organizing, leading and controlling © Copyright 2021 by Peter Aiken Slide # 87 https://plusanythingawesome.com (list adapted from Plotkin, 2014) Strategy Example 1 © Copyright 2021 by Peter Aiken Slide # Good Guys (Us) Bad Guys (Them) 88 https://plusanythingawesome.com
  • 45. Strategy Example 2 © Copyright 2021 by Peter Aiken Slide # Good Guys (Us) Bad Guys (Them) 89 https://plusanythingawesome.com Strategy Example 3 © Copyright 2021 by Peter Aiken Slide # Good Guys (Us) Bad Guys (Them) 90 https://plusanythingawesome.com
  • 46. • Why do we need data stewards? – Definitions: Data steward/stewardship, data debt – Architectural context (cannot just clean up) – The role of strategy • How are stewards supposed to eliminate data debt while facing compliance/regulation/initiatives? – Resolve prerequisite challenges – Relationship with governance/organization – Stewardship role (in context of data governance) – Fire station model (Reactive & proactive foci) • When (Organizational improvement) – Differing cadence (Need for different structural approach) – Foundational prerequisites – Need for simplicity, agility, practice • Take aways ➜ Q&A © Copyright 2021 by Peter Aiken Slide # 91 https://plusanythingawesome.com Program Getting (Re)Started with Data Stewardship Integrating the role of Data Debt fighting Take Aways © Copyright 2021 by Peter Aiken Slide # 92 https://plusanythingawesome.com • Since stewards own the results, they control the remediation process • Need for professional data stewards is increasing – Increase in data volume – Lack of practice improvement • Data stewardship is a relatively new discipline – Must conform to organizational constraints – No one best way • Data stewards must be driven by a data strategy complimenting organizational strategy • Data stewards direct data management efforts • The language of data stewards is metadata • Process improvement can improve data stewardship and organizational practices
  • 47. • Data Governance: How to Design, Deploy, and Sustain an Effective Data Governance Program • John Ladley • Amazon Best Sellers Rank: #641,937 in Books (See Top 100 in Books) – #242 in Management Information Systems – #209 in Library Management – #380 in Database Storage & Design © Copyright 2021 by Peter Aiken Slide # 93 https://plusanythingawesome.com Upcoming Events Approaching Data Quality–Engineering Success Stories 14 September 2021 Essential Metadata Strategies 12 October 2021 Necessary Prerequisites to Data Success: Exorcising the Seven Deadly Data Sins 9 November 2021 © Copyright 2021 by Peter Aiken Slide # 94 https://plusanythingawesome.com Brought to you by: Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD Note: In this .pdf, clicking any webinar title opens the registration link
  • 48. Event Pricing © Copyright 2021 by Peter Aiken Slide # 95 https://plusanythingawesome.com • 20% off directly from the publisher on select titles • My Book Store @ http://plusanythingawesome.com • Enter the code "anythingawesome" at the Technics bookstore checkout where it says to "Apply Coupon" peter@plusanythingawesome.com +1.804.382.5957 Questions? Thank You! © Copyright 2021 by Peter Aiken Slide # 96 + =
  • 49. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 1 of 9 https://locallyoptimistic.com/post/data_dictionaries/ Home About Getting Involved LO Events In Tools Tags Analytics, Business Intelligence, Data Dictionary, Data Engineering June 17, 2019 Michael Kaminsky & Alex J You Probably Don’t Need a Data Dictionary While efforts to build a data dictionary are often undertaken out of a zeal for documentation applaud, in practice data dictionaries and data catalogs end up being a large maintenance value, and tend to very quickly become out of date. Instead of investing in building out traditional data dictionaries, we recommend a few differe
  • 50. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 2 of 9 https://locallyoptimistic.com/post/data_dictionaries/ achieving the same goals in ways that are less burdensome to maintain and better serve th well. What is a data dictionary? In this post we’ll use the term “data dictionary” to describe a range of products that include and “data catalogs”. In general, we’re referring to documents that contain metadata about warehouse – most commonly, they have one section per table with one row per column inc column-name, the column-type, any relevant foreign-keys, and a brief description of the co will be additional metadata about the source of the table (a source system or potentially a p A screenshot from an unhelpful data dictionary. Why are data dictionaries created? Listen. There’s no activity the two of us like better for enjoying a beautiful spring Saturday t desk, cracking open a cold Kombucha, and writing some damn documentation. While we d desire to document-all-the-things, we also believe that it is important to find the right docum the goals that you want to achieve. People tend to embark on the data dictionary journey w vague goals in mind. They want to 1) enable more data exploration, 2) codify a single sourc definitions, and 3) document the provenance of particular pieces of business logic.
  • 51. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 3 of 9 https://locallyoptimistic.com/post/data_dictionaries/ These are all great goals! However, we do not believe that data dictionaries are the best too these objectives. We will talk a bit about why we do not believe that data dictionaries, as tra work very well before diving into alternative ways to achieve those goals. What goes wrong with data dictionaries? With your traditional data dictionary, you start small and simple. Maybe just documenting th tables, or the definitions and business logic behind your top three KPIs. But new use cases are always coming up, new ways of looking at data and measuring succ to standardize) – and so the list grows. If your KPIs aren’t just pulling from a single table or a single system, now you need to start different sources join together in a wild mess of foreign key relationships. Maybe you even c pictures speak a thousand words after all. And everything is great…until you run into the issue of different audiences: Non-technical business users: Just want the data, or at most, a high-level summary they’re important. Business analysts: Want the data, but also the business logic behind it, and probably populate the source and/or the relevant context. Data scientists / Engineers: Just want the column-level metadata, relationships, and raw data correctly – “I can write my own SQL/Python, thanks.” Which means that realistically, to solve everyone’s use case, you’ll need to write three versi one, very large version) for any given piece of data. Not fun. But even if you’re willing to go down that path, eventually something will change. Maybe yo Conversion just shifted a little bit (as we all know, business logic can be extremely slippery)
  • 52. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 4 of 9 https://locallyoptimistic.com/post/data_dictionaries/ have to update not only your data model and dashboards + reports, but also 3 documents The maintenance burden very quickly grows into a scalability issue, and a serious bloc iteration speed of the entire data team. Not only does maintaining documentation slow you documentation starts to slip (as it inevitably will) the consumers of the data dictionary will b you will be investing time in maintaining this document that people do not even trust (and th As a result, the heroes who start down the path of the perfect data dictionary often end up 1. Is hard to maintain 2. Is hard to use, and doesn’t answer everyone’s questions 3. Is generally out-of-date (in one way or another) 4. Ends up getting deprecated Fortunately, we have better ways of achieving the same goals with less maintenance burde What are the other options? We believe the best way to approach the problem is by using the right tools for the right job solutions for users at each level of the data literacy curve. For the non-technical business user: They’re really looking for an actionable, easy-to-use, single source of truth. Data – signed, s our opinion, the best solution involves showing rather than telling: a well-architected and w general, we believe that non-technical business users should not be writing SQL and theref dictionary as traditionally constituted. We do not want to go into a full-fledged comparison between BI platforms here, but whatev here are the key features you’ll want to keep an eye out for:
  • 53. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 5 of 9 https://locallyoptimistic.com/post/data_dictionaries/ 1. Single Source of Truth Data used to drive business decisions comes from a single system/platform Business logic is centrally defined, curated, and easily reproducible KPIs are clearly defined, well-organized, and shared across the organization 2. Ad-hoc Exploration High-level KPIs are rarely actionable – so allowing users to take the next step in s discovering, and drilling down into the data is critical 3. Right Level of Abstraction Naively exposing every piece of data will create a difficult, confusing, and intimida Develop a curated, properly labeled, easy-to-navigate dataset (with complex joins abstracted away from the end user) 4. Balance between governance and democratization Do you allow people to save their own reports/dashboards? Define their own dim transformations? 5. Self Documenting Code In a good BI tool, it should be easy for business users to find existing reports/dashboards t modify (or leverage as a starting point for additional exploration). The tool should provide g avoid making common mistakes, and provide enough metadata (either through tool-tips or business users both find what they are looking for and avoid common pitfalls. The differenc and a data dictionary is that the tool tips live with the data as they are actually used by bus the post-business-logic description of the entity which is generally much more useful to end If your BI tool has these qualities, then your business users will not need (or wouldn’t get an dictionary. Since the BI tool should help business users answer business questions, your bu have to know anything about the underlying data structures in order to achieve their goals. users asking for a data dictionary, you should be thinking about building them better tools r documenting the current suboptimal system. Depending on the size of your team, the structure of your organization, and the complexity
  • 54. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 6 of 9 https://locallyoptimistic.com/post/data_dictionaries/ may need to augment your BI tool with some additional tooling in the form of a wiki or know below). For your non-technical users this is best used as a reference for “who is the expert” to if they want to get started in a new domain area. For the business / data analyst: For the people in your organization who are rolling up the the data, there should be two sources of documentation that are most relevant: A self-documenting and searchable code-base A knowledge-base or wiki of high-level descriptions The first place an analyst should look when trying to understand a table or column is in the and manages that table or column. If, for example, you are using Looker as your BI tool on analysts should be able to quickly and easily search those code bases to find the correct se code base is well-architected and you follow best practices for programming style and read straightforward for the analyst to review and comprehend the relative logic. If your analyst c comprehend the logic, you should consider refactoring your code base or implementing mo practices. For the types of knowledge that are not easy to see in the code itself, it can be helpful to ha that people can review for high-level information about a given concept. This documentatio catalog of “what is in the data warehouse” but rather should include information about the behind broad concepts in the business: What is (NPS, ARR, Conversion, insert your concept here), how do we define it, how is business logic), why is it important, and what are the next steps? Important context and history (“this is the story of why it takes 5 joins to connect our h coupon code system”). Any potential “gotchas” (“we know that this logic is incorrect for some number of user checked it impacted less than 1% of users and so we ignored it”). Between these two sources of information, analysts should have everything they need to un
  • 55. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 7 of 9 https://locallyoptimistic.com/post/data_dictionaries/ of the data they have. For the data scientist / engineer: For the most technical folks on your team, you can add some additional tooling that can he that will be most useful to them. In addition to the self-documenting code-base (which is concept in this blog post), you can think about adding: Knowledge-repo: This tool is used to keep track of analyses that have happened in the resource for both analysts and data scientists to see 1) example uses of certain data s are related to the topic they are working on. Seeing how someone else used a given ta for someone to understand its intricacies. Programmatically-generated ERDs: rather than maintaining these types of documenta programmatically-generated ERDs or data-heritage DAGs (from a tool like dbt or airflo ensure that your documentation actually matches your code. Architecture Design Documentation: this is often an artifact of the development proces depth description of requirements, technical decisions made, architecture, etc. Other important things to keep in mind: You need a strong data organization to maintain these different avenues of documenta You need a data-driven culture of cross-functional collaboration to make all of the piec What are data dictionaries useful for? There are two use-cases where having something like a data dictionary can be valuable: 1. If you are a data-as-a-service (DaaS) provider, then your data dictionary is effectively d will be critical to your clients 2. For compliance / security reasons, you have complex restrictions on who can access
  • 56. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 8 of 9 https://locallyoptimistic.com/post/data_dictionaries/ ! warehouse These two use-cases are pretty niche, and we do not believe they apply to most internal-fa second use-case is a bit tricky because what you want to build is not a “data dictionary” as “helpful” metadata about tables and columns) but rather an access-spec which can hierarc different parts of a warehouse (at the schema, table, column, and row level). We have some build an effective tool for this, so if you’re working on something like that, please reach out! Conclusion Overall, we think that data dictionaries tend not to be very useful, and a data team’s time w maintaining a well-organized code-base that is self-documenting and providing well-design that do not require much documentation. Where possible, teams should make use of tools generating documentation directly from code in order to prevent concept drift. Michael Kaminsky & Alex Jia Check out the Author pages for Alex and Michael Related Posts: " # $ Previous Post: KPI Principles % %
  • 57. 8/10/21, 3:31 PM You Probably Don't Need a Data Dictionary - Locally Optimistic Page 9 of 9 https://locallyoptimistic.com/post/data_dictionaries/ The Attribution Dilemma Adding Context to Your Analysis with Annotations The Data S