SlideShare ist ein Scribd-Unternehmen logo
1 von 56
Downloaden Sie, um offline zu lesen
Copyright 2020 by Data Blueprint Slide # 1Peter Aiken, PhD
(Unlocking Business Value)
Peter Aiken, PhD
Leveraging
Data
Management
Technologies
• DAMA International President 2009-2013 / 2018
• DAMA International Achievement Award 2001
(with Dr. E. F. "Ted" Codd
• DAMA International Community Award 2005
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Founder, Data Blueprint (datablueprint.com)
• DAMA International (dama.org)
• CDO Society (iscdo.org)
• 11 books and dozens of articles
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart … PETER AIKEN WITH JUANITA BILLINGS
FOREWORD BY JOHN BOTTEGA
MONETIZING
DATA MANAGEMENT
Unlocking the Value in Your Organization’s
Most Important Asset.
2Copyright 2020 by Data Blueprint Slide #
Peter Aiken, Ph.D.
Trifacta Overview
© 2020 Trifacta | Proprietary and Confidential 1
Contact Info | Trifacta.com
© 2020 Trifacta | Proprietary and Confidential
USE CASES
Data Onboarding
Data Science/ML
Reporting & Analytics
DATA PLATFORMS
Databases
Log Files
Spreadsheets
IoT Sensors
Apps
“It’s impossible to overstress this: 80% of the work in any data project is in cleaning the data.”
— DJ Patil, Former Chief Data Scientist of
the United States
DATA PIPELINE
Discovering
Structuring
CleansingEnriching
Validating
The 80% Problem Is Well Understood
Solving the 80% Problem Requires Aligning IT and Business
2/26/20© 2020 Trifacta | Proprietary and Confidential3
IT
Scale | Security | Governance
BUSINESS
Self-Service | Speed | Cost
Weeks
Months
Years...
Trifacta Enables the Business without Sacrificing IT Requirements
2/26/204
IT
Scale | Security | Governance
BUSINESS
Self-Service | Speed | Cost
Self-Service | Modern Stack | Efficient
© 2020 Trifacta | Proprietary and Confidential
© 2020 Trifacta | Proprietary and Confidential5 2/26/20
https://www.trifacta.com/blog/introducing-data-school/
Thank You
Contact Info | Trifacta.com
© 2020 Trifacta | Proprietary and Confidential 7
Copyright 2020 by Data Blueprint Slide # 1Peter Aiken, PhD
(Unlocking Business Value)
Peter Aiken, PhD
Leveraging
Data
Management
Technologies
• DAMA International President 2009-2013 / 2018
• DAMA International Achievement Award 2001
(with Dr. E. F. "Ted" Codd
• DAMA International Community Award 2005
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Founder, Data Blueprint (datablueprint.com)
• DAMA International (dama.org)
• CDO Society (iscdo.org)
• 11 books and dozens of articles
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart … PETER AIKEN WITH JUANITA BILLINGS
FOREWORD BY JOHN BOTTEGA
MONETIZING
DATA MANAGEMENT
Unlocking the Value in Your Organization’s
Most Important Asset.
2Copyright 2020 by Data Blueprint Slide #
Peter Aiken, Ph.D.
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Blind Persons and the Elephant
4Copyright 2020 by Data Blueprint Slide #
http://www.dailymirror.lk/print/opinion/editorial-we-need-to-become-channels-of-peace/172-27164
It is like a fan!
It is like a snake!
It is like a wall!
It is like a rope!
It is like a tree!
5Copyright 2020 by Data Blueprint Slide #
Unrefined
data management
definition
Sources
Uses
Data Management
6Copyright 2020 by Data Blueprint Slide #
More refined
data management
definition
Sources
ReuseData Management➜ ➜
7Copyright 2020 by Data Blueprint Slide #
Data Governance
Data Assets/Ethical Framework
Sources
➜ Use
➜Reuse
Better still data management definition
➜
Standard data
Data supply
Data literacy
Making a Better Data Sandwich
8Copyright 2020 by Data Blueprint Slide #
Data literacy
Standard data
Data supply
Making a Better Data Sandwich
9Copyright 2020 by Data Blueprint Slide #
Standard data
Data supply
Data literacy
Making a Better Data Sandwich
Quality engineering/
architecture work products
do not happen accidentally!
10Copyright 2020 by Data Blueprint Slide #
Standard data
Data supply
Data literacy
This cannot happen without engineering and architecture!
Technologies by themselves, are a One Legged Stool
11Copyright 2020 by Data Blueprint Slide #
Success Requires a 3-Legged Stool
12Copyright 2020 by Data Blueprint Slide #
People
Process
Technology
13Copyright 2020 by Data Blueprint Slide #
People
Process
Technology
14Copyright 2020 by Data Blueprint Slide #
Supply/demand for data talent
15Copyright 2020 by Data Blueprint Slide #
https://www.logianalytics.com/bi-trends/3-keys-understanding-data/
Growth of Data vs. Growth of Data Analysts
• Stored data accumulating at
28% annual growth rate
• Data analysts in workforce
growing at 5.7% growth rate
16Copyright 2020 by Data Blueprint Slide #
R. Buckminster Fuller
17Copyright 2020 by Data Blueprint Slide #
https://en.wikipedia.org/wiki/Moore%27s_law#/media/File:Moore%27s_Law_Transistor_Count_1971-2016.png
Postpone technology investments
as long as possible
The hardest part of
requirements is not
doing design
Vendor Hype
• CIOs/CDOs feel pressure
• Vendor/project promise auditing
• No understanding of hype curve
18Copyright 2020 by Data Blueprint Slide #
Who wrote this … ?
19Copyright 2020 by Data Blueprint Slide #
• In considering any new subject,
• there is frequently a tendency
first to overrate what we find to
be already interesting or
remarkable, and
• secondly - by a sort of natural
reaction - to undervalue the true
state of the case.
– Lady Augusta Ada King,
(1815 – 1852)
Countess of Lovelace
– (aka) Ada Lovelace,
daughter of Lord Byron
– Publisher of the first
computing program
20Copyright 2020 by Data Blueprint Slide #
http://www.gartner.com/technology/research/methodologies/hype-cycle.jsp
21Copyright 2020 by Data Blueprint Slide #
Technology Trigger: A potential technology breakthrough kicks things off. Early proof-of-concept stories and media interest
trigger significant publicity. Often no usable products exist and commercial viability is unproven.
Trough of Disillusionment: Interest wanes as experiments and implementations fail to deliver. Producers of the
technology shake out or fail. Investments continue only if the surviving providers improve their products to the
satisfaction of early adopters.
Peak of Inflated Expectations: Early publicity produces a number of
success stories—often accompanied by scores of failures. Some
companies take action; many do not.
Slope of Enlightenment: More instances of how the technology can benefit the
enterprise start to crystallize and become more widely understood. Second- and third-
generation products appear from technology providers. More enterprises fund pilots;
conservative companies remain cautious.
Plateau of Productivity: Mainstream adoption starts to
take off. Criteria for assessing provider viability are more
clearly defined. The technology’s broad market
applicability and relevance are clearly paying off.
Gartner Five-phase Hype Cycle
Hype Cycle for Data Management
22Copyright 2020 by Data Blueprint Slide #
Hype Cycle for Information Governance and Master Data Management
23Copyright 2020 by Data Blueprint Slide #
Hype Cycle for Analytics and Business Intelligence
24Copyright 2020 by Data Blueprint Slide #
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Data Management Technologies
• Managing data technology should follow the same
principles and standards for managing any
technology
• Leading reference model for technology
management is the Information Technology
Infrastructure Library (ITIL):
http://www.itil-officialsite.com/home/home.asp
26Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Understanding Data Technology Requirements
Need to understand:
• How the technology works
• How it provides value in the
context of a particular business
• Requirements of a data technology before determining what
technical solution to choose for a particular situation
Suggested questions:
• What problem does this data technology mean to solve?
• What sets this data technology apart from others?
• Are there specific hardware/software/operating systems/storage/
network/connectivity requirements?
• Does this technology include data security functionality?
27Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Defining Data Technology Architecture
• Data technology is part of the overall technology architecture
• It is also often considered part of the enterprise’s data architecture
• Data technology architecture addresses 3 questions:
– What technologies are
standard/required/preferred/acceptable?
– Which technologies apply to which
purposes and circumstances?
– In a distributed environment, which
technologies exist where, and
how does data move from one node to another?
28Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Definitions
• as opposed to
mobile device
management
• MDM is a discipline
or strategy
– "… where the business
and the IT organization
work together to
ensure the uniformity,
accuracy,
semantic persistence,
stewardship and
accountability
of the enterprise's
official, shared master
data."
• Sold as technology-
based solution
29Copyright 2020 by Data Blueprint Slide #
Master Data Architecture
30Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
• Technology-first
approaches often de-
emphasize the
people and process
components
• Successful MDM also
requires
– Governance/quality
– Process architecture
Tools and Methods Are Required!
31Copyright 2020 by Data Blueprint Slide #
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Computer-aided software engineering (CASE)
is the scientific application of a set of tools and
methods to a software system which is meant
to result in high-quality, defect-free, and
maintainable software products. It also refers
to methods for the development of information
systems together with automated tools that
can be used in the software development
process. Computer Aided Software/Systems Engineering Tools
• Scientific application of a set of tools and methods to a software system which is
meant to result in high-quality, defect free, and maintainable software products
• Refers to methods for the development of information systems together with
automated tools that can be used in the software development process
• CASE functions include analysis, design, and programming
33Copyright 2020 by Data Blueprint Slide #
Source: http://en.wikipedia.org/wiki/
CASE Tools
CASE-based Support
34Copyright 2020 by Data Blueprint Slide #
http://www.visible.com
CASE-based Support
35Copyright 2020 by Data Blueprint Slide #
http://www.visible.com
CASE-based Support
36Copyright 2020 by Data Blueprint Slide #
http://www.visible.com
CASE Tool Evolution
• Microsoft
– Excel
– Powerpoint
– Visio
• ERwin ER/Studio
• Rational Rose
• Open source
– It is never free. Even open-sourced
technology requires care and feeding
• List of CASE Tools
– http://www.unl.csi.cuny.edu/faqs/software-enginering/tools.html
37Copyright 2020 by Data Blueprint Slide #
Typical expense concept
[adapted from Huff "Elements of a Realistic
CASE Tool Adoption Budget" © 1992
Communications of the ACM]
38Copyright 2020 by Data Blueprint Slide #
• Sample budget for
implementing a $2500/seat
CASE technology can be $2.5
million over a 5-year period
$187K =
$2500/seat
× 75 seats
$360K = training
$500K = workstations
$150K= assessment costs
$910K = total initial investment
$150K = in-house support
$ 55K = hardware and software maintenance
$ 60K = ongoing training and misc.
$265K = annual additional investment
× 5 years
$1325K investment over 5 years
$187K =
$2500/seat
× 75 seats
$360K = training
$500K = workstations
$150K= assessment costs
$910K = total initial investment
$150K = in-house support
$ 55K = hardware and software maintenance
$ 60K = ongoing training and misc.
$265K = annual additional investment
× 5 years
$1325K investment over 5 years
$187K =
$2500/seat
× 75 seats
$360K = training
$500K = workstations
$150K= assessment costs
$910K = total initial investment
$150K = in-house support
$ 55K = hardware and software maintenance
$ 60K = ongoing training and misc.
$265K = annual additional investment
× 5 years
$1325K investment over 5 years
39Copyright 2020 by Data Blueprint Slide #
This includes:
• Senders
– flows from the CASE effort that
can inform the re-architecting
effort.
• Receivers
– flows from the project that can
inform the CASE effort.
• Senders and receivers
– some elements, such as
restructuring and reengineering,
are both senders and receivers.
CASE Tool: "Taxonomy"
A variety of
CASE-based
methods and
technologies can
access and
update the
metadata
metadata
Integration
Additional metadata uses
accessible via: web; portal;
XML; RDBMS
Everything must "fit" into one
CASE technology
Changing Model of CASE Tool Usage
40Copyright 2020 by Data Blueprint Slide #
Limited access
from outside
the CASE
technology
environment
CASE
tool-specific
methods
and
technologies
Limited additional
metadata use
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
The Biggest Challenges to Data Management Practice
42Copyright 2020 by Data Blueprint Slide #
One Eighth of the Data Management Spend
43Copyright 2020 by Data Blueprint Slide #
88%
12%
Metadata
• Metadata management
is still a nascent
discipline that only
represents 12% of the
time spent in data
management
Repositories have been difficult to "sell"
21 September 1999
Michael Blechar, Lisa Wallace
Management Summary
Most executive and IS managers view an IT metadata repository as an
esoteric technology that is not directly related to the business.
However, as will be seen, an IT metadata repository can substantially
help IS organizations support the applications, which in turn support
the business. An IT metadata repository is a pre-built system and
reference database where the IS organizations can track and manage
the information about the applications and databases they build and
maintain; think of it as the inventory and change impact reporting
system for IS. These repositories track metadata such as the
descriptions of jobs, programs, modules, screens, data and
databases, and the interrelationships between them. Metadata differs
from the actual data being described. Metadata is information about
data. For example, the metadata descriptions in the repository tell one
that the field "customer number" appears in Databases A, B and F ...
44Copyright 2020 by Data Blueprint Slide #
[From gartner.com]
What tools do you use?
45%
23%
13%
9%
7%
2%
1% 1% 1% 1%
None HomeGrown Other CA Platinum Rochade Universal
Repository
DesignBank DWGuide InfoManager Interface
Metadata
Tool
• Many build their own
Repository Technologies in Use
45Copyright 2020 by Data Blueprint Slide #
Number Responding=181
• Almost 50% doesn't use
• The "traditional" players are low
numbers
Magic Quadrant for Metadata Management Solutions
46Copyright 2020 by Data Blueprint Slide #
https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
IBM's AD/Cycle Information Model
47Copyright 2020 by Data Blueprint Slide #
48Copyright 2020 by Data Blueprint Slide #
https://wiscorp.com/kwf_diagram.html
• "The repository" does not have to be an integrated solution
– it must be an easily integrateable solution
• Repository functionality (does not equal a) repository
– metadata must easily evolve to repository solution
• Multiple repositories are not necessarily bad
– as interim solutions, Excel has been working quite well
• Minimal functionality includes
• ability to create, read, update, delete, and evolve metadata items
• Remember the 1st law of data management
– In order to manage metadata, you need metadata repository functions
49Copyright 2020 by Data Blueprint Slide #
Implementing Metadata Repository Functionality
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Time Spent by Data Management Teams Across Disciplines
51Copyright 2020 by Data Blueprint Slide #
https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
Data Discovery Technologies
• Data analysis software technologies deliver up to 10X
productivity over manual approaches
• Based on a powerful computing technology that allows data
engineers to quickly form candidate hypotheses with respect to
the existing data structures
• Hypotheses are then presented to the SMEs (both business and
technical) who confirm, refine, or deny them
• Allows existing data structures to be inferred at rate that is an
order of magnitude more effective than previous manual
approaches
• Pioneers include Evoke->CSI, Metagenix->Ascential->IBM,
Sypherlink
52Copyright 2020 by Data Blueprint Slide #
Profiling
Discovery
Analysis
How has this been done in the past?
Old
• Manually
• Brute force
• Repository dependent
• Quality indifferent
• Not repeatable
New
• Semi-automated
• Engineered
• Repository
independent
• Integrated quality
• Repeatable
• Currency
• Accuracy
53Copyright 2020 by Data Blueprint Slide #
54Copyright 2020 by Data Blueprint Slide #
Select an Attribute to
get a list of values
Double-click a value to
see rows with that value
Comparing Weekly Progress
Monday
Morning:
Model
preparation
Afternoon:
Model refinement/
validation session
Tuesday
Morning:
Model refinement/
validation session
Afternoon:
Model refinement/
validation session
Wednesday
Morning:
Model
preparation
Afternoon:
Model refinement/
validation session
Thursday
Morning:
Model refinement/
validation session
Afternoon:
Model refinement/
validation session
Friday
Morning:
Model
preparation
Afternoon:
Model refinement/
validation session
Monday
Morning:
Model
preparation
Afternoon:
Model
preparation
Tuesday
Morning:
Model
preparation
Afternoon:
Model refinement/
validation session
Wednesday
Morning:
Model
preparation
Afternoon:
Model
preparation
Thursday
Morning:
Model
preparation
Afternoon:
Model refinement/
validation session
Friday
Morning:
Model
preparation
Afternoon:
Model
preparation
Reactive
Proactive
55Copyright 2020 by Data Blueprint Slide #
Trifacta/Data Wrangling
56Copyright 2020 by Data Blueprint Slide #
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Data Quality Engineering Tools
• 4 categories of activities:
– Analysis
– Cleansing
– Enhancement
– Monitoring
• Principal tools:
– Data Profiling
– Parsing and Standardization
– Data Transformation
– Identity Resolution and Matching
– Enhancement
– Reporting
58Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
DQ Tools
1. Data Profiling
– Need to be able to distinguish between good and bad data before making any
improvements
– Data profiling is a set of algorithms for 2 purposes:
• Statistical analysis and assessment of the data quality values within a data set
• Exploring relationships that exist between value collections within and across data sets
59Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
DQ Tools
2. Parsing & Standardization
– Data parsing tools enable the definition of
patterns that feed into a rules engine used
to distinguish between valid and invalid
data values
– Actions are triggered upon matching a
specific pattern
– When an invalid pattern is recognized, the
application may attempt to transform the
invalid value into one that meets
expectations
60Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
https://www.youtube.com/watch?v=r9UhJxFT5rk
DQ Tool Pattern
61Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
DQ Tools
4. Identify Resolution & Matching
– Basic approaches to matching:
– Deterministic
• Relies on defined patterns and rules for
assigning weights and scores to determine
similarity
– Predictable
• Only as good as anticipations of the rules
developers
– Probabilistic
• Uses statistical techniques to assess
probabilities that pairs of records represent
the same entity
– Not reliant on rules
• Refined based on experience -> matchers
can improve precision as more data is
analyzed
62Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
DQ Tools
5. Enhancement
– A method for adding value to information by accumulating additional information
about a base set of entities and then merging all the sets of information to
provide a focused view
Examples:
– Time/date stamps
– Auditing information
– Contextual information
– Geographic information
– Demographic information
– Psychographic information
63Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
DQ Tools
6. Reporting
• Good reporting supports:
– Inspection and monitoring of conformance to data quality expectations
– Monitoring performance of data stewards conforming to data quality SLAs
– Workflow processing for data quality incidents
– Manual oversight of data cleansing and correction
• Associate report results w/:
– Data quality measurement
– Metrics
– Activity
64Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Data acquisition activities Data usage activitiesData storage
Traditional Data Life Cycle
66Copyright 2020 by Data Blueprint Slide #
67Copyright 2020 by Data Blueprint Slide #
Metadata
Creation
Data
Assessment
MetadataRefinement
DataRefinement
Data
Manipulation
DataCreation
Data
Utilization
Metadata
Structuring
Data Storage
DataLifeCycleModel
Metadata Data
Dimension Focus/Phase: Refinement Creation Structuring Creation Manipulation Refinement Utilization Assessment
Data
Architecture
Quality
Data architecture quality
is the focus of metadata
creation & refinement
efforts.
↵ ↵ ↵
Data Model
Quality
Data model quality is the
focus of metadata
refinement & structuring
efforts
↵ ↵ ↵
Data Value
Quality
Data value quality is the
focus of the data
creation, manipulation,
and refinements phases.
↵ ↵ ↵ ↵
Data
Representation
Quality
Data representation
quality is the focus of
data utilization phase.
↵ ↵
Dimensions Related to Phases
• Data architecture quality is the focus of metadata creation and refinement efforts.
• Data model quality is the focus of metadata structuring efforts
• Data value quality is the focus of the data creation, manipulation, and refinements phases.
• Data architecture and model quality are the focus of metadata refinement efforts.
• Data representation quality is the focus of data utilization and assessment phase.
68Copyright 2020 by Data Blueprint Slide #
Copyright 2020 by Data Blueprint Slide # X
By the end of this session, you should have a better
understanding of data management technologies in
terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
Approaching Data Management Technologies
Other Technologies
Data Integration Definition:
• Pulling together and reconciling dispersed data for analytic
purposes that organizations have maintained in multiple,
heterogeneous systems. Data needs to be accessed and
extracted, moved and loaded, validated and cleaned,
standardized and transformed.
• Other tools include:
– Servers
– EII technologies
– Portals
– Conversion tools
70Copyright 2020 by Data Blueprint Slide #
Source: http://www.information-management.com
Portal Options
71Copyright 2020 by Data Blueprint Slide #
[Adapted from Terry Lanham Designing Innovative Enterprise Portals and Implementing Them Into Your Content Strategies Lockheed
Martin’s Compelling Case Study Web Content II: Leveraging Best-of-Breed Content Strategies - San Francisco, CA 23 January 2001]
Legacy Systems Transformed Into Web-services Accessed Through a Portal
72Copyright 2020 by Data Blueprint Slide #
Organizational Portal
Saturday, April 6, 2019 - All systems operational!
Organizational News
• Organizational Early News • Industry News
• Press Releases • Newsletters
Organizational IT
• Service Desk
• Settings
Email
• 320 new msgs, 14,572 total
• Send quick email
Organizational Essentials
• Knowledge network
• Employee assistance
• IT procurement
• Organizational media design
• Organizational merchandise
Search
Go
Stocks
Full Portfolio
XYZ
YYZ
ZZZ
Market Update
50
29.5
45.25
As of:
Saturday, April 6, 2019
Get Quote
Reporting
Regional
• Northeast
• Northwest
• Southeast
• Southwest
• Midnorth
• Midsouth
State
• Alabama
• Arkansas
• Georgia
• Mississippi
• Vermont
• Virginia
Legacy
Application 1
Legacy
Application 2
Legacy
Application 3
Legacy
Application 4
Legacy
Application 5
Web
Service 1.1
Web
Service 1.2
Web
Service 1.3
Web
Service 2.1
Web
Service 2.2
Web
Service 3.1
Web
Service 3.2
Web
Service 4.1
Web
Service 4.2
Web
Service 5.1
Web
Service 5.2
Web
Service 5.3
73Copyright 2020 by Data Blueprint Slide #
Top Tier Demo
Portals as a Data Quality Tool
74Copyright 2020 by Data Blueprint Slide #
Defining Spaces
• ETL Extract Transform, Load
– delivers aggregated data to a
new database
• EAI Enterprise Application Integration
– connects applications to other applications in a
predictable manner using
pre-established connections
• EII Enterprise Information Integration
– between ETL and EAI - delivers tailored views of
information to users at the time that it is required
75Copyright 2020 by Data Blueprint Slide #
Meta-Matrix Virtual-Integration Example
76Copyright 2020 by Data Blueprint Slide #
Approaching Data Management Technologies
By the end of this session, you should have a better understanding
of data management technologies and their use as part of a people
process & technology 3-legged stool in terms of:
• Technology Considerations
• Data Technology Architecture
• CASE Tools
• Repositories
• Profiling/Discovery Tools
• Data Quality Engineering Tools
• Data Life Cycle
• Other Technologies:
– Servers, EII Technologies, Portals, Conversion Tools
77Copyright 2020 by Data Blueprint Slide #
Gartner Key Findings
• Data assets continue to drive strategic cloud service providers’
offerings
• Machine learning is increasingly popular–key uses:
– Data integration tools,
– Database management systems,
– Data quality tools and
– Metadata management solutions
• Increasing use of cloud for production applications requiring that
database in the cloud
• Organizations applying a combination of data warehouses, data
lakes and data hubs can achieve greater flexibility to support a
range of use cases compared to those applying only one.
78Copyright 2020 by Data Blueprint Slide #
https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
IT Business
Data
Perceived State of Data
79Copyright 2020 by Data Blueprint Slide #
Data
Desired To Be State of Data
80Copyright 2020 by Data Blueprint Slide #
IT Business
The Real State of Data
81Copyright 2020 by Data Blueprint Slide #
Data
IT Business
It isn't possible to go digital
Digital
82Copyright 2020 by Data Blueprint Slide #
aBy just spelling 'data'
Dat
83Copyright 2020 by Data Blueprint Slide #
It requires more work
Data
84Copyright 2020 by Data Blueprint Slide #
a
Lady Ada Augusta King Rule
85Copyright 2020 by Data Blueprint Slide #
https://people.well.com/user/adatoole/bio.htm
Recent Technology Realization
86Copyright 2020 by Data Blueprint Slide #
GarbageIn➜
GarbageOut!Recent
GI➜GO!
87Copyright 2020 by Data Blueprint Slide #
Perfect
Model
Garbage
Data
Garbage
Results
Data
Warehouse
Machine
Learning
Business
Intelligence
Block Chain
AI
MDM
Analytics
Technology
Data
Governance
GI➜GO!
88Copyright 2020 by Data Blueprint Slide #
Perfect
Model
Quality
Data
Garbage
Results
Data
Warehouse
Machine
Learning
Business
Intelligence
Block Chain
AI
MDM
Analytics
Technology
Data
Governance
Quality In ➜ Quality Out!
89Copyright 2020 by Data Blueprint Slide #
Perfect
Model
Quality
Data
Good
Results
Data
Warehouse
Machine
Learning
Business
Intelligence
Block Chain
AI
MDM
Analytics
Technology
Data
Governance
More Data Management Tools
90Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
More Data Management Tools
91Copyright 2020 by Data Blueprint Slide #
from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Gartner Recommendations
Leaders seeking to advance their strategies and deliver effective solutions
must:
• Validate both product capabilities and data availability
• Leverage automation to free up scarce specialist resources
• Update policies and governance
standards to address cloud and
database platform as a service
(dbPaaS) before purchasing
• Favor providers with a clear
roadmap for ML enabling better
business outcomes or service levels
• Don’t expect a single piece of
infrastructure to meet all of your
needs. Plan the core by
assessing use case type,
processing flexibility and
semantic requirements
92Copyright 2020 by Data Blueprint Slide #
https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
+ =
Questions?
93Copyright 2020 by Data Blueprint Slide #
It’s your turn!
Use the chat feature or
Twitter (#dataed) to submit
your questions now!
Upcoming Events
May Webinar
Data Management Best Practices
May 12, 2020 @ 2:00 PM ET
June Webinar
Approaching Data Governance
Strategically
June 9, 2020 @ 2:00 PM ET
Sign up for webinars at:
www.datablueprint.com/webinar-schedule
94Copyright 2020 by Data Blueprint Slide #
Brought to you by:
10124 W. Broad Street, Suite C
Glen Allen, Virginia 23060
804.521.4056
Copyright 2020 by Data Blueprint Slide # 95

Weitere ähnliche Inhalte

Was ist angesagt?

Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata StrategiesDATAVERSITY
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityDATAVERSITY
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DATAVERSITY
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...DATAVERSITY
 
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...DATAVERSITY
 
A Modern Approach to DI & MDM
A Modern Approach to DI & MDMA Modern Approach to DI & MDM
A Modern Approach to DI & MDMDATAVERSITY
 
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...DATAVERSITY
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDATAVERSITY
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
DataEd Slides: Essential Metadata Strategies
DataEd Slides: Essential Metadata StrategiesDataEd Slides: Essential Metadata Strategies
DataEd Slides: Essential Metadata StrategiesDATAVERSITY
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture StrategiesDATAVERSITY
 
Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)DATAVERSITY
 
ADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
ADV Slides: Strategies for Transitioning to a Cloud-First EnterpriseADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
ADV Slides: Strategies for Transitioning to a Cloud-First EnterpriseDATAVERSITY
 
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data DictionaryRWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data DictionaryDATAVERSITY
 
Do-It-Yourself (DIY) Data Governance Framework
Do-It-Yourself (DIY) Data Governance FrameworkDo-It-Yourself (DIY) Data Governance Framework
Do-It-Yourself (DIY) Data Governance FrameworkDATAVERSITY
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceDATAVERSITY
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsDATAVERSITY
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeDATAVERSITY
 

Was ist angesagt? (20)

Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
 
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...
Slides: Powering a Sustainable Data Governance Program – Learnings & Best Pra...
 
A Modern Approach to DI & MDM
A Modern Approach to DI & MDMA Modern Approach to DI & MDM
A Modern Approach to DI & MDM
 
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
DataEd Slides: Essential Metadata Strategies
DataEd Slides: Essential Metadata StrategiesDataEd Slides: Essential Metadata Strategies
DataEd Slides: Essential Metadata Strategies
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
 
Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)Implementing the Data Maturity Model (DMM)
Implementing the Data Maturity Model (DMM)
 
ADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
ADV Slides: Strategies for Transitioning to a Cloud-First EnterpriseADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
ADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
 
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data DictionaryRWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
 
Do-It-Yourself (DIY) Data Governance Framework
Do-It-Yourself (DIY) Data Governance FrameworkDo-It-Yourself (DIY) Data Governance Framework
Do-It-Yourself (DIY) Data Governance Framework
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data Lake
 

Ähnlich wie DataEd Slides: Leveraging Data Management Technologies

DataEd Slides: Approaching Data Management Technologies
DataEd Slides:  Approaching Data Management TechnologiesDataEd Slides:  Approaching Data Management Technologies
DataEd Slides: Approaching Data Management TechnologiesDATAVERSITY
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsDATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDATAVERSITY
 
Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success StoriesDATAVERSITY
 
Necessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessNecessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessDATAVERSITY
 
DataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDATAVERSITY
 
Where Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideWhere Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideDATAVERSITY
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityDATAVERSITY
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation FundamentalsDATAVERSITY
 
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...DATAVERSITY
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality RightDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
DataEd Webinar:  Reference & Master Data Management - Unlocking Business ValueDataEd Webinar:  Reference & Master Data Management - Unlocking Business Value
DataEd Webinar: Reference & Master Data Management - Unlocking Business ValueDATAVERSITY
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data Blueprint
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMDATAVERSITY
 
DataEd Slides: The Seven Deadly Data Sins
DataEd Slides: The Seven Deadly Data SinsDataEd Slides: The Seven Deadly Data Sins
DataEd Slides: The Seven Deadly Data SinsDATAVERSITY
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsDATAVERSITY
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DATAVERSITY
 

Ähnlich wie DataEd Slides: Leveraging Data Management Technologies (20)

DataEd Slides: Approaching Data Management Technologies
DataEd Slides:  Approaching Data Management TechnologiesDataEd Slides:  Approaching Data Management Technologies
DataEd Slides: Approaching Data Management Technologies
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best Practices
 
Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success Stories
 
Necessary Prerequisites to Data Success
Necessary Prerequisites to Data SuccessNecessary Prerequisites to Data Success
Necessary Prerequisites to Data Success
 
DataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data StrategyDataEd Slides: Data Management vs. Data Strategy
DataEd Slides: Data Management vs. Data Strategy
 
Where Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance CollideWhere Data Architecture and Data Governance Collide
Where Data Architecture and Data Governance Collide
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation Fundamentals
 
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...
DataEd Slides: Unlock Business Value Using Reference and Master Data Manageme...
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
DataEd Webinar:  Reference & Master Data Management - Unlocking Business ValueDataEd Webinar:  Reference & Master Data Management - Unlocking Business Value
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
DataEd Slides: The Seven Deadly Data Sins
DataEd Slides: The Seven Deadly Data SinsDataEd Slides: The Seven Deadly Data Sins
DataEd Slides: The Seven Deadly Data Sins
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture Requirements
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
 

Mehr von DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 

Kürzlich hochgeladen

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 

Kürzlich hochgeladen (17)

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 

DataEd Slides: Leveraging Data Management Technologies

  • 1. Copyright 2020 by Data Blueprint Slide # 1Peter Aiken, PhD (Unlocking Business Value) Peter Aiken, PhD Leveraging Data Management Technologies • DAMA International President 2009-2013 / 2018 • DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd • DAMA International Community Award 2005 • I've been doing this a long time • My work is recognized as useful • Associate Professor of IS (vcu.edu) • Founder, Data Blueprint (datablueprint.com) • DAMA International (dama.org) • CDO Society (iscdo.org) • 11 books and dozens of articles • Experienced w/ 500+ data management practices worldwide • Multi-year immersions – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart … PETER AIKEN WITH JUANITA BILLINGS FOREWORD BY JOHN BOTTEGA MONETIZING DATA MANAGEMENT Unlocking the Value in Your Organization’s Most Important Asset. 2Copyright 2020 by Data Blueprint Slide # Peter Aiken, Ph.D.
  • 2. Trifacta Overview © 2020 Trifacta | Proprietary and Confidential 1 Contact Info | Trifacta.com
  • 3. © 2020 Trifacta | Proprietary and Confidential USE CASES Data Onboarding Data Science/ML Reporting & Analytics DATA PLATFORMS Databases Log Files Spreadsheets IoT Sensors Apps “It’s impossible to overstress this: 80% of the work in any data project is in cleaning the data.” — DJ Patil, Former Chief Data Scientist of the United States DATA PIPELINE Discovering Structuring CleansingEnriching Validating The 80% Problem Is Well Understood
  • 4. Solving the 80% Problem Requires Aligning IT and Business 2/26/20© 2020 Trifacta | Proprietary and Confidential3 IT Scale | Security | Governance BUSINESS Self-Service | Speed | Cost Weeks Months Years...
  • 5. Trifacta Enables the Business without Sacrificing IT Requirements 2/26/204 IT Scale | Security | Governance BUSINESS Self-Service | Speed | Cost Self-Service | Modern Stack | Efficient © 2020 Trifacta | Proprietary and Confidential
  • 6. © 2020 Trifacta | Proprietary and Confidential5 2/26/20
  • 8. Thank You Contact Info | Trifacta.com © 2020 Trifacta | Proprietary and Confidential 7
  • 9. Copyright 2020 by Data Blueprint Slide # 1Peter Aiken, PhD (Unlocking Business Value) Peter Aiken, PhD Leveraging Data Management Technologies • DAMA International President 2009-2013 / 2018 • DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd • DAMA International Community Award 2005 • I've been doing this a long time • My work is recognized as useful • Associate Professor of IS (vcu.edu) • Founder, Data Blueprint (datablueprint.com) • DAMA International (dama.org) • CDO Society (iscdo.org) • 11 books and dozens of articles • Experienced w/ 500+ data management practices worldwide • Multi-year immersions – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart … PETER AIKEN WITH JUANITA BILLINGS FOREWORD BY JOHN BOTTEGA MONETIZING DATA MANAGEMENT Unlocking the Value in Your Organization’s Most Important Asset. 2Copyright 2020 by Data Blueprint Slide # Peter Aiken, Ph.D.
  • 10. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies Blind Persons and the Elephant 4Copyright 2020 by Data Blueprint Slide # http://www.dailymirror.lk/print/opinion/editorial-we-need-to-become-channels-of-peace/172-27164 It is like a fan! It is like a snake! It is like a wall! It is like a rope! It is like a tree!
  • 11. 5Copyright 2020 by Data Blueprint Slide # Unrefined data management definition Sources Uses Data Management 6Copyright 2020 by Data Blueprint Slide # More refined data management definition Sources ReuseData Management➜ ➜
  • 12. 7Copyright 2020 by Data Blueprint Slide # Data Governance Data Assets/Ethical Framework Sources ➜ Use ➜Reuse Better still data management definition ➜ Standard data Data supply Data literacy Making a Better Data Sandwich 8Copyright 2020 by Data Blueprint Slide # Data literacy Standard data Data supply
  • 13. Making a Better Data Sandwich 9Copyright 2020 by Data Blueprint Slide # Standard data Data supply Data literacy Making a Better Data Sandwich Quality engineering/ architecture work products do not happen accidentally! 10Copyright 2020 by Data Blueprint Slide # Standard data Data supply Data literacy This cannot happen without engineering and architecture!
  • 14. Technologies by themselves, are a One Legged Stool 11Copyright 2020 by Data Blueprint Slide # Success Requires a 3-Legged Stool 12Copyright 2020 by Data Blueprint Slide # People Process Technology
  • 15. 13Copyright 2020 by Data Blueprint Slide # People Process Technology 14Copyright 2020 by Data Blueprint Slide #
  • 16. Supply/demand for data talent 15Copyright 2020 by Data Blueprint Slide # https://www.logianalytics.com/bi-trends/3-keys-understanding-data/ Growth of Data vs. Growth of Data Analysts • Stored data accumulating at 28% annual growth rate • Data analysts in workforce growing at 5.7% growth rate 16Copyright 2020 by Data Blueprint Slide # R. Buckminster Fuller
  • 17. 17Copyright 2020 by Data Blueprint Slide # https://en.wikipedia.org/wiki/Moore%27s_law#/media/File:Moore%27s_Law_Transistor_Count_1971-2016.png Postpone technology investments as long as possible The hardest part of requirements is not doing design Vendor Hype • CIOs/CDOs feel pressure • Vendor/project promise auditing • No understanding of hype curve 18Copyright 2020 by Data Blueprint Slide #
  • 18. Who wrote this … ? 19Copyright 2020 by Data Blueprint Slide # • In considering any new subject, • there is frequently a tendency first to overrate what we find to be already interesting or remarkable, and • secondly - by a sort of natural reaction - to undervalue the true state of the case. – Lady Augusta Ada King, (1815 – 1852) Countess of Lovelace – (aka) Ada Lovelace, daughter of Lord Byron – Publisher of the first computing program 20Copyright 2020 by Data Blueprint Slide #
  • 19. http://www.gartner.com/technology/research/methodologies/hype-cycle.jsp 21Copyright 2020 by Data Blueprint Slide # Technology Trigger: A potential technology breakthrough kicks things off. Early proof-of-concept stories and media interest trigger significant publicity. Often no usable products exist and commercial viability is unproven. Trough of Disillusionment: Interest wanes as experiments and implementations fail to deliver. Producers of the technology shake out or fail. Investments continue only if the surviving providers improve their products to the satisfaction of early adopters. Peak of Inflated Expectations: Early publicity produces a number of success stories—often accompanied by scores of failures. Some companies take action; many do not. Slope of Enlightenment: More instances of how the technology can benefit the enterprise start to crystallize and become more widely understood. Second- and third- generation products appear from technology providers. More enterprises fund pilots; conservative companies remain cautious. Plateau of Productivity: Mainstream adoption starts to take off. Criteria for assessing provider viability are more clearly defined. The technology’s broad market applicability and relevance are clearly paying off. Gartner Five-phase Hype Cycle Hype Cycle for Data Management 22Copyright 2020 by Data Blueprint Slide #
  • 20. Hype Cycle for Information Governance and Master Data Management 23Copyright 2020 by Data Blueprint Slide # Hype Cycle for Analytics and Business Intelligence 24Copyright 2020 by Data Blueprint Slide #
  • 21. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies Data Management Technologies • Managing data technology should follow the same principles and standards for managing any technology • Leading reference model for technology management is the Information Technology Infrastructure Library (ITIL): http://www.itil-officialsite.com/home/home.asp 26Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 22. Understanding Data Technology Requirements Need to understand: • How the technology works • How it provides value in the context of a particular business • Requirements of a data technology before determining what technical solution to choose for a particular situation Suggested questions: • What problem does this data technology mean to solve? • What sets this data technology apart from others? • Are there specific hardware/software/operating systems/storage/ network/connectivity requirements? • Does this technology include data security functionality? 27Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International Defining Data Technology Architecture • Data technology is part of the overall technology architecture • It is also often considered part of the enterprise’s data architecture • Data technology architecture addresses 3 questions: – What technologies are standard/required/preferred/acceptable? – Which technologies apply to which purposes and circumstances? – In a distributed environment, which technologies exist where, and how does data move from one node to another? 28Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 23. Definitions • as opposed to mobile device management • MDM is a discipline or strategy – "… where the business and the IT organization work together to ensure the uniformity, accuracy, semantic persistence, stewardship and accountability of the enterprise's official, shared master data." • Sold as technology- based solution 29Copyright 2020 by Data Blueprint Slide # Master Data Architecture 30Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International • Technology-first approaches often de- emphasize the people and process components • Successful MDM also requires – Governance/quality – Process architecture
  • 24. Tools and Methods Are Required! 31Copyright 2020 by Data Blueprint Slide # Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies
  • 25. Computer-aided software engineering (CASE) is the scientific application of a set of tools and methods to a software system which is meant to result in high-quality, defect-free, and maintainable software products. It also refers to methods for the development of information systems together with automated tools that can be used in the software development process. Computer Aided Software/Systems Engineering Tools • Scientific application of a set of tools and methods to a software system which is meant to result in high-quality, defect free, and maintainable software products • Refers to methods for the development of information systems together with automated tools that can be used in the software development process • CASE functions include analysis, design, and programming 33Copyright 2020 by Data Blueprint Slide # Source: http://en.wikipedia.org/wiki/ CASE Tools CASE-based Support 34Copyright 2020 by Data Blueprint Slide # http://www.visible.com
  • 26. CASE-based Support 35Copyright 2020 by Data Blueprint Slide # http://www.visible.com CASE-based Support 36Copyright 2020 by Data Blueprint Slide # http://www.visible.com
  • 27. CASE Tool Evolution • Microsoft – Excel – Powerpoint – Visio • ERwin ER/Studio • Rational Rose • Open source – It is never free. Even open-sourced technology requires care and feeding • List of CASE Tools – http://www.unl.csi.cuny.edu/faqs/software-enginering/tools.html 37Copyright 2020 by Data Blueprint Slide # Typical expense concept [adapted from Huff "Elements of a Realistic CASE Tool Adoption Budget" © 1992 Communications of the ACM] 38Copyright 2020 by Data Blueprint Slide # • Sample budget for implementing a $2500/seat CASE technology can be $2.5 million over a 5-year period $187K = $2500/seat × 75 seats $360K = training $500K = workstations $150K= assessment costs $910K = total initial investment $150K = in-house support $ 55K = hardware and software maintenance $ 60K = ongoing training and misc. $265K = annual additional investment × 5 years $1325K investment over 5 years $187K = $2500/seat × 75 seats $360K = training $500K = workstations $150K= assessment costs $910K = total initial investment $150K = in-house support $ 55K = hardware and software maintenance $ 60K = ongoing training and misc. $265K = annual additional investment × 5 years $1325K investment over 5 years $187K = $2500/seat × 75 seats $360K = training $500K = workstations $150K= assessment costs $910K = total initial investment $150K = in-house support $ 55K = hardware and software maintenance $ 60K = ongoing training and misc. $265K = annual additional investment × 5 years $1325K investment over 5 years
  • 28. 39Copyright 2020 by Data Blueprint Slide # This includes: • Senders – flows from the CASE effort that can inform the re-architecting effort. • Receivers – flows from the project that can inform the CASE effort. • Senders and receivers – some elements, such as restructuring and reengineering, are both senders and receivers. CASE Tool: "Taxonomy" A variety of CASE-based methods and technologies can access and update the metadata metadata Integration Additional metadata uses accessible via: web; portal; XML; RDBMS Everything must "fit" into one CASE technology Changing Model of CASE Tool Usage 40Copyright 2020 by Data Blueprint Slide # Limited access from outside the CASE technology environment CASE tool-specific methods and technologies Limited additional metadata use
  • 29. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies The Biggest Challenges to Data Management Practice 42Copyright 2020 by Data Blueprint Slide #
  • 30. One Eighth of the Data Management Spend 43Copyright 2020 by Data Blueprint Slide # 88% 12% Metadata • Metadata management is still a nascent discipline that only represents 12% of the time spent in data management Repositories have been difficult to "sell" 21 September 1999 Michael Blechar, Lisa Wallace Management Summary Most executive and IS managers view an IT metadata repository as an esoteric technology that is not directly related to the business. However, as will be seen, an IT metadata repository can substantially help IS organizations support the applications, which in turn support the business. An IT metadata repository is a pre-built system and reference database where the IS organizations can track and manage the information about the applications and databases they build and maintain; think of it as the inventory and change impact reporting system for IS. These repositories track metadata such as the descriptions of jobs, programs, modules, screens, data and databases, and the interrelationships between them. Metadata differs from the actual data being described. Metadata is information about data. For example, the metadata descriptions in the repository tell one that the field "customer number" appears in Databases A, B and F ... 44Copyright 2020 by Data Blueprint Slide # [From gartner.com]
  • 31. What tools do you use? 45% 23% 13% 9% 7% 2% 1% 1% 1% 1% None HomeGrown Other CA Platinum Rochade Universal Repository DesignBank DWGuide InfoManager Interface Metadata Tool • Many build their own Repository Technologies in Use 45Copyright 2020 by Data Blueprint Slide # Number Responding=181 • Almost 50% doesn't use • The "traditional" players are low numbers Magic Quadrant for Metadata Management Solutions 46Copyright 2020 by Data Blueprint Slide # https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
  • 32. IBM's AD/Cycle Information Model 47Copyright 2020 by Data Blueprint Slide # 48Copyright 2020 by Data Blueprint Slide # https://wiscorp.com/kwf_diagram.html
  • 33. • "The repository" does not have to be an integrated solution – it must be an easily integrateable solution • Repository functionality (does not equal a) repository – metadata must easily evolve to repository solution • Multiple repositories are not necessarily bad – as interim solutions, Excel has been working quite well • Minimal functionality includes • ability to create, read, update, delete, and evolve metadata items • Remember the 1st law of data management – In order to manage metadata, you need metadata repository functions 49Copyright 2020 by Data Blueprint Slide # Implementing Metadata Repository Functionality Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies
  • 34. Time Spent by Data Management Teams Across Disciplines 51Copyright 2020 by Data Blueprint Slide # https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6 Data Discovery Technologies • Data analysis software technologies deliver up to 10X productivity over manual approaches • Based on a powerful computing technology that allows data engineers to quickly form candidate hypotheses with respect to the existing data structures • Hypotheses are then presented to the SMEs (both business and technical) who confirm, refine, or deny them • Allows existing data structures to be inferred at rate that is an order of magnitude more effective than previous manual approaches • Pioneers include Evoke->CSI, Metagenix->Ascential->IBM, Sypherlink 52Copyright 2020 by Data Blueprint Slide # Profiling Discovery Analysis
  • 35. How has this been done in the past? Old • Manually • Brute force • Repository dependent • Quality indifferent • Not repeatable New • Semi-automated • Engineered • Repository independent • Integrated quality • Repeatable • Currency • Accuracy 53Copyright 2020 by Data Blueprint Slide # 54Copyright 2020 by Data Blueprint Slide # Select an Attribute to get a list of values Double-click a value to see rows with that value
  • 36. Comparing Weekly Progress Monday Morning: Model preparation Afternoon: Model refinement/ validation session Tuesday Morning: Model refinement/ validation session Afternoon: Model refinement/ validation session Wednesday Morning: Model preparation Afternoon: Model refinement/ validation session Thursday Morning: Model refinement/ validation session Afternoon: Model refinement/ validation session Friday Morning: Model preparation Afternoon: Model refinement/ validation session Monday Morning: Model preparation Afternoon: Model preparation Tuesday Morning: Model preparation Afternoon: Model refinement/ validation session Wednesday Morning: Model preparation Afternoon: Model preparation Thursday Morning: Model preparation Afternoon: Model refinement/ validation session Friday Morning: Model preparation Afternoon: Model preparation Reactive Proactive 55Copyright 2020 by Data Blueprint Slide # Trifacta/Data Wrangling 56Copyright 2020 by Data Blueprint Slide #
  • 37. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies Data Quality Engineering Tools • 4 categories of activities: – Analysis – Cleansing – Enhancement – Monitoring • Principal tools: – Data Profiling – Parsing and Standardization – Data Transformation – Identity Resolution and Matching – Enhancement – Reporting 58Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 38. DQ Tools 1. Data Profiling – Need to be able to distinguish between good and bad data before making any improvements – Data profiling is a set of algorithms for 2 purposes: • Statistical analysis and assessment of the data quality values within a data set • Exploring relationships that exist between value collections within and across data sets 59Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International DQ Tools 2. Parsing & Standardization – Data parsing tools enable the definition of patterns that feed into a rules engine used to distinguish between valid and invalid data values – Actions are triggered upon matching a specific pattern – When an invalid pattern is recognized, the application may attempt to transform the invalid value into one that meets expectations 60Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International https://www.youtube.com/watch?v=r9UhJxFT5rk
  • 39. DQ Tool Pattern 61Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International DQ Tools 4. Identify Resolution & Matching – Basic approaches to matching: – Deterministic • Relies on defined patterns and rules for assigning weights and scores to determine similarity – Predictable • Only as good as anticipations of the rules developers – Probabilistic • Uses statistical techniques to assess probabilities that pairs of records represent the same entity – Not reliant on rules • Refined based on experience -> matchers can improve precision as more data is analyzed 62Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 40. DQ Tools 5. Enhancement – A method for adding value to information by accumulating additional information about a base set of entities and then merging all the sets of information to provide a focused view Examples: – Time/date stamps – Auditing information – Contextual information – Geographic information – Demographic information – Psychographic information 63Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International DQ Tools 6. Reporting • Good reporting supports: – Inspection and monitoring of conformance to data quality expectations – Monitoring performance of data stewards conforming to data quality SLAs – Workflow processing for data quality incidents – Manual oversight of data cleansing and correction • Associate report results w/: – Data quality measurement – Metrics – Activity 64Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 41. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies Data acquisition activities Data usage activitiesData storage Traditional Data Life Cycle 66Copyright 2020 by Data Blueprint Slide #
  • 42. 67Copyright 2020 by Data Blueprint Slide # Metadata Creation Data Assessment MetadataRefinement DataRefinement Data Manipulation DataCreation Data Utilization Metadata Structuring Data Storage DataLifeCycleModel Metadata Data Dimension Focus/Phase: Refinement Creation Structuring Creation Manipulation Refinement Utilization Assessment Data Architecture Quality Data architecture quality is the focus of metadata creation & refinement efforts. ↵ ↵ ↵ Data Model Quality Data model quality is the focus of metadata refinement & structuring efforts ↵ ↵ ↵ Data Value Quality Data value quality is the focus of the data creation, manipulation, and refinements phases. ↵ ↵ ↵ ↵ Data Representation Quality Data representation quality is the focus of data utilization phase. ↵ ↵ Dimensions Related to Phases • Data architecture quality is the focus of metadata creation and refinement efforts. • Data model quality is the focus of metadata structuring efforts • Data value quality is the focus of the data creation, manipulation, and refinements phases. • Data architecture and model quality are the focus of metadata refinement efforts. • Data representation quality is the focus of data utilization and assessment phase. 68Copyright 2020 by Data Blueprint Slide #
  • 43. Copyright 2020 by Data Blueprint Slide # X By the end of this session, you should have a better understanding of data management technologies in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools Approaching Data Management Technologies Other Technologies Data Integration Definition: • Pulling together and reconciling dispersed data for analytic purposes that organizations have maintained in multiple, heterogeneous systems. Data needs to be accessed and extracted, moved and loaded, validated and cleaned, standardized and transformed. • Other tools include: – Servers – EII technologies – Portals – Conversion tools 70Copyright 2020 by Data Blueprint Slide # Source: http://www.information-management.com
  • 44. Portal Options 71Copyright 2020 by Data Blueprint Slide # [Adapted from Terry Lanham Designing Innovative Enterprise Portals and Implementing Them Into Your Content Strategies Lockheed Martin’s Compelling Case Study Web Content II: Leveraging Best-of-Breed Content Strategies - San Francisco, CA 23 January 2001] Legacy Systems Transformed Into Web-services Accessed Through a Portal 72Copyright 2020 by Data Blueprint Slide # Organizational Portal Saturday, April 6, 2019 - All systems operational! Organizational News • Organizational Early News • Industry News • Press Releases • Newsletters Organizational IT • Service Desk • Settings Email • 320 new msgs, 14,572 total • Send quick email Organizational Essentials • Knowledge network • Employee assistance • IT procurement • Organizational media design • Organizational merchandise Search Go Stocks Full Portfolio XYZ YYZ ZZZ Market Update 50 29.5 45.25 As of: Saturday, April 6, 2019 Get Quote Reporting Regional • Northeast • Northwest • Southeast • Southwest • Midnorth • Midsouth State • Alabama • Arkansas • Georgia • Mississippi • Vermont • Virginia Legacy Application 1 Legacy Application 2 Legacy Application 3 Legacy Application 4 Legacy Application 5 Web Service 1.1 Web Service 1.2 Web Service 1.3 Web Service 2.1 Web Service 2.2 Web Service 3.1 Web Service 3.2 Web Service 4.1 Web Service 4.2 Web Service 5.1 Web Service 5.2 Web Service 5.3
  • 45. 73Copyright 2020 by Data Blueprint Slide # Top Tier Demo Portals as a Data Quality Tool 74Copyright 2020 by Data Blueprint Slide #
  • 46. Defining Spaces • ETL Extract Transform, Load – delivers aggregated data to a new database • EAI Enterprise Application Integration – connects applications to other applications in a predictable manner using pre-established connections • EII Enterprise Information Integration – between ETL and EAI - delivers tailored views of information to users at the time that it is required 75Copyright 2020 by Data Blueprint Slide # Meta-Matrix Virtual-Integration Example 76Copyright 2020 by Data Blueprint Slide #
  • 47. Approaching Data Management Technologies By the end of this session, you should have a better understanding of data management technologies and their use as part of a people process & technology 3-legged stool in terms of: • Technology Considerations • Data Technology Architecture • CASE Tools • Repositories • Profiling/Discovery Tools • Data Quality Engineering Tools • Data Life Cycle • Other Technologies: – Servers, EII Technologies, Portals, Conversion Tools 77Copyright 2020 by Data Blueprint Slide # Gartner Key Findings • Data assets continue to drive strategic cloud service providers’ offerings • Machine learning is increasingly popular–key uses: – Data integration tools, – Database management systems, – Data quality tools and – Metadata management solutions • Increasing use of cloud for production applications requiring that database in the cloud • Organizations applying a combination of data warehouses, data lakes and data hubs can achieve greater flexibility to support a range of use cases compared to those applying only one. 78Copyright 2020 by Data Blueprint Slide # https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
  • 48. IT Business Data Perceived State of Data 79Copyright 2020 by Data Blueprint Slide # Data Desired To Be State of Data 80Copyright 2020 by Data Blueprint Slide # IT Business
  • 49. The Real State of Data 81Copyright 2020 by Data Blueprint Slide # Data IT Business It isn't possible to go digital Digital 82Copyright 2020 by Data Blueprint Slide #
  • 50. aBy just spelling 'data' Dat 83Copyright 2020 by Data Blueprint Slide # It requires more work Data 84Copyright 2020 by Data Blueprint Slide # a
  • 51. Lady Ada Augusta King Rule 85Copyright 2020 by Data Blueprint Slide # https://people.well.com/user/adatoole/bio.htm Recent Technology Realization 86Copyright 2020 by Data Blueprint Slide # GarbageIn➜ GarbageOut!Recent
  • 52. GI➜GO! 87Copyright 2020 by Data Blueprint Slide # Perfect Model Garbage Data Garbage Results Data Warehouse Machine Learning Business Intelligence Block Chain AI MDM Analytics Technology Data Governance GI➜GO! 88Copyright 2020 by Data Blueprint Slide # Perfect Model Quality Data Garbage Results Data Warehouse Machine Learning Business Intelligence Block Chain AI MDM Analytics Technology Data Governance
  • 53. Quality In ➜ Quality Out! 89Copyright 2020 by Data Blueprint Slide # Perfect Model Quality Data Good Results Data Warehouse Machine Learning Business Intelligence Block Chain AI MDM Analytics Technology Data Governance More Data Management Tools 90Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 54. More Data Management Tools 91Copyright 2020 by Data Blueprint Slide # from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International Gartner Recommendations Leaders seeking to advance their strategies and deliver effective solutions must: • Validate both product capabilities and data availability • Leverage automation to free up scarce specialist resources • Update policies and governance standards to address cloud and database platform as a service (dbPaaS) before purchasing • Favor providers with a clear roadmap for ML enabling better business outcomes or service levels • Don’t expect a single piece of infrastructure to meet all of your needs. Plan the core by assessing use case type, processing flexibility and semantic requirements 92Copyright 2020 by Data Blueprint Slide # https://www.gartner.com/document/3894971?ref=solrAll&refval=219836558&qid=de595a5685b6f86db0ec6
  • 55. + = Questions? 93Copyright 2020 by Data Blueprint Slide # It’s your turn! Use the chat feature or Twitter (#dataed) to submit your questions now! Upcoming Events May Webinar Data Management Best Practices May 12, 2020 @ 2:00 PM ET June Webinar Approaching Data Governance Strategically June 9, 2020 @ 2:00 PM ET Sign up for webinars at: www.datablueprint.com/webinar-schedule 94Copyright 2020 by Data Blueprint Slide # Brought to you by:
  • 56. 10124 W. Broad Street, Suite C Glen Allen, Virginia 23060 804.521.4056 Copyright 2020 by Data Blueprint Slide # 95