Modern Data Integration
William McKnight
Jake Freivald
Information Builders
McKnight Consulting Group
Expert Sessions
Copyright © 2018 McKnight Consulting Group, LLC All Rights Reserved Slide 2
Unlock Potential
William McKnight
www.mcknightcg.com
214-514-1444
Modern Data Integration - Expert Sessions
@williammcknight
Data is the Most Important Asset
in the World
• We trade it for services instead of money
• Our information is exploding
• Business is moving to real-time, all the time
• Our information differentiates us from our
competitors
• Information is a key business asset
Corporate Initiatives
 80% of Initiatives That Matter are about DATA
• Budget
• Energy
 80% of Initiatives should be Business-Focused
• ROI
• Resource-Leveled
Data Maturity is Highly Correlated to
Business Success
Data
Maturity
Business
Success
The Money Tree Doesn’t
Exist
Hitch your Architecture and Maturity Efforts to an Application Budget
AI is disruptive
Data is the Foundation
Choosing a Platform: 3
Major Decisions
 Decision #1: The Data Store Type
• The largest factor for distinguishing between databases and file-based scale-out system utilization is the data profile. The latter is
best for data that fits the loose label of 'unstructured' (or semi-structured) data, while more traditional data -- and smaller
volumes of all data -- still belong in a relational database.
 Decision #2: Data Store Placement
• You must also decide where to place your data store: on-premises or in the cloud (and which cloud). In the past, the only clear
choice for most organizations was on-premises. However, the costs of scale are gnawing away at the notion that this
remains the best approach for a data platform.
 Decision #3: The Workload Architecture
• Finally, you must keep in mind the distinction between operational and analytical workloads. Short transactional requests and
more complex (often longer) analytics requests demand different architectures. Analytics databases, though quite diverse, are
the preferred platforms for the analytics workload.
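As a rough illustration, Decisions #1 and #3 can be written down as two small rules. This is only a sketch: the `Workload` fields and function names are hypothetical, what counts as a "large" volume is left to the reader because the slide gives no threshold, and Decision #2 (placement) is not modeled because it depends on cost factors outside this deck.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of a workload; field names are illustrative."""
    mostly_unstructured: bool  # data fits the loose 'unstructured' label
    large_volume: bool         # "large" is a judgment call, not a fixed number
    analytical: bool           # long, complex requests rather than short transactions


def store_type(w: Workload) -> str:
    # Decision #1: file-based scale-out suits unstructured data at volume;
    # traditional data, and smaller volumes of any data, stay relational.
    if w.mostly_unstructured and w.large_volume:
        return "file-based scale-out system"
    return "relational database"


def architecture(w: Workload) -> str:
    # Decision #3: operational and analytical requests want different architectures.
    return "analytical" if w.analytical else "operational"


clickstream = Workload(mostly_unstructured=True, large_volume=True, analytical=True)
orders = Workload(mostly_unstructured=False, large_volume=False, analytical=False)
print(store_type(clickstream), "/", architecture(clickstream))
print(store_type(orders), "/", architecture(orders))
```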
Data Everywhere
And in Numerous Technical Forms
And in Numerous Clouds
Low Maturity Data
Integration
Leverageable Vehicles
 Data Warehouse
 Master Data Management
 Data Lake
Points of Data Integration
• Into the Data Warehouse(s)
• Into the Data Marts/cubes that do not integrate with the data warehouse
• Into the Data Marts/cubes that do integrate with the data warehouse
• Into Big Data platforms from sensor, clickstream, other systems
• Into Big Data platforms from Data Stream Processing
• Into the Master Data Management Hub from publishing/master systems
• From the Master Data Management Hub to every subscribing system (ERPs, NoSQL, Hadoop, data
warehouse, analytical databases, etc.)
• Between analytical stores
• Between operational stores
• Summaries of Big Data for the data warehouse and other analytical stores
• Data migrations for setting up new environments
• Etc.!
Modern Realities of Data
Integration
 Desire for consolidated methods for data integration
 New types of data sources
• Logs, sensors, etc.
 We have more than OLTP and OLAP
• Distributed data platforms
 Desire for real-time data
 High-velocity data increasingly needs integration
 Traditional approaches, without Stream Processing, turn
into ETL+custom scripts+middleware+MQ
Real-Time Data
 A.k.a. messaging, live feeds, real-time, event-driven
 Comes in continuously and often quickly, so we call
it streaming data.
 Needs special attention and can be of immense
value, but only if we are alerted in time.
 Foundation for Artificial Intelligence
• Stream data forms the core of data for artificial
intelligence
Message-Oriented Middleware / Message
Queueing Technology
 An architectural component that deals with messages
 Manage and distribute streaming data
• Any kind of data wrapped in a neat package with a very simple
header
• Sent by “producers”—systems, sensors, or devices that generate
the messages—toward a “broker”.
• Routes them into queues according to the information enclosed in the
message header or its own routing process
• “Consumers” retrieve the messages from the queues to which they
subscribe
• Open the messages and perform some kind of action on them.
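The producer, broker, and consumer roles described above can be sketched with a toy in-memory broker. This is an illustration only, not any real messaging product: the `Message` and `Broker` classes, the `topic` header field, and the 90-degree alert threshold are all assumptions made for the example.

```python
from collections import defaultdict, deque
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class Message:
    header: dict  # very simple header, e.g. {"topic": "sensor"}
    body: Any     # any kind of data


class Broker:
    """Toy in-memory broker that routes messages into queues by header."""

    def __init__(self) -> None:
        self.queues = defaultdict(deque)

    def publish(self, msg: Message) -> None:
        # Route on the header; a real broker may also apply its own routing rules.
        self.queues[msg.header["topic"]].append(msg)

    def retrieve(self, topic: str) -> Optional[Message]:
        # Consumers pull from the queue they subscribe to, oldest message first.
        queue = self.queues[topic]
        return queue.popleft() if queue else None


broker = Broker()

# Producers: systems, sensors, or devices that generate messages.
broker.publish(Message({"topic": "sensor"}, {"device": "t-1", "temp_c": 71.5}))
broker.publish(Message({"topic": "clicks"}, {"page": "/home"}))
broker.publish(Message({"topic": "sensor"}, {"device": "t-1", "temp_c": 93.0}))

# Consumer: open each message and act on it (here, flag hot readings).
alerts = []
while (msg := broker.retrieve("sensor")) is not None:
    if msg.body["temp_c"] > 90:  # hypothetical alert threshold
        alerts.append(msg.body)

print(alerts)
```

The consumer only ever sees the queue it subscribes to, so the clickstream message stays queued for a different consumer.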
Streaming Architecture
[Diagram labels: Apps; Streaming Platform; Change logs; Streaming data pipelines; Messaging or stream processing; Request-Response; DW; Hadoop]
Every Project is Burdened
(with Grander Opportunity)
Data Success Measurement
• User Satisfaction
• Business ROI and growth instigated
• Data Maturity (long-term user satisfaction and business ROI)
• Misc.
“Beyond the Mountain is
another mountain.”
Champion Initiatives That
Matter
 Every single item on a company mission statement
relates to data at some level
 It is from the position of data expertise that the
mission will be executed and company leadership
will emerge
 The data professional is absolutely sitting on the
performance of the company in this information
economy and has an obligation to demonstrate the
possibilities and originate the architecture, data
and projects that will deliver.
 Being responsive to urgent requests is not enough to be
the data leader that companies need.
Modern Data Integration
Problems with Normal Data Integration Processes
Data modeling. Too much time spent coping with slight changes
in our business data
Business/IT alignment. Data architects, DBAs, and others can’t
communicate with businesspeople
Processes. Too much detail lost by handing off responsibility for
business data to different people
Problem: Data Modeling
Too much time spent coping with slight changes in our business data
[Figure: a name table that starts with Given, Middle, and Family columns (Johann Sebastian Bach) and gains Hon., Patronymic, and Art. columns as more names are added: Dmitri Dmitriyevich Shostakovich, Mohamed el Mougi, Muhammad al Qasabgi, Ludwig van Beethoven, Chen Yi]
Repeated changes in operational systems’ row-and-column structures
Problem: Data Modeling
Ripple effects of changes in one system lead to changes in others
[Figure: the same name table repeated across three kinds of store, so that each column change in the first must be carried into the others]
Operational, designed for transactions
Data warehouse, designed for abstractions
Data mart, designed for analysis
Problem: Business/IT Alignment
Data people often can’t communicate with businesspeople
Data architect thinks
 Model the data
 Govern the data
 Watch out for “quick fixes”
That modeling stuff we just talked about
IT: Gets it
Business: Hates it
Business thinks
 Modeling, metadata are hindrances
 Analytical tools best without governance
 IT slows them down
Problem: Processes
Too much information lost by distributing responsibility for business data
Cleansing occurs in transformation step: Different rules being fired
Different tools and metadata being used by platform
Loss of timestamps, context, before-and-after: No cross-platform auditability
No comprehensive rollback, alternate history, what-if
[Diagram: Operational application, Data warehouse, and Cloud application, each holding a Family column, connected by two separate Transformation / Cleansing / Standardization steps]
How much time do we
spend mapping one set
of rows and columns
to another?
Modern Data Integration
A modern solution:
post-relational technology for data capture, transformation,
subject-oriented storage (perhaps), and exchange;
rich documents instead of relational models
Operational
application
Data
warehouse
Analytics
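To make the contrast concrete, here is a small sketch using the composer names from the data modeling slides. It is an illustration under assumptions: the field names (`given`, `patronymic`, `article`, and so on) and the helper functions are hypothetical, and the assignment of name parts to fields is one reading of the slide. A fixed set of columns rejects any name part it has no column for, while a document simply carries the parts it has.

```python
# Fixed row-and-column shape: every new kind of name part forces a schema change.
COLUMNS = ("given", "middle", "family")


def to_row(name):
    """Map a name onto the fixed columns; raise if a part has no column."""
    extra = set(name) - set(COLUMNS)
    if extra:
        raise ValueError(f"no column for: {sorted(extra)}")
    return tuple(name.get(col, "") for col in COLUMNS)


# Document shape: each record carries only the parts it actually has.
people = [
    {"given": "Johann", "middle": "Sebastian", "family": "Bach"},
    {"given": "Dmitri", "patronymic": "Dmitriyevich", "family": "Shostakovich"},
    {"given": "Ludwig", "article": "van", "family": "Beethoven"},
    {"given": "Yi", "family": "Chen"},
]


def family_names(docs):
    # Consumers read the fields they need and ignore the rest.
    return [d["family"] for d in docs]


print(to_row(people[0]))
print(family_names(people))
try:
    to_row(people[1])
except ValueError as err:
    print(err)
```

The relational mapping fails as soon as a patronymic appears, whereas the document-based reader keeps working without any change.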
Modern Data Integration
A modern solution:
ELT capture/integrate, time-stamped, subject-oriented,
to capture data as it is, apply trustworthy processes to it,
and share it in trusted ways
How much info
do we lose
by distributing
ETL processes?
[Diagram: Operational application and Analytics connected through a central Data Capture/Transformation Hub that performs transformation, cleansing, and standardization, with application to business use cases]
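The capture-first idea can be sketched as a minimal hub that stores every record exactly as received, stamps it with a capture time, and applies one shared standardization rule afterwards. This is a sketch under assumptions, not how any particular product works: the `Captured` and `Hub` classes, the source names, and the title-casing rule are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class Captured:
    source: str
    raw: dict                       # data exactly as the source sent it
    captured_at: datetime           # timestamp kept with every record
    cleansed: Optional[dict] = None  # filled in by the hub; raw is never overwritten


@dataclass
class Hub:
    records: list = field(default_factory=list)

    def capture(self, source: str, raw: dict) -> Captured:
        # Load first: store the record untouched, with its capture time.
        rec = Captured(source, dict(raw), datetime.now(timezone.utc))
        self.records.append(rec)
        return rec

    def standardize(self) -> None:
        # Transform afterwards, in one place, so every source gets the same rule.
        for rec in self.records:
            rec.cleansed = {
                key.lower(): str(value).strip().title()
                for key, value in rec.raw.items()
            }


hub = Hub()
hub.capture("operational_app", {"Family": "  BACH "})
hub.capture("cloud_app", {"family": "bach"})
hub.standardize()

for rec in hub.records:
    print(rec.source, rec.captured_at.isoformat(), rec.raw, "->", rec.cleansed)
```

Because the raw value and the timestamp stay alongside the cleansed value, the before-and-after state remains available for audit.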
Modern Data Integration: The Omni-Gen Approach
We built software to make ourselves successful
 Immediate capture in automatically generated data hub
 Master data: business-user-oriented, subject-oriented
 Rapid, integrated data quality rules
 Mastered and transactional subjects
 Rapid cycle times to keep the business engaged
 Support and automatically apply best practices
Modern Data Integration: The Omni-Gen Approach
Extending Value
We built “persona models” for customer and supplier
Everything you get in Omni-Gen, plus
 Pre-built models
 Pre-built data quality rules
 Pre-built match/merge rules
 Pre-built data governance
 Immediate 360° core view, unlimited extensions
 Supports different consumers with different, but trusted, data
Omni-Gen: More Value in Far Less Time
Project timeline, in months (chart labels: 12-18, 1-3, 4-6)
Traditional
Data management tools
Build-it-yourself development environment
Omni-Gen
Software solution with built-in best practices
MDM, DQ, integration software with rules,
automatically generated data vault, remediation portal,
360° viewer, history, data interfaces, APIs, and feeds
Omni for Persona
Software solution with built-in best practices and complete master data models
Data vault model, data onramps; MDM, data quality, and integration software; MDM
and data quality rules, remediation portal, 360° viewer; Data interfaces, APIs,
history, & feeds; Analytical foundation for dashboarding, advanced analytics, more.
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
 


Modern data integration expert sessions

  • 1. Modern Data Integration. Expert Sessions. William McKnight, McKnight Consulting Group; Jake Freivald, Information Builders.
  • 2. Unlock Potential. William McKnight, www.mcknightcg.com, 214-514-1444, @williammcknight. Modern Data Integration - Expert Sessions. Copyright © 2018 McKnight Consulting Group, LLC. All Rights Reserved.
  • 3. Data is the Most Important Asset in the World: we trade it for services instead of money; our information is exploding; business is moving to real-time, all the time; our information differentiates us from our competitors; information is a key business asset.
  • 4. Corporate Initiatives. 80% of Initiatives That Matter are about DATA: Budget; Energy. 80% of Initiatives should be Business-Focused: ROI; Resource-Leveled.
  • 5. Data Maturity is Highly Correlated to Business Success. (Figure labels: Data Maturity; Business Success.)
  • 6. The Money Tree Doesn’t Exist. Hitch your Architecture and Maturity Efforts to an Application Budget.
  • 7. AI is disruptive. Data is the Foundation.
  • 8. Choosing a Platform: 3 Major Decisions. Decision #1: The Data Store Type. The largest factor for distinguishing between databases and file-based scale-out system utilization is the data profile. The latter is best for data that fits the loose label of 'unstructured' (or semi-structured) data, while more traditional data, and smaller volumes of all data, still belong in a relational database. Decision #2: Data Store Placement. You must also decide where to place your data store: on-premises or in the cloud (and which cloud). In the past, the only clear choice for most organizations was on-premises data. However, the costs of scale are gnawing away at the notion that this remains the best approach for a data platform. Decision #3: The Workload Architecture. Finally, you must keep in mind the distinction between operational and analytical workloads. Short transactional requests and more complex (often longer) analytics requests demand different architectures. Analytics databases, though quite diverse, are the preferred platforms for the analytics workload.
  • 9. Data Everywhere, And in Numerous Technical Forms, And in Numerous Clouds.
  • 10. Low Maturity Data Integration.
  • 11. Leverageable Vehicles: Data Warehouse; Master Data Management; Data Lake.
  • 12. Points of Data Integration: into the Data Warehouse(s); into the Data Marts/cubes that do not integrate with the data warehouse; into the Data Marts/cubes that do integrate with the data warehouse; into Big Data platforms from sensor, clickstream, other systems; into Big Data platforms from Data Stream Processing; into the Master Data Management Hub from publishing/master systems; from the Master Data Management Hub to every subscribing system (ERPs, NoSQL, Hadoop, data warehouse, analytical databases, etc.); between analytical stores; between operational stores; summaries of Big Data for the data warehouse and other analytical stores; data migrations for setting up new environments; etc.
  • 13. Modern Realities of Data Integration: desire for consolidated methods for data integration; new types of data sources (logs, sensors, etc.); we have more than OLTP and OLAP (distributed data platforms); desire for real-time data; high-velocity data increasingly needs integration; traditional approaches, without Stream Processing, turn into ETL + custom scripts + middleware + MQ.
  • 14. Real-Time Data: a.k.a. messaging, live feeds, real-time, event-driven; comes in continuously and often quickly, so we call it streaming data; needs special attention and can be of immense value, but only if we are alerted in time; foundation for Artificial Intelligence (stream data forms the core of data for artificial intelligence).
  • 15. Message-Oriented Middleware / Message Queueing Technology: an architectural component that deals with messages; it manages and distributes streaming data. A message is any kind of data wrapped in a neat package with a very simple header. Messages are sent by “producers” (systems, sensors, or devices that generate the messages) toward a “broker”. The broker routes them into queues according to the information enclosed in the message header or its own routing process. “Consumers” retrieve the messages from the queues to which they subscribe, open the messages, and perform some kind of action on them.
  • 16. Streaming Architecture. (Diagram labels: Apps; Streaming Platform; Change logs; Streaming data pipelines; Messaging or Stream processing; Request - Response; DW; Hadoop.)
  • 17. Every Project is Burdened (with Grander Opportunity).
  • 18. Data Success Measurement: User Satisfaction; Business ROI and growth instigated; Data Maturity (Long-term User Sat and Bus ROI); Misc.
  • 19. “Beyond the Mountain is another mountain.”
  • 20. Champion Initiatives That Matter: every single item on a company mission statement relates to data at some level; it is from the position of data expertise that the mission will be executed and company leadership will emerge; the data professional is absolutely sitting on the performance of the company in this information economy and has an obligation to demonstrate the possibilities and originate the architecture, data and projects that will deliver; being responsive to urgent requests is not enough to be the data leader that companies need.
  • 21. Unlock Potential. William McKnight, www.mcknightcg.com, 214-514-1444, @williammcknight. Modern Data Integration - Expert Sessions.
  • 23. Problems with Normal Data Integration Processes. Data modeling: too much time spent coping with slight changes in our business data. Business/IT alignment: data architects, DBAs, and others can’t communicate with businesspeople. Processes: too much detail lost by handing off responsibility for business data to different people.
  • 24. Problem: Data Modeling. Too much time spent coping with slight changes in our business data. (Example table: Given = Johann; Middle = Sebastian; Family = Bach.)
  • 25. Problem: Data Modeling. Too much time spent coping with slight changes in our business data. Repeated changes in operational systems’ row-and-column structures. (Example table: the Given, Middle, and Family columns are joined by Hon., Patronymic, and Art. columns to hold names such as Johann Sebastian Bach, Dmitri Dmitriyevich Shostakovich, Mohamed el Qasabgi, Muhammad al Mougi, Ludwig van Beethoven, and Chen Yi.)
  • 26. Problem: Data Modeling. Ripple effects of changes in one system lead to changes in others. (Diagram: the example name table is repeated across three tiers: Operational, designed for transactions; Data warehouse, designed for abstractions; Data mart, designed for analysis.)
  • 27. Problem: Business/IT Alignment. Data people often can’t communicate with businesspeople. Data architect thinks: model the data; govern the data; watch out for “quick fixes”. Business thinks: modeling, metadata are hindrances; analytical tools best without governance; IT slows them down. That modeling stuff we just talked about: IT gets it; Business hates it.
  • 28. Problem: Processes. Too much information lost by distributing responsibility for business data. Cleansing occurs in transformation step: different rules being fired; different tools and metadata being used by platform. Loss of timestamps, context, before-and-after: no cross-platform auditability; no comprehensive rollback, alternate history, what-if. (Diagram labels: Operational application; Data warehouse; Cloud application; Transformation, Cleansing, Standardization.)
  • 29. Modern Data Integration. How much time do we spend mapping one set of rows and columns to another? A modern solution: post-relational for data capture, transformation, subject-oriented storage (perhaps), and exchange; rich documents instead of relational models. (Diagram labels: Operational application; Data warehouse; Analytics.)
  • 30. Modern Data Integration. How much time do we spend mapping one set of rows and columns to another? A modern solution: post-relational for data capture, transformation, subject-oriented storage (perhaps), and exchange; rich documents instead of relational models. (Diagram labels: Operational application; Data warehouse; Analytics.)
  • 31. Modern Data Integration. How much info do we lose by distributing ETL processes? A modern solution: ELT. Capture/integrate to capture data as it is, time-stamped; apply trustworthy processes to it, subject-oriented; and share it in trusted ways. (Diagram labels: Operational application; Data warehouse; Analytics.)
  • 32. Modern Data Integration. How much info do we lose by distributing ETL processes? A modern solution: ELT. Capture/integrate to capture data as it is, time-stamped; apply trustworthy processes to it, subject-oriented; and share it in trusted ways. (Diagram labels: Operational application; Analytics; Data Capture/Transformation Hub with Transformation, Cleansing, Standardization; Application to business use cases.)
  • 33. Modern Data Integration: The Omni-Gen Approach. We built software to make ourselves successful: immediate capture in automatically generated data hub; master data: business-user-oriented, subject-oriented; rapid, integrated data quality rules; mastered and transactional subjects; rapid cycle times to keep the business engaged; support and automatically apply best practices.
  • 34. Modern Data Integration: The Omni-Gen Approach. Extending Value. We built “persona models” for customer and supplier. Everything you get in Omni-Gen, plus: pre-built models; pre-built data quality rules; pre-built match/merge rules; pre-built data governance; immediate 360° core view, unlimited extensions; supports different consumers with different, but trusted, data.
  • 35. Omni-Gen: More Value in Far Less Time. Project timeline, in months (values shown: 12-18, 1-3, 4-6). Traditional: data management tools; build-it-yourself development environment. Omni-Gen: software solution with built-in best practices; MDM, DQ, integration software with rules, automatically generated data vault, remediation portal, 360° viewer, history, data interfaces, APIs, and feeds. Omnifor Persona: software solution with built-in best practices and complete master data models; data vault model, data onramps; MDM, data quality, and integration software; MDM and data quality rules, remediation portal, 360° viewer; data interfaces, APIs, history, and feeds; analytical foundation for dashboarding, advanced analytics, more.
  • 36. Modern Data Integration. Expert Sessions. William McKnight, McKnight Consulting Group; Jake Freivald, Information Builders.
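
The producer, broker, queue, and consumer flow described on slide 15 can be illustrated with a small in-memory sketch. This is not code from the deck and not the API of any particular message-queueing product; the names `Message`, `Broker`, `publish`, `consume`, and the `"topic"` header key are all hypothetical, chosen only to show the routing idea.

```python
from collections import defaultdict, deque
from dataclasses import dataclass
from typing import Any, Callable, Deque, Dict, List


@dataclass
class Message:
    header: Dict[str, str]  # very simple header; the "topic" key is an assumption
    body: Any               # any kind of data


class Broker:
    """Toy in-memory broker that routes messages into queues by header."""

    def __init__(self) -> None:
        self.queues: Dict[str, Deque[Message]] = defaultdict(deque)

    def publish(self, message: Message) -> None:
        # Route on the header; messages without a topic go to a default queue.
        topic = message.header.get("topic", "unrouted")
        self.queues[topic].append(message)

    def consume(self, topic: str, handler: Callable[[Message], None]) -> int:
        # Drain the subscribed queue, oldest message first, and act on each one.
        handled = 0
        queue = self.queues[topic]
        while queue:
            handler(queue.popleft())
            handled += 1
        return handled


def demo() -> List[float]:
    broker = Broker()
    # Producers: a sensor and a web application emit messages toward the broker.
    broker.publish(Message({"topic": "sensor"}, {"temp_c": 21.5}))
    broker.publish(Message({"topic": "clicks"}, {"page": "/home"}))
    broker.publish(Message({"topic": "sensor"}, {"temp_c": 22.0}))
    # Consumer: subscribes to the "sensor" queue and collects the readings.
    readings: List[float] = []
    broker.consume("sensor", lambda m: readings.append(m.body["temp_c"]))
    return readings


if __name__ == "__main__":
    print(demo())
```

A production broker would add persistence, acknowledgements, and concurrent delivery; the sketch leaves these out to keep the routing step visible.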