SlideShare ist ein Scribd-Unternehmen logo
1 von 41
Why Data Modelling is
Essential in a Big Data
Environment
Mark Barringer
Introduction
• My name is Mark Barringer
(mark.Barringer@embarcadero.com)
• Product Manager (Data Architecture Tools) Embarcadero
• I live in the beautiful city of Winchester in southern England
2
Agenda and Objectives…
3
Data Challenges
What are the modern day challenges for the Data
Architect
Big Data
Landscape
Adding Value to
the Business
Data Modelling
Techniques
How to understand and view the Big Data
Landscape
Making the Big Data Landscape transparent to the
business and how to add real value
A look at some of the latest Data Modelling Tips and
Tricks applied to the Big Data Environment
Challenges facing Data Architecture
4
Federation
Data Democratisation
Platform Fragmentation
Data Lineage
Latency
Delivery
Obfuscation
Challenges facing Data Architecture
5
Federation: The application of a single view over multiple repositories.
Data Democratisation
Platform Fragmentation
Data Lineage
Latency
Delivery
Obfuscation
Challenges facing Data Architecture
6
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation
Data Lineage
Latency
Delivery
Obfuscation
Challenges facing Data Architecture
7
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation: The proliferation of non-RDBMS solutions to store data.
Data Lineage
Latency
Delivery
Obfuscation
Challenges facing Data Architecture
8
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation: The proliferation of non-RDBMS solutions to store data.
Data Lineage: Expectation to understand actions performed on the data.
Latency
Delivery
Obfuscation
Challenges facing Data Architecture
9
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation: The proliferation of non-RDBMS solutions to store data.
Data Lineage: Expectation to understand actions performed on the data.
Latency: The trend towards lower end-to end latency of data (from creation to reporting)
Delivery
Obfuscation
Challenges facing Data Architecture
10
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation: The proliferation of non-RDBMS solutions to store data.
Data Lineage: Expectation to understand actions performed on the data.
Latency: The trend towards lower end-to end latency of data (from creation to reporting)
Delivery: Model development in step with development teams.
Obfuscation
Challenges facing Data Architecture
11
Federation: The application of a single view over multiple repositories.
Data Democratisation: The expectation of the business to have more control over the data
assets.
Platform Fragmentation: The proliferation of non-RDBMS solutions to store data.
Data Lineage: Expectation to understand actions performed on the data.
Latency: The trend towards lower end-to end latency of data (from creation to reporting)
Delivery: Model development in step with development teams.
Obfuscation: Expectation to view and understand data models by the business.
Why Data Modelling is Essential…
12
Modelling
the Business
Understand
the
Landscape
Self
Documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Why Data Modelling is Essential…
13
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Business Modelling
• Meaningful abstracted view
of the business
• Data-centric perspective
• 'Anchor point' for other
models
• Key to successful
communication
• Develop credibility and
relevance with the business
• Establish Business Glossaries
with consistent definitions
• Build a solid foundation for
Compliance, Data Governance
and Master Data
Management
• Improve visibility and
collaboration with ER/Studio
Connecting Business Information and IT
Data
14
Who are the Data customers and collaborators?
15
DA
Technical Collaborators
(DBA, ETL, SA)
(MetaData Consumers)
Data Analysts
(Finance, Credit, Mktg.)
(Data Consumers)
Business Users
(Information Producers)
Benefit of Relating Metadata to Models
• Expand the depth of information
by accessing the underlying
framework
16
• Models and terms seamlessly integrate to
one another
Why Data Modelling is Essential…
17
Modelling
the business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Understand the Landscape
• Create Landscape Inventories
• Reverse Engineer
• Overcome Information
Obscurity
• Eliminate Data Silos
• Contain much of the detailed
meta data
• Useful for impact and gap
analysis
• Platform agnostic
• Map to concepts in the
Conceptual Data Model and
physical objects
Enterprise Complexity – Information Obscurity
Big Data & NoSQL: The Challenge
• Capture new data sources and increase
information management footprint
• Understanding semi- and un-structured data
• “Raw Data is the single source of the truth”
• 7 Vs
• Velocity, Volume, Variety,
• Veracity (conformity of facts – Data in doubt)
• Variable, Virtual, Value
• Reverse & Forward Engineering (JSON, BSON)
• Forward & Reverse Engineer DDL
• “We will hopefully find what we didn’t know about
that we didn’t know that we didn’t know about”
Eliminate Data Silos: Inventory existing databases
• What type of data do you own and where can it be found.
• Map your data landscape using data models as the foundation.
– Each model represents a different database system
– Link like data elements together for traceability
20
Why Data Modelling is Essential…
21
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Physical Modelling for Big Data
• Accurately model all types of
data at rest within the
organization.
• Not just RDBMS resident data
• Document physical meta-data
(table space, partitions, etc)
• Introduce non-RDBMS data
stores e.g. NOSQL, JSON, HIVE
• Build many physicals based
on business decomposition
• Reverse Engineering
Traditional (RDBMS) Prescriptive Data Modelling
MODEL
(and
DESIGN)
LOAD
EXPLORE/
QUERY
DATA EXPLORE
‘Schema on write’
Good for
Known
Unknowns
(Repetition)
Big Data (NoSQL/Hadoop) Descriptive Data Modelling
LOAD QUERY MODEL
NoSQL STORE
EXPLORE
But Fast and Agile!
‘Schema on read’
Good for
Unknown
Unknowns
(Exploration)
Data modeling in Big Data
Customer
-
-
-
NoSQL DATABASE
Documents
-
-
-
Product
-
-
-
Key values
-
-
-
Conceptual/
business data model
Understanding
Logical/physical
data model
Architecture/Design
RELATIONAL
DATABASE
(i.e., Data
warehouse/data mart)
May transfer into structured database
(using models)
Why Data Modelling is Essential…
25
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Timely Design
• Ensuring changes to the
physical data models are in
step with and relevant to the
development methodology
used in the organization.
• Where modelling meets
development.
• Create credibility and
relevance with the
development teams
• User Stories, Tasks and
Change Management
Agile Change Management
• Enable “Agile Data Modeler”
– Incremental rather than waterfall
• Need more granularity than named
versions of a model or submodel
• Change numbers at Repository check in
• Can be associated to user stories, tasks
26
Why Data Modelling is Essential…
27
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
Physical models
Data Modelling
Techniques
Agile
Development
Data Modelling Techniques
• Sub-Modelling
• Business Decomposition
• Visual Data Lineage
• Impact Analysis / Where Used
• Naming Standards
• Data Source Mapping
• Universal Mapping
• Augmented metadata
• Glossary Integration
Big Data Notation Enhancement
• Physical Model
– Objects instead of
tables
• Nested Objects
– “is contained in”
relationship type
28
Containment Relationship: Array of Nested Objects
29
db.patron.insert(
{
"_id" :
ObjectId("5367ddc4228cd006ab2bc60c"),
name: "Joe Bookreader",
address: [
{
street: "123 Fake Street",
city: "Faketon",
state: "MA"
},
{
street: "1 Someother Street",
zip: "12345"
} ]
})
db.book.insert(
{
title: "MongoDB: The Definitive Guide",
author: [ "Kristina Chodorow", "Mike Dirolf" ],
published_date: ISODate("2010-09-24"),
pages: 216,
language: "English",
publisher_id: ObjectId("5367dd99228cd006ab2bc60b"),
available: 3,
checkout: [ { by: ObjectId("5367ddc4228cd006ab2bc60c"), date: ISODate("2012-10-15") } ]
},
{
title: "50 Tips and Tricks for MongoDB Developer",
author: [ "Kristina Chodorow" ],
published_date: ISODate("2011-05-06"),
pages: 68,
language: "English",
publisher_id: ObjectId("5367dd99228cd006ab2bc60b")
})
Hive & ER/Studio
30
Understanding the Big Data Schema
Technique: Attachments (Metadata extensions)
Technique: Data Source Mapping
32
Technique: Automated Naming Standards
Real-time update while typing
33
Technique: Glossary Integration
• Associate Data Architect objects to Business glossary terms
– Model, submodel
– Entity, Table
– Attribute, Column
– Domain
– View
• Push terms to glossary
34
Why Data Modelling is essential…
35
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big Data
Physical models
Data Modelling
Techniques
Agile
Development
Business metadata
• Provide the business with the
ability to centrally manage its
own meta data in terms of
definitions, rules and
relationships in a structured
and curated manner.
• Facilitate the binding of the
business elements to
technical elements within the
models and other
documentation.
• Data Dictionary
• Self-Service Discovery and
Reporting
Providing Business Context
A taxonomy of searchable terms mapped to unique concepts
R&D
Entity
Business Term
Patient Recruitment
Data
Attribute
Business Term
Batch Supply Data
DatasourcePhysical Model
Column
Logical diagram
Table
Clinical
Discussion
threads
Conceptual &
Process Diagrams
Glossaries
Supply Chain
Why Data Modelling is essential…
37
Modelling
the Business
Understand
the
Landscape
Self
documenting
Business
Heterogeneous
& Big data
Physical models
Data Modeling
Techniques
Agile
Development
Holistic, integrated
modelling that can
present the same meta
data to different
audiences in the most
appropriate format.
The single most important
challenge to overcome is
that of communication and
collaboration and to Build
Trust in Data.
Without the ability to
communicate effectively to a
wide variety of audiences even
the most diligently documented
organisation will be unable to
benefit from it.
Embarcadero Enterprise Database Tools
Win a
FitBit Charge HR
Leave a
Business Card
at the
Barnsten /
Embarcadero
Stand
Raymond Horsten
(r.horsten@barnsten.com)
Mark Barringer
(mark.barringer@embarcadero.com)
Questions and Answers
Raymond Horsten (r.horsten@barnsten.com)
Mark Barringer (mark.barringer@embarcadero.com)
40
Building Trust in Data : Collaboration
Syndication
Governance and Collaboration
Technical
Metadata
Business
Metadata
Metadata
Repository
Data Modeling Team Server Web
Architecture Business
SDLC &
Information
Management
Integrated
Tooling
Enterprise
Data

Weitere ähnliche Inhalte

Was ist angesagt?

Benefits of the Azure Cloud
Benefits of the Azure CloudBenefits of the Azure Cloud
Benefits of the Azure CloudCaserta
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Caserta
 
Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingCaserta
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data LakeCaserta
 
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongThe Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongDATAVERSITY
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and morePerformance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and moreDenodo
 
Neo4j Solutions - Master Data Management
Neo4j Solutions - Master Data ManagementNeo4j Solutions - Master Data Management
Neo4j Solutions - Master Data ManagementCaserta
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedDunn Solutions Group
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on HadoopCaserta
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big DataClark Boyd
 
GDPR Noncompliance: Avoid the Risk with Data Virtualization
GDPR Noncompliance: Avoid the Risk with Data VirtualizationGDPR Noncompliance: Avoid the Risk with Data Virtualization
GDPR Noncompliance: Avoid the Risk with Data VirtualizationDenodo
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Caserta
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 
Data catalog
Data catalogData catalog
Data catalogiamtodor
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseCaserta
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Denodo
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 

Was ist angesagt? (20)

Benefits of the Azure Cloud
Benefits of the Azure CloudBenefits of the Azure Cloud
Benefits of the Azure Cloud
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Data vault what's Next: Part 2
Data vault what's Next: Part 2Data vault what's Next: Part 2
Data vault what's Next: Part 2
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven Marketing
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongThe Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and morePerformance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and more
 
Neo4j Solutions - Master Data Management
Neo4j Solutions - Master Data ManagementNeo4j Solutions - Master Data Management
Neo4j Solutions - Master Data Management
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big Data
 
GDPR Noncompliance: Avoid the Risk with Data Virtualization
GDPR Noncompliance: Avoid the Risk with Data VirtualizationGDPR Noncompliance: Avoid the Risk with Data Virtualization
GDPR Noncompliance: Avoid the Risk with Data Virtualization
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
Data catalog
Data catalogData catalog
Data catalog
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 

Andere mochten auch

Infostrada Sports Online Marketing Presentation ISWI
Infostrada Sports Online Marketing Presentation ISWIInfostrada Sports Online Marketing Presentation ISWI
Infostrada Sports Online Marketing Presentation ISWIAnders Dielessen
 
There's Power in a Pin
There's Power in a PinThere's Power in a Pin
There's Power in a Pincdanielsajc
 
Multiplication of grace
Multiplication of graceMultiplication of grace
Multiplication of graceOKE OLUSEGUN
 
Centennium antoine stelma
Centennium antoine stelmaCentennium antoine stelma
Centennium antoine stelmaBigDataExpo
 
Big Data Expo 2015 - Blauw Focus forward
Big Data Expo 2015 - Blauw Focus forwardBig Data Expo 2015 - Blauw Focus forward
Big Data Expo 2015 - Blauw Focus forwardBigDataExpo
 
New Testament Survey no.26: Paul - Letter to Titus
New Testament Survey no.26: Paul - Letter to TitusNew Testament Survey no.26: Paul - Letter to Titus
New Testament Survey no.26: Paul - Letter to TitusClive Ashby
 
Alfiandhani suci mutiara_(h11115508)
Alfiandhani suci mutiara_(h11115508)Alfiandhani suci mutiara_(h11115508)
Alfiandhani suci mutiara_(h11115508)Alfiandhani Mutiara
 
Quby michiel fokke
Quby michiel fokkeQuby michiel fokke
Quby michiel fokkeBigDataExpo
 
Big Data Expo 2015 - Anchormen Enter the Lambda-architecture
Big Data Expo 2015 - Anchormen Enter the Lambda-architectureBig Data Expo 2015 - Anchormen Enter the Lambda-architecture
Big Data Expo 2015 - Anchormen Enter the Lambda-architectureBigDataExpo
 
Jorge bernardino sap sd resume_ 2016_en
Jorge bernardino sap sd resume_ 2016_enJorge bernardino sap sd resume_ 2016_en
Jorge bernardino sap sd resume_ 2016_enJorge Bernardino
 
Bovee bct12 ppt_ch05
Bovee bct12 ppt_ch05Bovee bct12 ppt_ch05
Bovee bct12 ppt_ch05Samina Haider
 
Bovee bct12 ppt_ch08
Bovee bct12 ppt_ch08Bovee bct12 ppt_ch08
Bovee bct12 ppt_ch08Samina Haider
 
Bovee bct12 ppt_ch02
Bovee bct12 ppt_ch02Bovee bct12 ppt_ch02
Bovee bct12 ppt_ch02Samina Haider
 
Bovee bct12 ppt_ch11
Bovee bct12 ppt_ch11Bovee bct12 ppt_ch11
Bovee bct12 ppt_ch11Samina Haider
 
Wielka Brytania
Wielka BrytaniaWielka Brytania
Wielka Brytania89076
 

Andere mochten auch (19)

Infostrada Sports Online Marketing Presentation ISWI
Infostrada Sports Online Marketing Presentation ISWIInfostrada Sports Online Marketing Presentation ISWI
Infostrada Sports Online Marketing Presentation ISWI
 
TUGAS PRAKTIKUM 2
TUGAS PRAKTIKUM 2TUGAS PRAKTIKUM 2
TUGAS PRAKTIKUM 2
 
There's Power in a Pin
There's Power in a PinThere's Power in a Pin
There's Power in a Pin
 
Multiplication of grace
Multiplication of graceMultiplication of grace
Multiplication of grace
 
Overcoming Worldly Values!
Overcoming Worldly Values!Overcoming Worldly Values!
Overcoming Worldly Values!
 
The Grid - All In
The Grid - All InThe Grid - All In
The Grid - All In
 
Centennium antoine stelma
Centennium antoine stelmaCentennium antoine stelma
Centennium antoine stelma
 
Big Data Expo 2015 - Blauw Focus forward
Big Data Expo 2015 - Blauw Focus forwardBig Data Expo 2015 - Blauw Focus forward
Big Data Expo 2015 - Blauw Focus forward
 
New Testament Survey no.26: Paul - Letter to Titus
New Testament Survey no.26: Paul - Letter to TitusNew Testament Survey no.26: Paul - Letter to Titus
New Testament Survey no.26: Paul - Letter to Titus
 
Alfiandhani suci mutiara_(h11115508)
Alfiandhani suci mutiara_(h11115508)Alfiandhani suci mutiara_(h11115508)
Alfiandhani suci mutiara_(h11115508)
 
god's amazing grace
god's amazing gracegod's amazing grace
god's amazing grace
 
Quby michiel fokke
Quby michiel fokkeQuby michiel fokke
Quby michiel fokke
 
Big Data Expo 2015 - Anchormen Enter the Lambda-architecture
Big Data Expo 2015 - Anchormen Enter the Lambda-architectureBig Data Expo 2015 - Anchormen Enter the Lambda-architecture
Big Data Expo 2015 - Anchormen Enter the Lambda-architecture
 
Jorge bernardino sap sd resume_ 2016_en
Jorge bernardino sap sd resume_ 2016_enJorge bernardino sap sd resume_ 2016_en
Jorge bernardino sap sd resume_ 2016_en
 
Bovee bct12 ppt_ch05
Bovee bct12 ppt_ch05Bovee bct12 ppt_ch05
Bovee bct12 ppt_ch05
 
Bovee bct12 ppt_ch08
Bovee bct12 ppt_ch08Bovee bct12 ppt_ch08
Bovee bct12 ppt_ch08
 
Bovee bct12 ppt_ch02
Bovee bct12 ppt_ch02Bovee bct12 ppt_ch02
Bovee bct12 ppt_ch02
 
Bovee bct12 ppt_ch11
Bovee bct12 ppt_ch11Bovee bct12 ppt_ch11
Bovee bct12 ppt_ch11
 
Wielka Brytania
Wielka BrytaniaWielka Brytania
Wielka Brytania
 

Ähnlich wie Big Data Expo 2015 - Barnsten Why Data Modelling is Essential

Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitecturePerficient, Inc.
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02email2jl
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018Denodo
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelUwe Printz
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldKaren Lopez
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)Moacyr Passador
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your PortfolioDenodo
 
Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010ERwin Modeling
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsEmbarcadero Technologies
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachKent Graziano
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
GraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyGraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyNeo4j
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Denodo
 

Ähnlich wie Big Data Expo 2015 - Barnsten Why Data Modelling is Essential (20)

Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data Model
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database World
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio
 
Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data Assets
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
GraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyGraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right Technology
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 

Mehr von BigDataExpo

Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...
Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...
Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...BigDataExpo
 
Google Cloud - Google's vision on AI
Google Cloud - Google's vision on AIGoogle Cloud - Google's vision on AI
Google Cloud - Google's vision on AIBigDataExpo
 
Pacmed - Machine Learning in health care: opportunities and challanges in pra...
Pacmed - Machine Learning in health care: opportunities and challanges in pra...Pacmed - Machine Learning in health care: opportunities and challanges in pra...
Pacmed - Machine Learning in health care: opportunities and challanges in pra...BigDataExpo
 
PGGM - The Future Explore
PGGM - The Future ExplorePGGM - The Future Explore
PGGM - The Future ExploreBigDataExpo
 
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...BigDataExpo
 
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...BigDataExpo
 
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...BigDataExpo
 
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AIDynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AIBigDataExpo
 
Teleperformance - Smart personalized service door het gebruik van Data Science
Teleperformance - Smart personalized service door het gebruik van Data Science Teleperformance - Smart personalized service door het gebruik van Data Science
Teleperformance - Smart personalized service door het gebruik van Data Science BigDataExpo
 
FunXtion - Interactive Digital Fitness with Data Analytics
FunXtion - Interactive Digital Fitness with Data AnalyticsFunXtion - Interactive Digital Fitness with Data Analytics
FunXtion - Interactive Digital Fitness with Data AnalyticsBigDataExpo
 
fashionTrade - Vroeger noemde we dat Big Data
fashionTrade - Vroeger noemde we dat Big DatafashionTrade - Vroeger noemde we dat Big Data
fashionTrade - Vroeger noemde we dat Big DataBigDataExpo
 
BigData Republic - Industrializing data science: a view from the trenches
BigData Republic - Industrializing data science: a view from the trenchesBigData Republic - Industrializing data science: a view from the trenches
BigData Republic - Industrializing data science: a view from the trenchesBigDataExpo
 
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...BigDataExpo
 
Endrse - Next level online samenwerkingen tussen personalities en merken met ...
Endrse - Next level online samenwerkingen tussen personalities en merken met ...Endrse - Next level online samenwerkingen tussen personalities en merken met ...
Endrse - Next level online samenwerkingen tussen personalities en merken met ...BigDataExpo
 
Bovag - Refine-IT - Proces optimalisatie in de automotive sector
Bovag - Refine-IT - Proces optimalisatie in de automotive sectorBovag - Refine-IT - Proces optimalisatie in de automotive sector
Bovag - Refine-IT - Proces optimalisatie in de automotive sectorBigDataExpo
 
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...BigDataExpo
 
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...BigDataExpo
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about DataBigDataExpo
 
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...BigDataExpo
 
Booking.com - Data science and experimentation at Booking.com: a data-driven ...
Booking.com - Data science and experimentation at Booking.com: a data-driven ...Booking.com - Data science and experimentation at Booking.com: a data-driven ...
Booking.com - Data science and experimentation at Booking.com: a data-driven ...BigDataExpo
 

Mehr von BigDataExpo (20)

Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...
Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...
Centric - Jaap huisprijzen, GTST, The Bold, IKEA en IENS. Zomaar wat toepassi...
 
Google Cloud - Google's vision on AI
Google Cloud - Google's vision on AIGoogle Cloud - Google's vision on AI
Google Cloud - Google's vision on AI
 
Pacmed - Machine Learning in health care: opportunities and challanges in pra...
Pacmed - Machine Learning in health care: opportunities and challanges in pra...Pacmed - Machine Learning in health care: opportunities and challanges in pra...
Pacmed - Machine Learning in health care: opportunities and challanges in pra...
 
PGGM - The Future Explore
PGGM - The Future ExplorePGGM - The Future Explore
PGGM - The Future Explore
 
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...
Universiteit Utrecht & gghdc - Wat zijn de gezondheidseffecten van omgeving e...
 
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...
Rob van Kranenburg - Kunnen we ons een sociaal krediet systeem zoals in het o...
 
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...
OrangeNXT - High accuracy mapping from videos for efficient fiber optic cable...
 
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AIDynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
 
Teleperformance - Smart personalized service door het gebruik van Data Science
Teleperformance - Smart personalized service door het gebruik van Data Science Teleperformance - Smart personalized service door het gebruik van Data Science
Teleperformance - Smart personalized service door het gebruik van Data Science
 
FunXtion - Interactive Digital Fitness with Data Analytics
FunXtion - Interactive Digital Fitness with Data AnalyticsFunXtion - Interactive Digital Fitness with Data Analytics
FunXtion - Interactive Digital Fitness with Data Analytics
 
fashionTrade - Vroeger noemde we dat Big Data
fashionTrade - Vroeger noemde we dat Big DatafashionTrade - Vroeger noemde we dat Big Data
fashionTrade - Vroeger noemde we dat Big Data
 
BigData Republic - Industrializing data science: a view from the trenches
BigData Republic - Industrializing data science: a view from the trenchesBigData Republic - Industrializing data science: a view from the trenches
BigData Republic - Industrializing data science: a view from the trenches
 
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...
Bicos - Hear how a top sportswear company produced cutting-edge data infrastr...
 
Endrse - Next level online samenwerkingen tussen personalities en merken met ...
Endrse - Next level online samenwerkingen tussen personalities en merken met ...Endrse - Next level online samenwerkingen tussen personalities en merken met ...
Endrse - Next level online samenwerkingen tussen personalities en merken met ...
 
Bovag - Refine-IT - Proces optimalisatie in de automotive sector
Bovag - Refine-IT - Proces optimalisatie in de automotive sectorBovag - Refine-IT - Proces optimalisatie in de automotive sector
Bovag - Refine-IT - Proces optimalisatie in de automotive sector
 
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...
Schiphol - Optimale doorstroom van passagiers op Schiphol dankzij slimme data...
 
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...
Veco - Big Data in de Supply Chain: Hoe Process Mining kan helpen kosten te r...
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about Data
 
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...
VU Amsterdam - Big data en datagedreven waardecreatie: valt er nog iets te ki...
 
Booking.com - Data science and experimentation at Booking.com: a data-driven ...
Booking.com - Data science and experimentation at Booking.com: a data-driven ...Booking.com - Data science and experimentation at Booking.com: a data-driven ...
Booking.com - Data science and experimentation at Booking.com: a data-driven ...
 

KĂźrzlich hochgeladen

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 

KĂźrzlich hochgeladen (20)

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 

Big Data Expo 2015 - Barnsten Why Data Modelling is Essential

  • 1. Why Data Modelling is Essential in a Big Data Environment Mark Barringer
  • 2. Introduction • My name is Mark Barringer (mark.Barringer@embarcadero.com) • Product Manager (Data Architecture Tools) Embarcadero • I live in the beautiful city of Winchester in southern England 2
  • 3. Agenda and Objectives… 3 Data Challenges What are the modern day challenges for the Data Architect Big Data Landscape Adding Value to the Business Data Modelling Techniques How to understand and view the Big Data Landscape Making the Big Data Landscape transparent to the business and how to add real value A look at some of the latest Data Modelling Tips and Tricks applied to the Big Data Environment
  • 4. Challenges facing Data Architecture 4 Federation Data Democratisation Platform Fragmentation Data Lineage Latency Delivery Obfuscation
  • 5. Challenges facing Data Architecture 5 Federation: The application of a single view over multiple repositories. Data Democratisation Platform Fragmentation Data Lineage Latency Delivery Obfuscation
  • 6. Challenges facing Data Architecture 6 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation Data Lineage Latency Delivery Obfuscation
  • 7. Challenges facing Data Architecture 7 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation: The proliferation of non-RDBMS solutions to store data. Data Lineage Latency Delivery Obfuscation
  • 8. Challenges facing Data Architecture 8 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation: The proliferation of non-RDBMS solutions to store data. Data Lineage: Expectation to understand actions performed on the data. Latency Delivery Obfuscation
  • 9. Challenges facing Data Architecture 9 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation: The proliferation of non-RDBMS solutions to store data. Data Lineage: Expectation to understand actions performed on the data. Latency: The trend towards lower end-to end latency of data (from creation to reporting) Delivery Obfuscation
  • 10. Challenges facing Data Architecture 10 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation: The proliferation of non-RDBMS solutions to store data. Data Lineage: Expectation to understand actions performed on the data. Latency: The trend towards lower end-to end latency of data (from creation to reporting) Delivery: Model development in step with development teams. Obfuscation
  • 11. Challenges facing Data Architecture 11 Federation: The application of a single view over multiple repositories. Data Democratisation: The expectation of the business to have more control over the data assets. Platform Fragmentation: The proliferation of non-RDBMS solutions to store data. Data Lineage: Expectation to understand actions performed on the data. Latency: The trend towards lower end-to end latency of data (from creation to reporting) Delivery: Model development in step with development teams. Obfuscation: Expectation to view and understand data models by the business.
  • 12. Why Data Modelling is Essential… 12 Modelling the Business Understand the Landscape Self Documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development
  • 13. Why Data Modelling is Essential… 13 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development Business Modelling • Meaningful abstracted view of the business • Data-centric perspective • 'Anchor point' for other models • Key to successful communication • Develop credibility and relevance with the business • Establish Business Glossaries with consistent definitions • Build a solid foundation for Compliance, Data Governance and Master Data Management • Improve visibility and collaboration with ER/Studio
  • 15. Who are the Data customers and collaborators? 15 DA Technical Collaborators (DBA, ETL, SA) (MetaData Consumers) Data Analysts (Finance, Credit, Mktg.) (Data Consumers) Business Users (Information Producers)
  • 16. Benefit of Relating Metadata to Models • Expand the depth of information by accessing the underlying framework 16 • Models and terms seamlessly integrate to one another
  • 17. Why Data Modelling is Essential… 17 Modelling the business Understand the Landscape Self documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development Understand the Landscape • Create Landscape Inventories • Reverse Engineer • Overcome Information Obscurity • Eliminate Data Silos • Contain much of the detailed meta data • Useful for impact and gap analysis • Platform agnostic • Map to concepts in the Conceptual Data Model and physical objects
  • 18. Enterprise Complexity – Information Obscurity
  • 19. Big Data & NoSQL: The Challenge • Capture new data sources and increase information management footprint • Understanding semi- and un-structured data • “Raw Data is the single source of the truth” • 7 Vs • Velocity, Volume, Variety, • Veracity (conformity of facts – Data in doubt) • Variable, Virtual, Value • Reverse & Forward Engineering (JSON, BSON) • Forward & Reverse Engineer DDL • “We will hopefully find what we didn’t know about that we didn’t know that we didn’t know about”
  • 20. Eliminate Data Silos: Inventory existing databases • What type of data do you own and where can it be found. • Map your data landscape using data models as the foundation. – Each model represents a different database system – Link like data elements together for traceability 20
  • 21. Why Data Modelling is Essential… 21 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development Physical Modelling for Big Data • Accurately model all types of data at rest within the organization. • Not just RDBMS resident data • Document physical meta-data (table space, partitions, etc) • Introduce non-RDBMS data stores e.g. NOSQL, JSON, HIVE • Build many physicals based on business decomposition • Reverse Engineering
  • 22. Traditional (RDBMS) Prescriptive Data Modelling MODEL (and DESIGN) LOAD EXPLORE/ QUERY DATA EXPLORE ‘Schema on write’ Good for Known Unknowns (Repetition)
  • 23. Big Data (NoSQL/Hadoop) Descriptive Data Modelling LOAD QUERY MODEL NoSQL STORE EXPLORE But Fast and Agile! ‘Schema on read’ Good for Unknown Unknowns (Exploration)
  • 24. Data modeling in Big Data Customer - - - NoSQL DATABASE Documents - - - Product - - - Key values - - - Conceptual/ business data model Understanding Logical/physical data model Architecture/Design RELATIONAL DATABASE (i.e., Data warehouse/data mart) May transfer into structured database (using models)
  • 25. Why Data Modelling is Essential… 25 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development Timely Design • Ensuring changes to the physical data models are in step with and relevant to the development methodology used in the organization. • Where modelling meets development. • Create credibility and relevance with the development teams • User Stories, Tasks and Change Management
  • 26. Agile Change Management • Enable “Agile Data Modeler” – Incremental rather than waterfall • Need more granularity than named versions of a model or submodel • Change numbers at Repository check in • Can be associated to user stories, tasks 26
  • 27. Why Data Modelling is Essential… 27 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous Physical models Data Modelling Techniques Agile Development Data Modelling Techniques • Sub-Modelling • Business Decomposition • Visual Data Lineage • Impact Analysis / Where Used • Naming Standards • Data Source Mapping • Universal Mapping • Augmented metadata • Glossary Integration
  • 28. Big Data Notation Enhancement • Physical Model – Objects instead of tables • Nested Objects – “is contained in” relationship type 28
  • 29. Containment Relationship: Array of Nested Objects 29 db.patron.insert( { "_id" : ObjectId("5367ddc4228cd006ab2bc60c"), name: "Joe Bookreader", address: [ { street: "123 Fake Street", city: "Faketon", state: "MA" }, { street: "1 Someother Street", zip: "12345" } ] }) db.book.insert( { title: "MongoDB: The Definitive Guide", author: [ "Kristina Chodorow", "Mike Dirolf" ], published_date: ISODate("2010-09-24"), pages: 216, language: "English", publisher_id: ObjectId("5367dd99228cd006ab2bc60b"), available: 3, checkout: [ { by: ObjectId("5367ddc4228cd006ab2bc60c"), date: ISODate("2012-10-15") } ] }, { title: "50 Tips and Tricks for MongoDB Developer", author: [ "Kristina Chodorow" ], published_date: ISODate("2011-05-06"), pages: 68, language: "English", publisher_id: ObjectId("5367dd99228cd006ab2bc60b") })
  • 30. Hive & ER/Studio 30 Understanding the Big Data Schema
  • 33. Technique: Automated Naming Standards Real-time update while typing 33
  • 34. Technique: Glossary Integration • Associate Data Architect objects to Business glossary terms – Model, submodel – Entity, Table – Attribute, Column – Domain – View • Push terms to glossary 34
  • 35. Why Data Modelling is essential… 35 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous & Big Data Physical models Data Modelling Techniques Agile Development Business metadata • Provide the business with the ability to centrally manage its own meta data in terms of definitions, rules and relationships in a structured and curated manner. • Facilitate the binding of the business elements to technical elements within the models and other documentation. • Data Dictionary • Self-Service Discovery and Reporting
  • 36. Providing Business Context A taxonomy of searchable terms mapped to unique concepts R&D Entity Business Term Patient Recruitment Data Attribute Business Term Batch Supply Data DatasourcePhysical Model Column Logical diagram Table Clinical Discussion threads Conceptual & Process Diagrams Glossaries Supply Chain
  • 37. Why Data Modelling is essential… 37 Modelling the Business Understand the Landscape Self documenting Business Heterogeneous & Big data Physical models Data Modeling Techniques Agile Development Holistic, integrated modelling that can present the same meta data to different audiences in the most appropriate format. The single most important challenge to overcome is that of communication and collaboration and to Build Trust in Data. Without the ability to communicate effectively to a wide variety of audiences even the most diligently documented organisation will be unable to benefit from it.
  • 39. Win a FitBit Charge HR Leave a Business Card at the Barnsten / Embarcadero Stand Raymond Horsten (r.horsten@barnsten.com) Mark Barringer (mark.barringer@embarcadero.com)
  • 40. Questions and Answers Raymond Horsten (r.horsten@barnsten.com) Mark Barringer (mark.barringer@embarcadero.com) 40
  • 41. Building Trust in Data : Collaboration Syndication Governance and Collaboration Technical Metadata Business Metadata Metadata Repository Data Modeling Team Server Web Architecture Business SDLC & Information Management Integrated Tooling Enterprise Data

Hinweis der Redaktion

  1. Data Modelling as was... Conceptual Data Model: this is a concept that data modellers rave about but few others understand. Many orgs don’t have one and are quite happy. Some orgs have one but don’t use it and consider it a waste of money this leaves a (proportionally) vey small group of organisations that have a CDM and use it. Most organisations have a large monolithic Enterprise Data Warehouse – this acts a focus for most of the modelling activity in an organisation It can be argued that the EDW is a de facto Logical data model of the organisation Whilst the EDW serves a very useful purpose in most organisations it does have its limitations many of which seem to be brought into sharp relief in recent years. It is usually seen as an IT function black hole in terms of resources and requirements
  2. DA = Data Architect DBA = Database Administrator ETL = Extract, Transform, Load developer SA = System Analyst As business analysts and data analysts seek to become stronger bridges between business and IT, they become power users of data management tools and would need access to business definitions in data management tools. ER/Studio Team Server helps to expand the circle of data comprehension. When new versions of DBPS come out, they will be well integrated with ER/Studio Team Server. So DBAs would be interested in ER/Studio Team Server as well
  3. Introduce first benefit to the industry – pose pain points – introduce Rob to demo and then field questions/input
  4. So why can't organizations make more effective use of information?  In short, it's information obscurity.  You see, enterprise data isn't just BIG, it's complex.  Our enterprise customers have hundreds of systems and hundreds of thousands of data elements.  If Sales says a customer is anyone they're calling on, if Finance says it's anyone who's paid us money, and Support says it's anyone with paid-up maintenance who's really a Customer?  If Customer data is in a hundred different systems, if it's escaped the data center on mobile devices or migrated to the cloud, where do I go to find the right data, and how do I interpret it?  It's no surprise that most organizations can't leverage all of their data - in many cases, users can't even find it.   Transition: Many organizations turning to data governance as a way forward.
  5. Big data & NoSQL provide huge potential to explore, understand and gain value from data sets that are either too large, complex or diverse to fit into traditional database management systems. They enable you to capture new types of semi-structured or unstructured data sources in their raw format, with the goal of providing the raw data as the single source of the truth. The other reason why Big Data & NoSQL platforms have become so popular are because of the low price and high performance ratio that they can provide in comparison with traditional databases, especially when having to store huge amounts of data. However this comes with additional challenges of managing and understanding all of this data. You may be aware of the 3 Vs : Velocity, Volume and Variety , but recently 4 more have been added. Especially: Veracity: Regarding the certainty of the data Variable: Can have variable schemas, variable ways of interpreting the same data: E.g. from the customer perspective, or vendor perspective. (This leads to schema on read so that you have a different emphasis when reading.) Virtual: Virtualization of data source Value: The Most important is extracting value of the data Managing this data , this big data effectively, will hopefully lead us to uncover the Unknown Unknowns about the data. Or put differently “We will hopefully find what we didn’t know about that we didn’t know that we didn’t know about” The Unknown Unknowns are potentially big opportunities that fall far outside of the day-to-day of your business. These include things like: Market segments you haven’t discovered yet Features that people love Product innovations The “why?” behind customer behaviours But in order to manage all of this data, you will first need to understand the schema governing or describing the data. And I would like look briefly at some of the schema differences when modeling Big Data & NoSQL platforms vs traditional databases. --------- Hadoop and MongoDB are the enterprise Big Data Leaders Hadoop – cost-efficient staging and ETL of very large data sets MongoDB – Customer/User profile management for large-scale web sites Greenplum – Innovative MPP analytic database technology – now part of EMC Pivotal Labs
  6. In a big data environment, data is usually stored in a noSQL database, meaning a non-relational data store. In this data store, there are no relationships that exist. However, categories are built based upon the unstructured data and data is analyzed using various tools (for example, Hive in the Hadoop environment). NoSQL (‘not only SQL’) environments such as Hadoop is ‘schema on read’ versus ‘schema on write’. For example, Hadoop uses an HDFS for its file structure and it is a ‘schema on read’ meaning that we don’t need to define the data structure before loading the data (which we would need to do if it were a traditional ‘schema on write’ such as a relational data warehouse). The pros of this approach are: It is not necessary to define the structure and therefore there is great flexibility in how the data (structured and especially semi-structured) can be stored, queried, and used. It promotes experimentation The cost of getting things ‘wrong’ has a very low cost (since it was only experimental) It is an agile approach since it speeds up the time to have the data available versus having to first model, develop ETL, perform data quality, etc. The cons of this approach are: Expensive because compute resources need to be high It is not self documenting Have to create jobs that creates the schema on read Data modeling may take place after data is analyzed in order to: Understand the data Provide a design for transfer into a relational data store, if needed and decided upon Investigating computing is an environment where we can experiment. Once we have decided that some of the information is useful and what can be used in production, then we may decide to transfer it into a structured environment (EDW) and this is where data modeling is needed.
  7. Many organizations have struggled making the transition from traditional “waterfall” data modeling to more responsive and iterative agile approaches. An important aspect of this is granular change management, enabling checkout/checkin of only those objects required for a specific modification, rather than a full model or sub-model. Just as important, is knowing “why” the changes were made. Therefore, object checkin/checkout can now be associated with a specific task or “user story”, which is a practice agile developers have been using for years. Knowing the “why” is also extremely important from a data governance perspective.
  8. Because of Big Data, we have had to enhance our notations to accommodate new types of physical models We are now using Objects instead of Table in the Physical Models. The big thing with the Big Data stores, is that we can have nested objects in those structures. We have introduced a new relationship type that only shows on Big Data platforms, and that is the Is Contained In relationship type. An we’ll see shortly in the demo that we can handle nested objects, and nested arrays of objects using these notations. In the Diagram, the “Diamond on one end, with cardinality on the other end” corresponds to the “Is Contained In” relationship type. We have utilized that notation convention from UML. So that those who are familiar with that notation, should have an easy transition to our tool.
  9. This is the JSON code for a couple of the collections in MongoDB. You may be familiar with JSON, it contains objects called collections, which usually contain nested objects. These objects typically contain key value pairs. ++ Show SLIDE : Containment Relationship: Array of Nested Objects * On the LEFT side you see PATRON : If you look down you'll see ADDRESS there. In the Diagram the relationship line from PATRON to ADDRESS had the star. We can see that there are 2 Addresses referenced in there. => You will also notice that After ADDRESS there is a square brackets "[" => And that kicks off the ARRAY of NESTED OBJECTS. When reverse engineering that "[" indicates to the tool that this is an ARRAY of NESTED OBJECTS. * On the RIGHT Side you see BOOK: Notice that CHECKOUT has after the colon (:) the "[" . But although there is only has 1 instance of checkout there, the syntax indicates that this is an ARRAY. => Moving to the Next slide of single nested objects.
  10. In HIVE you can reverse engineer and forward engineer the DDL, this looks like another DB platform. You just click on the DDL tab, and we’ll see what this looks like in the tool in a few minutes. That was a quick overview, but it is much more fun to see it in action, so lets move over to the demo. --- Hive is a SQL like query language in Hadoop. Even though data is not stored in tables. It is normally text, e.g.: commar delimited. -> Hive applies a schema to it. Note that attribute names in Hive would not make a Data Modeler happy (e.g.: n1 int, mars2 int, prep int,etc..). It is almost like a non-materialized view on top of a text file.
  11. Naming Standards Automation:  Currently we invoke our naming standards manually, applying to a submodel at a time.  The auto naming standards will allow us to bind a naming standards template to data model objects such as entities/tables and attributes/columns.  The typical use case would be to have the physical name change in place as we are editing the logical name (that’s how ERwin does it in a far less elegant fashion through their macro formulas).  We will also be able to apply physical to logical mapping (reverse direction) if that is desired.  Standards can be attached to individual objects, or defaulted at the sub-model level. 
  12. We are adding the capability for real-time integration between the Data Architect tool and the glossaries/terms in Team server. If a defined glossary term is used in naming or defining major model objects, those terms are automatically highlighted with capability to hyperlink and/or copy the term definition to the model object. Mouse-over the highlighted term will show the definition from the Team Server glossary. We will be extending this capability further later this year to include glossary integration with process artifacts in the Business Architect tool as well.
  13. We have a broad portfolio of products spanning Data Modeling, Database Management and Application Development. We often refer to the Design , Develop and Deliver areas. We can see on the left hand side of the diagram the Data Architecture products that we will look at today, together with the Database Administrator and Database Developer products at the top and right that we will look at DB PowerStudio refresher next week. And at the Center, we have the Team Server Core, which ties all of these together. On the right of the slide, you can see some of the many database platforms that we support, and this list is constantly expanding.
  14. Metadata Governance and Syndication Applying the principles of Governance to metadata collaborative authoring Unleashing metadata by delivering it into core SDLC and Information Management workflows Value Propositions: Data Architect To: Peer Architects Benefit: Reuse, consistency, coordination Data Architect To: Business Analyst, Developer Benefit: Review and approval, coordination Steward, Data SME, Governance Team To: Metadata Benefit: Collaborative authoring, business review and approval Information Management Professionals To: Metadata Benefit: Discoverability, comprehension, quality, access to policy Data Analyst To: Metadata Benefit: Discoverability, definitions, lineage, security and sensitivity advisories