SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Data Modelling
It’s a lot more than drawing diagrams
George McGeachie
Metadata Matters Limited
My lightning talk at PG day in 2014
(I liked that venue – Horwood House)
My entry in the 2nd Quadrant
blog last week – used example
of automating Data Vault
design and creation
https://blog.2ndquadrant.com/data-modelling-lot-just-diagram/
2
This is my favourite theme
The right tool can
give you a lot more
than just this messy
Diagram
– would you want to
work with this
diagram?
3
This is my favourite theme
A data model is a lot more than just a Diagram
4
This is my favourite theme
 Check against your design standards
 The tedious stuff, like making sure all your
tables have the standard audit columns
 Do you need JSON?
 How much will this DB grow?
 Managing (and comparing) schema, table &
column versions
 Building Data Vaults – see 2nd Quadrant blog
5
Automate tasks – before you
build the database
Available automation
A Contextual menu is one way of
accessing automation – check the
model, export JSON to a file, apply
Naming Standards, adding audit
columns
Add your own model
checks, along with
automatic fixing for
those problems if
possible
(e.g. adding
surrogate key)
7
Check your design meets your
design standards
8
Make sure all your tables have
the standard audit columns
Don’t blink or you’ll miss it
9
The PDM – without audit columns
10
Less than 2 seconds later …
11
Do you need JSON?
{
"Name" : "Departments",
"Code" : "Departments",
"Fully Qualified Name" : "Group0.Departments",
"Fully Qualified Code" : "Group0.Departments",
"Owner" : "Group0",
"Object Type" : "Table",
"id" : "8731F3EE-8E53-46C6-A873-81C522F51717",
"description" : "contains the names and heads of th
"note" : "<NONE>",
"Columns" :
[
{
"Name" : "DepartmentID",
"Code" : "DepartmentID",
"Fully Qualified Name" : "Departments
"Fully Qualified Code" : "Group0.Depa
"Object Type" : "Column",
"id" : "D23F6064-87A8-4D1D-92D0-70F35
"description" : "short one",
"note" : "<NONE>",
"Data Type" : "INT4",
"Length" : "4",
"Precision" : "0",
"Primary?" : "TRUE",
"FK?" : "FALSE",
"Mandatory?" : "TRUE",
 Extract current statistics, define growth rates
12
How much will this database
grow?
Estimate of the size of the Database "PhysicalDataModel_1"...
Number Estimated size Object
------------------------- ----------------------- ----------------------------------------
------------
1,556 312 KB Table "Contacts"
17,370 3,475 KB Table "Customers"
917 KB Index "IX_customer_name"
130 7 KB Table "Departments"
1,945 390 KB Table "Employees"
182 13 KB Table "FinancialCodes"
2,179 39 KB Table "FinancialData"
259 130 KB Table "MarketingInformation"
259 KB Long data types
274 KB Index "MarketingTextIndex"
1,379 9,651 KB Table "Products"
9,650 KB Long data types
34 KB Index "IX_product_name"
60 KB Index "IX_product_description"
41 KB Index "IX_product_size"
41 KB Index "IX_product_color"
28,453 619 KB Table "SalesOrderItems"
382,637,520 11,595,077 KB Table "SalesOrders"
3,268 467 KB Table "SpatialContacts"
467 25 KB Table "SpatialShapes"
------------------------- ----------------------- ----------------------------------------
------------
11,621,481 KB Total estimated space
The data will be distributed on the following tablespaces:
Estimated size Tablespace
----------------------- ----------------------------------------------
1,367 KB system
13
Estimate Database size
 Write your own estimation script
14
If you don’t like the way it’s
done
 Branching
 Comparing
versions
15
Versioning
 Check models into the
repository, but don’t
update the mainline until
they’ve been approved
16
Check in model for peer review
17
Integrate the 2nd Branch back
into the 1st Branch
Models updated
with selected
changes
Still able to access version 1
18
Simon and Hannu
say …
Page 53
• Understand Database
Dependencies
◦ e.g. Table  View  Procedure
I only have the first edition of this
excellent book
 ETL Jobs
 Forms and Reports
 Applications
 XML Message Schemas
 Regulatory Requirements
 Business Processes
 Use Cases
 JIRA tickets
etc.
19
Databases have connections
20
Choose your tools carefully
What Tools are there?
The big 3
ERwin, ER/Studio, PowerDesigner
Others
Dezign
Sparx EA
ModelRight
Silverrun
IBM Infosphere Data Architect
Toad Data Modeller
might not all support PG
George McGeachie
Co-author of “Data Modeling Made Simple with
PowerDesigner”, data modeller and strategist,
SAP PowerDesigner trainer, and data modelling
tool junkie.
@metadatajunkie
Blog – metadatajunkie.wordpress.com
https://www.linkedin.com/in/georgemcgeachie/
George.McGeachie@MetadataMatters.com
Mobile: +44 (0) 794 293 0648

Weitere ähnliche Inhalte

Was ist angesagt?

SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsSlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsJohn De Goes
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010ERwin Modeling
 
MS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxMS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxDataminingTools Inc
 
01 Persistence And Orm
01 Persistence And Orm01 Persistence And Orm
01 Persistence And OrmRanjan Kumar
 
Datalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointDatalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointMike Plant
 
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessGive Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessFormant
 
Making Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIMaking Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIAvtex
 
Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Martin Bém
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010ERwin Modeling
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesBurn & Born
 
Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3DTU Library
 

Was ist angesagt? (16)

Star schema PPT
Star schema PPTStar schema PPT
Star schema PPT
 
SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsSlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
 
Xml
XmlXml
Xml
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010
 
MS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxMS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmx
 
01 Persistence And Orm
01 Persistence And Orm01 Persistence And Orm
01 Persistence And Orm
 
Datalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointDatalayer Best Practices with Observepoint
Datalayer Best Practices with Observepoint
 
Lançamento ERwin 08/02
Lançamento ERwin 08/02Lançamento ERwin 08/02
Lançamento ERwin 08/02
 
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessGive Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
 
Making Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIMaking Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BI
 
Rendering The Fat
Rendering The FatRendering The Fat
Rendering The Fat
 
Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28
 
Hibernate II
Hibernate IIHibernate II
Hibernate II
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, Disadvantages
 
Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3
 

Ähnlich wie Lightning talk at PG Conf UK 2018

Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big DataJames Serra
 
MongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxMongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxKalpitPandit1
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011navaidkhan
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 
Best Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkBest Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkDatabricks
 
Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessyalisassoon
 
A guide to preparing your data for tableau
A guide to preparing your data for tableauA guide to preparing your data for tableau
A guide to preparing your data for tableauPhillip Reinhart
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNabclearnn
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneMongoDB
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science DemystifiedEmily Robinson
 
L’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneL’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneMongoDB
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data PlatformAndrei Savu
 
Snowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessSnowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessGiuseppe Gaviani
 

Ähnlich wie Lightning talk at PG Conf UK 2018 (20)

Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 
MongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxMongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptx
 
Date Analysis .pdf
Date Analysis .pdfDate Analysis .pdf
Date Analysis .pdf
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Best Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkBest Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache Spark
 
Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your business
 
A guide to preparing your data for tableau
A guide to preparing your data for tableauA guide to preparing your data for tableau
A guide to preparing your data for tableau
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazione
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science Demystified
 
No SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability MeetupNo SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability Meetup
 
Data engineering design patterns
Data engineering design patternsData engineering design patterns
Data engineering design patterns
 
PowerBI Training
PowerBI Training PowerBI Training
PowerBI Training
 
L’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneL’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova Generazione
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
 
Snowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessSnowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your business
 

Kürzlich hochgeladen

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Lightning talk at PG Conf UK 2018

  • 1. Data Modelling It’s a lot more than drawing diagrams George McGeachie Metadata Matters Limited
  • 2. My lightning talk at PG day in 2014 (I liked that venue – Horwood House) My entry in the 2nd Quadrant blog last week – used example of automating Data Vault design and creation https://blog.2ndquadrant.com/data-modelling-lot-just-diagram/ 2 This is my favourite theme
  • 3. The right tool can give you a lot more than just this messy Diagram – would you want to work with this diagram? 3 This is my favourite theme
  • 4. A data model is a lot more than just a Diagram 4 This is my favourite theme
  • 5.  Check against your design standards  The tedious stuff, like making sure all your tables have the standard audit columns  Do you need JSON?  How much will this DB grow?  Managing (and comparing) schema, table & column versions  Building Data Vaults – see 2nd Quadrant blog 5 Automate tasks – before you build the database
  • 6. Available automation A Contextual menu is one way of accessing automation – check the model, export JSON to a file, apply Naming Standards, adding audit columns
  • 7. Add your own model checks, along with automatic fixing for those problems if possible (e.g. adding surrogate key) 7 Check your design meets your design standards
  • 8. 8 Make sure all your tables have the standard audit columns Don’t blink or you’ll miss it
  • 9. 9 The PDM – without audit columns
  • 10. 10 Less than 2 seconds later …
  • 11. 11 Do you need JSON? { "Name" : "Departments", "Code" : "Departments", "Fully Qualified Name" : "Group0.Departments", "Fully Qualified Code" : "Group0.Departments", "Owner" : "Group0", "Object Type" : "Table", "id" : "8731F3EE-8E53-46C6-A873-81C522F51717", "description" : "contains the names and heads of th "note" : "<NONE>", "Columns" : [ { "Name" : "DepartmentID", "Code" : "DepartmentID", "Fully Qualified Name" : "Departments "Fully Qualified Code" : "Group0.Depa "Object Type" : "Column", "id" : "D23F6064-87A8-4D1D-92D0-70F35 "description" : "short one", "note" : "<NONE>", "Data Type" : "INT4", "Length" : "4", "Precision" : "0", "Primary?" : "TRUE", "FK?" : "FALSE", "Mandatory?" : "TRUE",
  • 12.  Extract current statistics, define growth rates 12 How much will this database grow?
  • 13. Estimate of the size of the Database "PhysicalDataModel_1"... Number Estimated size Object ------------------------- ----------------------- ---------------------------------------- ------------ 1,556 312 KB Table "Contacts" 17,370 3,475 KB Table "Customers" 917 KB Index "IX_customer_name" 130 7 KB Table "Departments" 1,945 390 KB Table "Employees" 182 13 KB Table "FinancialCodes" 2,179 39 KB Table "FinancialData" 259 130 KB Table "MarketingInformation" 259 KB Long data types 274 KB Index "MarketingTextIndex" 1,379 9,651 KB Table "Products" 9,650 KB Long data types 34 KB Index "IX_product_name" 60 KB Index "IX_product_description" 41 KB Index "IX_product_size" 41 KB Index "IX_product_color" 28,453 619 KB Table "SalesOrderItems" 382,637,520 11,595,077 KB Table "SalesOrders" 3,268 467 KB Table "SpatialContacts" 467 25 KB Table "SpatialShapes" ------------------------- ----------------------- ---------------------------------------- ------------ 11,621,481 KB Total estimated space The data will be distributed on the following tablespaces: Estimated size Tablespace ----------------------- ---------------------------------------------- 1,367 KB system 13 Estimate Database size
  • 14.  Write your own estimation script 14 If you don’t like the way it’s done
  • 16.  Check models into the repository, but don’t update the mainline until they’ve been approved 16 Check in model for peer review
  • 17. 17 Integrate the 2nd Branch back into the 1st Branch Models updated with selected changes Still able to access version 1
  • 18. 18 Simon and Hannu say … Page 53 • Understand Database Dependencies ◦ e.g. Table  View  Procedure I only have the first edition of this excellent book
  • 19.  ETL Jobs  Forms and Reports  Applications  XML Message Schemas  Regulatory Requirements  Business Processes  Use Cases  JIRA tickets etc. 19 Databases have connections
  • 20. 20 Choose your tools carefully What Tools are there? The big 3 ERwin, ER/Studio, PowerDesigner Others Dezign Sparx EA ModelRight Silverrun IBM Infosphere Data Architect Toad Data Modeller might not all support PG
  • 21. George McGeachie Co-author of “Data Modeling Made Simple with PowerDesigner”, data modeller and strategist, SAP PowerDesigner trainer, and data modelling tool junkie. @metadatajunkie Blog – metadatajunkie.wordpress.com https://www.linkedin.com/in/georgemcgeachie/ George.McGeachie@MetadataMatters.com Mobile: +44 (0) 794 293 0648