SlideShare a Scribd company logo
1 of 22
The Dark Art of Data Migration Presented by Graham Morgan MD Spatial Consultants Ltd
Tuning in …
3 Expert independent advice
Is it that Difficult? Drawings with layout Features split Implied data (color) Multiple sources Choreography 4 Missing data Unstructured data Spatial formats Variability within Different meanings Inaccuracies
Target - SpatialNet All data is stored in an Oracle database 5 Data is: Feature based Highly organized Classified / Categorized Consistent Data integrity rules Standardized
Target - SpatialNet All data stored in Oracle database 6
7 SpatialInfo data model
8 Where do we start?
Features or Drawings? 9
Data Challenges Where is the feature record? Different data sources / formats No single business identifier for features Which attribute values are correct? Multiple copies, alternatives Do we have all the attributes? Do we have connectivity information? Infrastructure Level Fiber level 10
System Challenges It isn’t just a data migration… System footprints  Business functions Data (CRUD) Access security 11 Data migration is a component of a business improvement programme
Data Migration Process Requirements Estimate / Quote Design Implement Test Deploy Celebrate … 12
Data Migration Process 13
Data Migration Process - Prepare Map existing business functions to systems Existing / Planned Inventory data Data to migrate to SpatialNet  / other systems? Prepare data documentation Develop baseline tests Feature counts Network trace tests Make local corrections 14
Data Migration Process - Develop Exchange data in situ (lossless) Load directly ‘As Is’ into Oracle Pre-process for load option Re-project Inventory / study / profile / explore / specify Migrate to intermediate model Validate Migrate to SpatialNet Validate Correct data at source … Repeat … 15
16 Migration Route
Profile / Explore Defend against duplicates – ensure unique keys Missing features Attribute value patterns / consistency Mandatory, Unique, Domain values Logical consistency Geometry types & consistency Connectivity: inferred vs. recorded 17
Tools Source to Oracle Staging FME FDO/.NET Manifold Oracle Staging to Neutral PL/SQL FME NHibernate/.NET 18 ,[object Object]
ESRI ArcGIS
MapInfo
Talend / Open Source
Oracle Integration

More Related Content

What's hot

2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...
2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...
2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...GIS in the Rockies
 
Resume quaish abuzer
Resume quaish abuzerResume quaish abuzer
Resume quaish abuzerquaish abuzer
 
Oracle Forms Modernization Roadmap
Oracle Forms Modernization RoadmapOracle Forms Modernization Roadmap
Oracle Forms Modernization RoadmapKai-Uwe Möller
 
Software Driven ETL Conversion Platform
Software Driven ETL Conversion PlatformSoftware Driven ETL Conversion Platform
Software Driven ETL Conversion PlatformAnalytiX DS
 
Integration strategy for large erp deployments
Integration strategy for large erp deploymentsIntegration strategy for large erp deployments
Integration strategy for large erp deploymentsArup Dutta
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview trainingMondy Holten
 
Copy of Alok_Singh_CV
Copy of Alok_Singh_CVCopy of Alok_Singh_CV
Copy of Alok_Singh_CVAlok Singh
 
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna Kishore
 
Leveraging OTM's Fusion Transportation Intelligence
Leveraging OTM's Fusion Transportation IntelligenceLeveraging OTM's Fusion Transportation Intelligence
Leveraging OTM's Fusion Transportation IntelligenceMavenWire
 
Ajith_kumar_4.3 Years_Informatica_ETL
Ajith_kumar_4.3 Years_Informatica_ETLAjith_kumar_4.3 Years_Informatica_ETL
Ajith_kumar_4.3 Years_Informatica_ETLAjith Kumar Pampatti
 

What's hot (20)

Mallikarjun_Konduri
Mallikarjun_KonduriMallikarjun_Konduri
Mallikarjun_Konduri
 
ETL
ETLETL
ETL
 
2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...
2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...
2013 Enterprise Track, Using Spatial ETL in a Multi-vendor Enterprise GIS Env...
 
Resume quaish abuzer
Resume quaish abuzerResume quaish abuzer
Resume quaish abuzer
 
Arun Kondra
Arun KondraArun Kondra
Arun Kondra
 
Oracle Forms Modernization Roadmap
Oracle Forms Modernization RoadmapOracle Forms Modernization Roadmap
Oracle Forms Modernization Roadmap
 
Software Driven ETL Conversion Platform
Software Driven ETL Conversion PlatformSoftware Driven ETL Conversion Platform
Software Driven ETL Conversion Platform
 
informatica_developer
informatica_developerinformatica_developer
informatica_developer
 
Integration strategy for large erp deployments
Integration strategy for large erp deploymentsIntegration strategy for large erp deployments
Integration strategy for large erp deployments
 
Ramachandran_ETL Developer
Ramachandran_ETL DeveloperRamachandran_ETL Developer
Ramachandran_ETL Developer
 
Siva - Resume
Siva - ResumeSiva - Resume
Siva - Resume
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview training
 
Copy of Alok_Singh_CV
Copy of Alok_Singh_CVCopy of Alok_Singh_CV
Copy of Alok_Singh_CV
 
Mahesh_ETL
Mahesh_ETLMahesh_ETL
Mahesh_ETL
 
CloverETL Basic Training Excerpt
CloverETL Basic Training ExcerptCloverETL Basic Training Excerpt
CloverETL Basic Training Excerpt
 
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
 
Leveraging OTM's Fusion Transportation Intelligence
Leveraging OTM's Fusion Transportation IntelligenceLeveraging OTM's Fusion Transportation Intelligence
Leveraging OTM's Fusion Transportation Intelligence
 
Resume_ETL__Testing
Resume_ETL__TestingResume_ETL__Testing
Resume_ETL__Testing
 
Jithender_3+Years_Exp_ETL Testing
Jithender_3+Years_Exp_ETL TestingJithender_3+Years_Exp_ETL Testing
Jithender_3+Years_Exp_ETL Testing
 
Ajith_kumar_4.3 Years_Informatica_ETL
Ajith_kumar_4.3 Years_Informatica_ETLAjith_kumar_4.3 Years_Informatica_ETL
Ajith_kumar_4.3 Years_Informatica_ETL
 

Viewers also liked

The ideal online course
The ideal online course The ideal online course
The ideal online course Telma Martins
 
179_elgoibar2a.doc
179_elgoibar2a.doc179_elgoibar2a.doc
179_elgoibar2a.docbinovo
 
GIS Update - Travels of a GIS Gypsy
GIS Update - Travels of a GIS GypsyGIS Update - Travels of a GIS Gypsy
GIS Update - Travels of a GIS GypsyGraham Morgan
 
Geo Spatial Technology And How It Is Changing The Way We Use The Web
Geo Spatial Technology And How It Is Changing The Way We Use The WebGeo Spatial Technology And How It Is Changing The Way We Use The Web
Geo Spatial Technology And How It Is Changing The Way We Use The WebGraham Morgan
 
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...Shane Tilton
 
The Buyer's Journey - by Chris Lema
The Buyer's Journey - by Chris LemaThe Buyer's Journey - by Chris Lema
The Buyer's Journey - by Chris LemaChris Lema
 

Viewers also liked (8)

The ideal online course
The ideal online course The ideal online course
The ideal online course
 
179_elgoibar2a.doc
179_elgoibar2a.doc179_elgoibar2a.doc
179_elgoibar2a.doc
 
Where Next?
Where Next?Where Next?
Where Next?
 
Pablo nerudah
Pablo nerudahPablo nerudah
Pablo nerudah
 
GIS Update - Travels of a GIS Gypsy
GIS Update - Travels of a GIS GypsyGIS Update - Travels of a GIS Gypsy
GIS Update - Travels of a GIS Gypsy
 
Geo Spatial Technology And How It Is Changing The Way We Use The Web
Geo Spatial Technology And How It Is Changing The Way We Use The WebGeo Spatial Technology And How It Is Changing The Way We Use The Web
Geo Spatial Technology And How It Is Changing The Way We Use The Web
 
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...
Video Games and Diabetes: Promoting Diabetes Awareness and Management in Appa...
 
The Buyer's Journey - by Chris Lema
The Buyer's Journey - by Chris LemaThe Buyer's Journey - by Chris Lema
The Buyer's Journey - by Chris Lema
 

Similar to The Dark Art of Data Migration

Python + MPP Database = Large Scale AI/ML Projects in Production Faster
Python + MPP Database = Large Scale AI/ML Projects in Production FasterPython + MPP Database = Large Scale AI/ML Projects in Production Faster
Python + MPP Database = Large Scale AI/ML Projects in Production FasterPaige_Roberts
 
Arun Mathew Thomas_resume
Arun Mathew Thomas_resumeArun Mathew Thomas_resume
Arun Mathew Thomas_resumeARUN THOMAS
 
Using FME to support open data initiatives and INSPIRE
Using FME to support open data initiatives and INSPIREUsing FME to support open data initiatives and INSPIRE
Using FME to support open data initiatives and INSPIREIMGS
 
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSync
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSyncWebinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSync
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSyncAPPSeCONNECT
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesIvo Andreev
 
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"jstrobl
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architectureCosta Pissaris
 
CV | Sham Sunder | Data | Database | Business Intelligence | .Net
CV | Sham Sunder | Data | Database | Business Intelligence | .NetCV | Sham Sunder | Data | Database | Business Intelligence | .Net
CV | Sham Sunder | Data | Database | Business Intelligence | .NetSham Sunder
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com
 
KScope14 FDMEE Multiproduct
KScope14 FDMEE MultiproductKScope14 FDMEE Multiproduct
KScope14 FDMEE MultiproductAlithya
 
Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...Open Cyber University of Korea
 
Managing data interoperability with FME
Managing data interoperability with FMEManaging data interoperability with FME
Managing data interoperability with FMEIMGS
 
FDMEE Can Do That?
FDMEE Can Do That?FDMEE Can Do That?
FDMEE Can Do That?Alithya
 
Resume_RaghavMahajan_ETL_Developer
Resume_RaghavMahajan_ETL_DeveloperResume_RaghavMahajan_ETL_Developer
Resume_RaghavMahajan_ETL_DeveloperRaghav Mahajan
 
CA ERwin Data Modeler End User Presentation
CA ERwin Data Modeler End User PresentationCA ERwin Data Modeler End User Presentation
CA ERwin Data Modeler End User PresentationCA RMDM Latam
 

Similar to The Dark Art of Data Migration (20)

Python + MPP Database = Large Scale AI/ML Projects in Production Faster
Python + MPP Database = Large Scale AI/ML Projects in Production FasterPython + MPP Database = Large Scale AI/ML Projects in Production Faster
Python + MPP Database = Large Scale AI/ML Projects in Production Faster
 
L08 Data Source Layer
L08 Data Source LayerL08 Data Source Layer
L08 Data Source Layer
 
Arun Mathew Thomas_resume
Arun Mathew Thomas_resumeArun Mathew Thomas_resume
Arun Mathew Thomas_resume
 
Using FME to support open data initiatives and INSPIRE
Using FME to support open data initiatives and INSPIREUsing FME to support open data initiatives and INSPIRE
Using FME to support open data initiatives and INSPIRE
 
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSync
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSyncWebinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSync
Webinar: Successful Data Migration to Microsoft Dynamics 365 CRM | InSync
 
L17 Data Source Layer
L17 Data Source LayerL17 Data Source Layer
L17 Data Source Layer
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best Practices
 
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
 
Sr odi consultant
Sr  odi consultantSr  odi consultant
Sr odi consultant
 
Pradeep_resume_ETL Testing
Pradeep_resume_ETL TestingPradeep_resume_ETL Testing
Pradeep_resume_ETL Testing
 
L15 Data Source Layer
L15 Data Source LayerL15 Data Source Layer
L15 Data Source Layer
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
 
CV | Sham Sunder | Data | Database | Business Intelligence | .Net
CV | Sham Sunder | Data | Database | Business Intelligence | .NetCV | Sham Sunder | Data | Database | Business Intelligence | .Net
CV | Sham Sunder | Data | Database | Business Intelligence | .Net
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
KScope14 FDMEE Multiproduct
KScope14 FDMEE MultiproductKScope14 FDMEE Multiproduct
KScope14 FDMEE Multiproduct
 
Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...
 
Managing data interoperability with FME
Managing data interoperability with FMEManaging data interoperability with FME
Managing data interoperability with FME
 
FDMEE Can Do That?
FDMEE Can Do That?FDMEE Can Do That?
FDMEE Can Do That?
 
Resume_RaghavMahajan_ETL_Developer
Resume_RaghavMahajan_ETL_DeveloperResume_RaghavMahajan_ETL_Developer
Resume_RaghavMahajan_ETL_Developer
 
CA ERwin Data Modeler End User Presentation
CA ERwin Data Modeler End User PresentationCA ERwin Data Modeler End User Presentation
CA ERwin Data Modeler End User Presentation
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

The Dark Art of Data Migration

  • 1. The Dark Art of Data Migration Presented by Graham Morgan MD Spatial Consultants Ltd
  • 4. Is it that Difficult? Drawings with layout Features split Implied data (color) Multiple sources Choreography 4 Missing data Unstructured data Spatial formats Variability within Different meanings Inaccuracies
  • 5. Target - SpatialNet All data is stored in an Oracle database 5 Data is: Feature based Highly organized Classified / Categorized Consistent Data integrity rules Standardized
  • 6. Target - SpatialNet All data stored in Oracle database 6
  • 8. 8 Where do we start?
  • 10. Data Challenges Where is the feature record? Different data sources / formats No single business identifier for features Which attribute values are correct? Multiple copies, alternatives Do we have all the attributes? Do we have connectivity information? Infrastructure Level Fiber level 10
  • 11. System Challenges It isn’t just a data migration… System footprints Business functions Data (CRUD) Access security 11 Data migration is a component of a business improvement programme
  • 12. Data Migration Process Requirements Estimate / Quote Design Implement Test Deploy Celebrate … 12
  • 14. Data Migration Process - Prepare Map existing business functions to systems Existing / Planned Inventory data Data to migrate to SpatialNet / other systems? Prepare data documentation Develop baseline tests Feature counts Network trace tests Make local corrections 14
  • 15. Data Migration Process - Develop Exchange data in situ (lossless) Load directly ‘As Is’ into Oracle Pre-process for load option Re-project Inventory / study / profile / explore / specify Migrate to intermediate model Validate Migrate to SpatialNet Validate Correct data at source … Repeat … 15
  • 17. Profile / Explore Defend against duplicates – ensure unique keys Missing features Attribute value patterns / consistency Mandatory, Unique, Domain values Logical consistency Geometry types & consistency Connectivity: inferred vs. recorded 17
  • 18.
  • 21. Talend / Open Source
  • 23.
  • 24. Promote to SpatialNet Standard load procedure Dictionary mapping Apply symbol rotation, generate annotation Validation reports 20
  • 25. Shortcuts Pitfalls Data analysis in Oracle- rich toolset- neutral model Correct data at source- with familiar tools Train users on SpatialNet during migration- with familiar data Script everything Automate some clean up Square Peg / Round Hole- start migration before defining scope & context Migrate in isolation Let the consultants figure it out Plan for single migration 21
  • 26. Thank You Graham MorganSpatial Consultants Ltd gmorgan@spatialconsultants.comwww.spatialconsultants.com

Editor's Notes

  1. SPATIALinfo conference, Denver, CO, 9th June 2011Planning for the migration of digital spatial data into SpatialNet
  2. NW England: Lake District, Britain’s Energy Coast
  3. GIS consulting for clients worldwideSpatial systems & data integrationData migrationInformation strategyApplication development
  4. Why should it be difficult – the source data is in a system that has probably successfully supported the business for a number of years.
  5. Relational tablesAttributes with data typesGeometry also stored in native OracleIntegrity rules to enforce consistencyStandards based – other programs can access it.Structure means that the data can be queried with confidence to support business decision making.
  6. All data, including geometries, can be accessed by Oracle tools – and many 3rd party tools.Geometry can be queried and manipulated through SQL.
  7. The SpatialNet data model is complex as it supports many purposes.
  8. Data held across different systems in different formatsHopefully with some form of documentationProbably some duplication with differentiationAs successful business applications often get more complex over time (as they take on additional responsibilities) then it becomes important to fully understand the match (and gaps) between the source and target data.Drawings are only fully intelligible to the trained human eye.
  9. Data associated by proximity on a drawingExtended entity data, blocks, etc.GIS systems can also be configured to store data as annotation (i.e. unstructured) ….
  10. Fiber connectivity is not represented spatially – it must be provided by attribution.
  11. Does the new system provide all the business functions of the existing system?Does all of the existing data map neatly into the model of the new system?Can the new system be accessed by all of the people that currently access the existing system?Existing system is likely to have evolved over time and may have assumed some additional responisbilities.An architect is required to have overall visibility of the entire system replacement project.The data migration is only one project within a larger programme.
  12. Dream on….Activities are generally correct, they are all required – but must expect several iterations to refine.
  13. Iterations generally get fasterFirst is the longest as there is more to put in place; later iterations are just refinements; final is very fast
  14. Magicians are really illusionists – there is always an explanation – Preparation is key.Master data may be in other systems!Must understand the CRUD matrix for the data being migrated – in which applications will each attribute be maintained? And what integration points are required (inbound and outbound) to maintain the data in the new system.Correct data errors in the existing system – greater levels of expertise and efficiency with the familiar system.Prepare for acceptance testing – how will we know when the data has been migrated correctly?What goes in is what came out.How to exchange data between source system/s and migration environment?
  15. Investigate Oracle loading options for spatial data.Standardize DictionariesSymbol Orientation, Annotation positioning, Feature placement (e.g. dist between cables)Pre-processing – minimum changes to prepare the data for loading into Oracle using the selected tools. E.g. drop constraints, change reserved words, protect Identity columns, etc.
  16. Load source into Oracle environment with minimal processing – use Oracle-based tools to process data with consistency.Bulk of the work happens in migrating from the staging environment to the neutral model.
  17. Geometry values – recorded Vs calculatedOracle may well provide a better environment for data profiling than source applications
  18. Every migration is unique.Need to be flexible with use of the most appropriate toolsYet try to be consistent in tool use.Script the migration as far as possible to enable efficient iterations.
  19. Neutral model is relatively simple, easy to read and review.
  20. Tom Ward presentation on afternoon of June 9th.
  21. Support for Just in Time training.Script as much of the migration as possible – there will be several iterations; there may be some time between them.Consider option of automated data cleanup (e.g. data formats for dates, phone numbers, etc.) if time allows.