FME Process Optimisation at the Ordnance Survey Great Britain
An exercise in best practice
david.eagle@1spatial.com
Principal Consultant
FME Certified Professional & Trainer
1Spatial
Agenda
•  The Ordnance Survey
•  The Object Editor Adapter
•  Project Goals
•  FME as a Test Harness
•  Deploying ‘Best Practice’ in FME
•  Tips and tricks
•  Project Benefits
Ordnance Survey - Then
•  Originally the ‘Board of Ordnance’, the Ministry of Defence (MOD) of its day
•  Started survey work in the late 1700s
•  Comprehensively mapped the south coast to be able to repel invasion effectively
•  http://goo.gl/02tVFZ
Land's End, Mount's Bay, Cornwall
Ordnance Survey - Now
•  Self-financing part of the UK government
•  £120-million-a-year civilian organisation
•  At the forefront of the digital economy
•  Remit covers Great Britain (England, Scotland and Wales)
•  243,241 sq km / 93,956 sq miles
•  Integrated data capture for field surveyors, HQ-based editors, and external suppliers
Geospatial Data Management System (GDMS)
•  Seamless data maintenance solution
•  Intelligent and integrated maintenance data model
•  Allows separation of product data
•  Includes automated data validation
•  Platform for new product development
•  650+ data editors
•  500+ million features
•  4,000 data maintenance jobs per day
GDMS – Production
GDMS – Production Architecture
•  Includes best-of-breed COTS components from:
•  1Spatial
•  BAE Systems
•  ESRI
•  Intergraph
•  Microsoft
•  Oracle
•  Safe Software
•  Snowflake Software
•  Scalable infrastructure
•  Disconnected editing thanks to the ‘Object Editor Adapter’
Object Editor Adapter
•  Transforms data into a structure usable in the Object Editor map application
•  FME underpins the transformation
•  The schema from the database is manipulated into an ‘understandable’ structure
[Diagram: GML → OEA (FME) → PGDB]
Project Goals
1.  Document process and make maintenance practical
•  Schema changes pending
•  Component needed to be better understood
2.  Upgrade process to FME 2013 (look beyond 2013)
•  Built with FME 2009
•  Utilise recent FME enhancements
3.  Improve efficiency & performance
•  Remove redundancy
“If it ain’t broke…”
•  Huge amount of change in FME since 2009
•  Attribute management of particular interest
•  OS needed to unpick the workspace to be able to update it anyway
•  Workspace opening time was slow, impacting batch run time and FME Server
–  3 minutes
–  c. 60 MB (most *.fmw files don’t exceed a few hundred KB)
•  Quite a few transformers…
Transformers, transformers everywhere
•  943 transformers in total
–  Custom Transformer(s) = 255
–  AttributeRenamer = 211
–  Tester = 150
–  AttributeRemover = 87
–  AttributeCreator = 51
–  ListExploder = 72
–  CaseChanger = 21
–  ListElementCounter = 18
–  AttributeSplitter = 12
–  AttributeSetter = 10
–  ListIndexer = 9
–  AttributeCopier = 8
What is a ‘Test Harness’?
“In software testing, a test harness or automated test framework is a collection of software and test data configured to test a program unit by running it under varying conditions and monitoring its behaviour and outputs.”
Source: Wikipedia
Using FME as a ‘Test Harness’
•  The process had to change but the data must not!
•  FME used to check the output of the old process against the new
•  Controlled via a spreadsheet, a WorkspaceRunner and Published Parameters
[Diagram: GML → PGDB (OLD) and PGDB (NEW) → Detect Change → Failures: Spreadsheets and Geometry]
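The comparison at the heart of the harness can be illustrated outside FME. The following is a minimal sketch, assuming each output feature can be represented as a dictionary with a unique `id`; the function names, the `id` key and the sample records are hypothetical and stand in for the old and new PGDB outputs, not for the actual workspace.

```python
def index_by_id(features, key="id"):
    """Index a list of feature dictionaries by their identifier."""
    return {feature[key]: feature for feature in features}


def detect_changes(old_features, new_features, key="id"):
    """Compare old-process output with new-process output.

    Returns features missing from the new output, features that only
    exist in the new output, and per-attribute differences for features
    present in both.
    """
    old = index_by_id(old_features, key)
    new = index_by_id(new_features, key)
    report = {
        "missing_in_new": sorted(old.keys() - new.keys()),
        "extra_in_new": sorted(new.keys() - old.keys()),
        "changed": {},
    }
    for fid in sorted(old.keys() & new.keys()):
        names = sorted(old[fid].keys() | new[fid].keys())
        diffs = {
            name: (old[fid].get(name), new[fid].get(name))
            for name in names
            if old[fid].get(name) != new[fid].get(name)
        }
        if diffs:
            report["changed"][fid] = diffs
    return report


# Hypothetical sample data: geometry is held as WKT text for simplicity.
old_output = [
    {"id": 1, "LEVEL": "GROUND", "geometry": "POINT (0 0)"},
    {"id": 2, "LEVEL": "ROOF", "geometry": "POINT (1 1)"},
]
new_output = [
    {"id": 1, "LEVEL": "GROUND", "geometry": "POINT (0 0)"},
    {"id": 2, "LEVEL": "ROOF", "geometry": "POINT (1 2)"},
]

report = detect_changes(old_output, new_output)
print(report)
```

An empty report means the new process reproduced the old output for that test case; anything else is a failure to investigate.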
Dynamic Schema
•  Introduced around FME 2009
–  Existing process didn’t use it
•  Enable ‘Dynamic’ schema mode
–  Whatever schema is read, gets written
–  Decouple schema from format
•  http://blog.safe.com/2010/02/fmeevangelist54/
Dynamic Schema
•  96 input layers can be reduced to 1!
–  Magic Transformer = FeatureTypeExtractor
–  Workspace far less daunting
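The idea of one generic input carrying its feature type as an attribute can be sketched in plain Python. This is an illustration only, assuming features are dictionaries and that the feature type has already been extracted into an attribute; the attribute name `feature_type`, the function `write_dynamic` and the sample layers are hypothetical.

```python
from collections import defaultdict


def write_dynamic(features, type_attr="feature_type"):
    """Route one mixed stream of features into per-type output tables.

    The columns of each table are derived from the features themselves,
    so no output schema is defined up front.
    """
    tables = defaultdict(lambda: {"columns": [], "rows": []})
    for feature in features:
        table = tables[feature[type_attr]]
        for name in feature:
            if name != type_attr and name not in table["columns"]:
                table["columns"].append(name)
        table["rows"].append(
            {name: value for name, value in feature.items() if name != type_attr}
        )
    return dict(tables)


# One stream instead of one input per layer (hypothetical sample layers).
stream = [
    {"feature_type": "Building", "id": 1, "height": 12.5},
    {"feature_type": "Road", "id": 2, "name": "High Street"},
    {"feature_type": "Building", "id": 3, "height": 8.0},
]

tables = write_dynamic(stream)
for name, table in tables.items():
    print(name, table["columns"], len(table["rows"]))
```

Adding a new layer to the input needs no change to the routing logic, which is the maintenance benefit the slide points to.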
Reduce Transformer Duplication
•  Helped by Dynamic Schema Mode
•  Try not to duplicate operations
Reduce Transformer Duplication
•  ‘Like’ operations carried out on all features
–  Filter features off to unique processes when needed
•  20% of data had a ‘Level’
–  21 CaseChangers used to apply UPPERCASE
–  Replaced with 1 StringCaseChanger
•  69 Custom Transformers with embedded Python were used to format Date attributes
–  Replaced: all features are passed through a StringReplacer to remove hyphens
•  e.g. 2013-01-01 becomes 20130101
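The two consolidated operations are simple string edits, which is why a single transformer can serve every feature. A minimal sketch, assuming features are dictionaries; the attribute name `CaptureDate` and both function names are hypothetical, while `Level` is taken from the slide.

```python
def uppercase_level(feature, attr="Level"):
    """Upper-case the Level attribute when the feature has one."""
    if feature.get(attr) is not None:
        feature[attr] = str(feature[attr]).upper()
    return feature


def compact_date(feature, date_attrs=("CaptureDate",)):
    """Remove hyphens from date attributes: 2013-01-01 becomes 20130101."""
    for name in date_attrs:
        value = feature.get(name)
        if value is not None:
            feature[name] = str(value).replace("-", "")
    return feature


# Hypothetical sample features; only some carry a Level.
features = [
    {"Level": "ground", "CaptureDate": "2013-01-01"},
    {"CaptureDate": "2013-06-30"},
]

# Every feature goes through the same two steps, once.
processed = [compact_date(uppercase_level(dict(f))) for f in features]
print(processed)
```

Features without the attribute pass through unchanged, so there is no need to filter them onto a separate branch first.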
Custom Transformers
•  What?
•  A container for many regular transformers
•  Allows a common set of tasks to be reused in 1 or many workspaces
•  When exported becomes an *.FMX file for sharing
•  The FME Store was introduced in 2012
–  By 2014 the FME Store was mature, with 130+ transformers (and other resources)
•  http://fmestore.safe.com
•  http://fmepedia.safe.com/articles/Samples_and_Demos/Custom-Transformer-KMLDiagrammer
•  It’s increasingly likely your workspaces will contain Store resources
•  FME now auto connects & downloads any resources required
Custom Transformers
•  Support for versioning
•  Ports are not disconnected when renamed
–  http://blog.safe.com/2013/12/fmeevangelist121/
•  Embed/link all instances of a given transformer
•  Manage input and output schema on Custom Transformers
Custom Transformers 2014
•  14 unique custom transformers used 255 times
•  The PGDB had constraints: many tables did not allow null values
–  Custom transformers tested for and removed null attributes
–  Thanks to Generic mode, the constrained attributes can be exposed and passed through a PythonCaller to remove the null attributes
•  FME 2014
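The null-stripping logic can be sketched on plain Python dictionaries. This is not the PythonCaller script used in the project and does not use the FME Python API; the function name `remove_null_attributes`, the `constrained` parameter and the sample attribute names are hypothetical.

```python
def remove_null_attributes(feature, constrained=None):
    """Return a copy of the feature without its null attributes.

    If `constrained` is given, only those attribute names are checked;
    otherwise every attribute is checked.
    """
    names = feature.keys() if constrained is None else constrained
    return {
        name: value
        for name, value in feature.items()
        if not (name in names and value is None)
    }


# Hypothetical feature: 'name' must not be null in the target table,
# 'note' is allowed to be null.
feature = {"id": 1, "name": None, "note": None}

print(remove_null_attributes(feature, constrained={"name"}))
print(remove_null_attributes(feature))
```

Dropping the attribute, rather than writing an explicit null, lets the target table apply its own default or leave the field unset.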
Use Custom Transformers Effectively
Schema Mapping
•  Attribute names were hard-coded into 211 AttributeRenamers in the workspace
–  Changes to the target schema meant attribute hunting!
–  Biggest challenge for ongoing maintenance
•  1x SchemaMapper replaces all AttributeRenamers
–  A non-FME user can alter the process by editing AttributeMapping.csv
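A lookup-table rename of this kind can be sketched in Python. The layout of AttributeMapping.csv is not shown in the slides, so the two column headings and the attribute names below are assumptions, and the CSV is held in a string to keep the example self-contained.

```python
import csv
import io

# Hypothetical contents of a mapping file: one row per renamed attribute.
MAPPING_CSV = """source_attribute,target_attribute
descGroup,DESCRIPTIVE_GROUP
versionDate,VERSION_DATE
"""


def load_mapping(csv_file):
    """Read source-to-target attribute names from an open CSV file."""
    reader = csv.DictReader(csv_file)
    return {row["source_attribute"]: row["target_attribute"] for row in reader}


def apply_mapping(feature, mapping):
    """Rename attributes found in the mapping; leave all others as they are."""
    return {mapping.get(name, name): value for name, value in feature.items()}


mapping = load_mapping(io.StringIO(MAPPING_CSV))
feature = {"descGroup": "Building", "versionDate": "20130101", "id": 7}
renamed = apply_mapping(feature, mapping)
print(renamed)
```

Because the names live in data rather than in the process, a change to the target schema becomes an edit to one row of the file.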
Documentation
•  Always document your Workspace
•  The OEA process was documented
–  But it didn’t include detailed explanations of ‘why’
•  You should be able to read a Workspace and understand it
–  Highlight dependencies and use XML notation for naming
•  In FME 2014, canvas annotations are taken to the next level
Documentation – Self contained
Before and After
Benefits
Measure                          Before           After      Improvement
Time to open (from local disk)   2 mins 12 secs   31 secs    1 min 41 secs
File size                        56 MB            13.8 MB    75% reduction
Lines of machine-written code    450,332          111,334    75% reduction
Number of transformers           943              336        64% fewer
Sample run time                  1 hr 7 mins      48 mins    19 mins
•  Current, documented and ‘owned’ by the Ordnance Survey
•  Robust production extraction routine
•  Simpler to maintain and change
–  Some schema changes possible without opening FME!
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Process Optimisation at the Ordnance Survey Great Britain

  • 1. FME Process Optimisation at the Ordnance Survey Great Britain
      •  An exercise in best practice
      •  david.eagle@1spatial.com, Principal Consultant, FME Certified Professional & Trainer, 1Spatial
  • 2. Agenda
      •  The Ordnance Survey
      •  The Object Editor Adapter
      •  Project Goals
      •  FME as a Test Harness
      •  Deploying ‘Best Practice’ in FME
      •  Tips and tricks
      •  Project Benefits
  • 3. Ordnance Survey - Then
      •  The original ‘Board of Ordnance’, the MOD of its day
      •  Started survey work in the late 1700s
      •  Comprehensively mapped the south coast to be able to repel invasion effectively
      •  http://goo.gl/02tVFZ
      •  Image caption: Land's End, Mount's Bay, Cornwall
  • 4. Ordnance Survey - Now
      •  Self-financing part of the UK government
      •  £120-million-a-year civilian organisation
      •  At the forefront of the digital economy
      •  Remit covers Great Britain (England, Scotland and Wales)
      •  243,241 sq km / 93,956 sq miles
      •  Integrated data capture for field surveyors, HQ-based editors, and external suppliers
  • 5. Geospatial Data Management System (GDMS)
      •  Seamless data maintenance solution
      •  Intelligent and integrated maintenance data model
      •  Allows separation of product data
      •  Includes automated data validation
      •  Platform for new product development
      •  650+ data editors
      •  500+ million features
      •  4,000 data maintenance jobs per day
  • 7. GDMS – Production Architecture
      •  Includes best-of-breed COTS components from
          –  1Spatial
          –  BAE Systems
          –  ESRI
          –  Intergraph
          –  Microsoft
          –  Oracle
          –  Safe Software
          –  Snowflake Software
      •  Scalable infrastructure
      •  Disconnected editing thanks to the ‘Object Editor Adapter’
  • 8. Object Editor Adapter
      •  Transforms data into a structure usable in the Object Editor map application
      •  FME underpins the transformation
      •  Schema is manipulated into ‘understandable’ structure from the database
      •  Diagram: GML → OEA (FME) → PGDB
  • 9. Project Goals
      1.  Document process and make maintenance practical
          •  Schema changes pending
          •  Component needed to be better understood
      2.  Upgrade process to FME 2013 (look beyond 2013)
          •  Built with FME 2009
          •  Utilise recent FME enhancements
      3.  Improve efficiency & performance
          •  Remove redundancy
  • 10. “If it ain’t broke…”
      •  Huge amount of change in FME since 2009
      •  Attribute management of particular interest
      •  OS needed to unpick the workspace to be able to update it anyway
      •  Workspace opening time was slow, impacting batch run time and FME Server
          –  3 minutes to open
          –  c. 60 MB file size (most *.fmw files don’t exceed a few hundred KB)
      •  Quite a few transformers…
  • 11. Transformers, transformers everywhere
      •  943 transformers in total, including:
          –  Custom Transformer(s) = 255
          –  AttributeRenamer = 211
          –  Tester = 150
          –  AttributeRemover = 87
          –  ListExploder = 72
          –  AttributeCreator = 51
          –  CaseChanger = 21
          –  ListElementCounter = 18
          –  AttributeSplitter = 12
          –  AttributeSetter = 10
          –  ListIndexer = 9
          –  AttributeCopier = 8
  • 12. What is a ‘Test Harness’?
      •  “In software testing, a test harness or automated test framework is a collection of software and test data configured to test a program unit by running it under varying conditions and monitoring its behaviour and outputs.” (Wikipedia)
  • 13. Using FME as a ‘Test Harness’
      •  The process had to change but the data must not!
      •  FME to check the old process against the new
      •  Controlled via a spreadsheet, a WorkspaceRunner and Published Parameters
      •  Diagram: GML → PGDB (old) and PGDB (new) → Detect change → Failures: spreadsheets and geometry
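
The old-versus-new check described on this slide can be sketched in plain Python. This is only an illustration of the idea, not the harness built for the Ordnance Survey, which used FME transformers driven by a WorkspaceRunner. The record layout (dictionaries keyed by a hypothetical `id` attribute) and the function name `detect_changes` are assumptions made for the example.

```python
def detect_changes(old_features, new_features, key="id"):
    """Compare two outputs feature by feature and report every difference.

    Features are plain dicts here (an assumption for this sketch); `key`
    names the attribute that identifies the same feature in both outputs.
    """
    old_by_key = {f[key]: f for f in old_features}
    new_by_key = {f[key]: f for f in new_features}

    report = {
        "missing_in_new": sorted(old_by_key.keys() - new_by_key.keys()),
        "added_in_new": sorted(new_by_key.keys() - old_by_key.keys()),
        "changed": {},
    }
    for k in sorted(old_by_key.keys() & new_by_key.keys()):
        old, new = old_by_key[k], new_by_key[k]
        # Note: a missing attribute and an attribute set to None compare equal here.
        diffs = {
            name: (old.get(name), new.get(name))
            for name in sorted(old.keys() | new.keys())
            if old.get(name) != new.get(name)
        }
        if diffs:
            report["changed"][k] = diffs
    return report


# Hypothetical sample data: the new process has moved feature 2.
old_output = [
    {"id": 1, "level": "GROUND", "geometry": "POINT (0 0)"},
    {"id": 2, "level": "ROOF", "geometry": "POINT (1 1)"},
]
new_output = [
    {"id": 1, "level": "GROUND", "geometry": "POINT (0 0)"},
    {"id": 2, "level": "ROOF", "geometry": "POINT (1 2)"},
]
report = detect_changes(old_output, new_output)
```

An empty report means the refactored process reproduces the old output exactly, which is the pass condition the slide describes: the process may change, the data may not.
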
  • 15. Dynamic Schema
      •  Introduced around FME 2009
          –  Existing process didn’t use it
      •  Enable ‘Dynamic’ schema mode
          –  Whatever schema is read, gets written
          –  Decouple schema from format
      •  http://blog.safe.com/2010/02/fmeevangelist54/
  • 16. Dynamic Schema
      •  96 input layers can be reduced to 1!
          –  Magic Transformer = FeatureTypeExtractor
          –  Workspace infinitely less daunting
  • 17. Reduce Transformer Duplication
      •  Helped by Dynamic Schema Mode
      •  Try not to duplicate operations
  • 18. Reduce Transformer Duplication
      •  ‘Like’ operations carried out on all features
          –  Filter features off to unique processes when needed
      •  20% of data had a ‘Level’
          –  21 CaseChangers used to apply UPPERCASE
          –  Replaced with 1 StringCaseChanger
      •  69 Custom Transformers with embedded Python used to format Date attributes
          –  All features passed through a StringReplacer to remove hyphens
          –  e.g. 2013-01-01 to 20130101
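
The two consolidations on this slide (one uppercase step, one hyphen-stripping step applied to every feature) amount to very small string operations. The sketch below shows them in plain Python rather than in FME; the attribute names `date` and `level` and the function name `normalise_feature` are hypothetical, chosen only to mirror the slide.

```python
def normalise_feature(attributes, date_fields=("date",), upper_fields=("level",)):
    """Apply the same clean-up to every feature in a single pass.

    `attributes` is a plain dict standing in for a feature's attributes
    (an assumption for this sketch). Features that lack a field pass
    through unchanged, so no up-front filtering is needed.
    """
    result = dict(attributes)
    for name in date_fields:
        value = result.get(name)
        if isinstance(value, str):
            # Same effect as removing hyphens with a single string replacement.
            result[name] = value.replace("-", "")
    for name in upper_fields:
        value = result.get(name)
        if isinstance(value, str):
            result[name] = value.upper()
    return result


example = normalise_feature({"date": "2013-01-01", "level": "ground"})
no_level = normalise_feature({"date": "2013-01-01"})
```

Because the function tolerates missing fields, it can sit on the common path for all features, which is the point the slide makes about replacing many per-layer copies with one shared step.
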
  • 20. Custom Transformers
      •  What?
          –  A container for many regular transformers
          –  Allows a common set of tasks to be reused in 1 or many workspaces
          –  When exported becomes an *.FMX file for sharing
  • 21. Custom Transformers
      •  The FME Store was introduced in 2012
          –  By 2014 the FME Store was mature, with 130+ transformers (and other resources)
      •  http://fmestore.safe.com
      •  http://fmepedia.safe.com/articles/Samples_and_Demos/Custom-Transformer-KMLDiagrammer
      •  It’s increasingly likely your workspaces will contain Store resources
      •  FME now auto connects & downloads any resources required
  • 22. Custom Transformers 2014
      •  Support for versioning
      •  Ports are not disconnected when renamed
          –  http://blog.safe.com/2013/12/fmeevangelist121/
      •  Embed/link all instances of a given transformer
      •  Manage input and output schema on Custom Transformers
  • 23. Use Custom Transformers Effectively
      •  14 unique custom transformers used 255 times
      •  PGDB had constraints, many tables did not allow null values
          –  Custom transformers tested and removed null attributes
          –  Thanks to Generic mode, constrained attributes can be exposed and passed through a PythonCaller to remove the null attributes
      •  FME 2014
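
The null-removal step can be illustrated with a short Python function. This is a sketch under stated assumptions, not the PythonCaller script used in the workspace: a plain dict stands in for an FME feature, `None` stands in for a null value, and the attribute names and the function name `remove_null_attributes` are hypothetical.

```python
def remove_null_attributes(attributes, constrained):
    """Drop constrained attributes whose value is null.

    `attributes` is a dict standing in for a feature (assumption);
    `constrained` lists the attributes whose target columns reject nulls.
    Unconstrained attributes are kept even when they are null.
    """
    return {
        name: value
        for name, value in attributes.items()
        if not (name in constrained and value is None)
    }


# Hypothetical feature: `height` is constrained and null, so it is dropped.
feature = {"id": 7, "height": None, "note": None, "name": "Barn"}
cleaned = remove_null_attributes(feature, constrained={"height", "name"})
```

Keeping the list of constrained attributes as data rather than hardcoding it per table is what lets a single step replace many near-identical copies.
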
  • 24. Schema Mapping
      •  Attribute names are hardcoded into 211 AttributeRenamers in the workspace
          –  Changes to the target schema mean attribute hunting!
          –  Biggest challenge for ongoing maintenance
      •  1x SchemaMapper replaces all AttributeRenamers
          –  Non-FME user can alter the process by editing AttributeMapping.csv
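
The benefit of driving renames from a lookup file can be sketched in Python. The slide names the file AttributeMapping.csv but does not show its layout, so the column headers (`source_attribute`, `target_attribute`), the attribute names and both function names below are assumptions for illustration only.

```python
import csv
import io

# Hypothetical mapping content; in practice this would be read from the CSV file.
MAPPING_CSV = (
    "source_attribute,target_attribute\n"
    "feat_code,FEATURE_CODE\n"
    "desc_term,DESCRIPTIVE_TERM\n"
)


def load_mapping(csv_text):
    """Read source-to-target attribute names from CSV text into a dict."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["source_attribute"]: row["target_attribute"] for row in reader}


def apply_mapping(attributes, mapping):
    """Rename mapped attributes; attributes without a mapping keep their name."""
    return {mapping.get(name, name): value for name, value in attributes.items()}


mapping = load_mapping(MAPPING_CSV)
renamed = apply_mapping(
    {"feat_code": 10021, "desc_term": "Building", "height": 7.5}, mapping
)
```

A schema change then becomes a one-line edit to the mapping data, with no change to the processing logic, which is why someone who does not use FME can maintain it.
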
  • 25. Documentation
      •  Always document your Workspace
      •  The OEA process was documented
          –  But didn’t include detailed explanations ‘why’
      •  You should be able to read a Workspace and understand it
          –  Highlight dependencies and use XML notation for naming
  • 26. Documentation – Self contained
      •  In 2014, canvas annotations were taken to the next level
  • 28. Benefits

      Measure                          Before          After     Improvement
      Time to open (from local disk)   2mins 12secs    31secs    1min 41secs
      File size                        56mb            13.8mb    75% reduction
      Lines of machine written code    450,332         111,334   75% reduction
      Number of transformers           943             336       64% fewer
      Sample run time                  1hr 7mins       48mins    19mins

      •  Current, documented and ‘owned’ by the Ordnance Survey
      •  Robust production extraction routine
      •  Simpler to maintain and change
          –  Some schema changes possible without opening FME!
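
The improvement figures on this slide can be checked from the before and after values. The short Python sketch below only redoes the slide's arithmetic; the helper name `percent_reduction` is introduced for the example.

```python
def percent_reduction(before, after):
    """Return the reduction from `before` to `after` as a percentage of `before`."""
    return (before - after) / before * 100


# Before/after values as quoted on the slide, converted to seconds or minutes.
open_before_s = 2 * 60 + 12   # 2 mins 12 secs
open_after_s = 31
run_before_min = 60 + 7       # 1 hr 7 mins
run_after_min = 48

open_saved_s = open_before_s - open_after_s       # seconds saved when opening
run_saved_min = run_before_min - run_after_min    # minutes saved per sample run

file_size_pct = round(percent_reduction(56, 13.8))
code_lines_pct = round(percent_reduction(450_332, 111_334))
transformers_pct = round(percent_reduction(943, 336))
```

The results agree with the slide: 101 seconds (1 min 41 secs) saved on opening, 19 minutes saved on the sample run, and reductions of 75%, 75% and 64% in file size, lines of code and transformer count.
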
  • 29. david.eagle@1spatial.com, Principal Consultant, FME Certified Professional & Trainer, 1Spatial