SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Open Spatial Data
Progress towards a reusable gazetteer
                                               th
        Open Data Group – 16 April 2012
                   @ianibbo



This work is licensed under a Creative Commons Attribution 3.0 Unported License.
Overview
Original Problem
  How to transition a central gov't funded aggregation of
   childcare and positive activities with a budget of
   >2m / year to an open data* model running on £60/
   month hardware
     Retaining security (Of a certain level)
     Retaining functionality
(See http://www.madwdata.org.uk/blog/id/394)
2 Major Costs To Mitigate
Large cluster of proprietary OS hosts, ~12 front
 end web servers, hot backup sql server
  Migrated to 1*Pound Host server ~£60/month, server
   has 2 hard drives, hot backup, off site rsync
Data costs – BPH Address-Point data – Used for
 geocoding incoming records and lookups on
 search terms. OS Boundary Line
  ???
Some Noise
Open Spatial Data Consultation......
Open Spatial Data
Ordnance Survey Open Data
  http://www.ordnancesurvey.co.uk/oswebsite/products/os-lo
Code Point Open
  Postcodes to Northing/Easting
OS Locator
  Gazetteer of road names (And other features)
Obtained by registering on website, requesting,
 getting email, following link, …..
The reality of CodePoint Open
The core data is “Open”
Missing the one vital link between CodePoint
 Open and OS Locator – PostCode → Road
 Names / Identifiers.
If you're happy to display Postcodes without road
   names, it's ideal.
Last Mile Problem.
Finding an automated way to link the 2 is hard!
Licensed data is now open, but out of date
Address Point
Still Licensed
Expensive
Probably not that useful anyway for most projects
Problem with focus on “Open Data”
Everyone ends up implementing their own
 gazetteer
Large scale providers have rate limits and
 introduce external dependencies / Speed
 issues
People want local geo-coding (for lots of different
 reasons).
Having rolled your own gazetteer, you discover
 you need to handle updates (Full replacements)
It's not an end in itself
Vision
A stand-alone gazetteer web app designed for
  local network use with features for importing
  updates from OS, reconciling multiple data
  sources and performing geo-coding lookups.
Available Tools
Apache SOLR
  Long-Standing stalwart of the open data and search
    community
  Schemas slightly clunky
  Several spatial options, all with different strengths /
   weaknesses. Multiple points a problem in some.
ElasticSearch
  Schema Free, Apparently Solid Spatial, Multi Points
  Good integration with Mongo via Rivers
Problems / Issues
ES Spatial search hard to do directly via a COOL
 URL
  Spatial query syntax is expressive, but complex and
   needs JSON sub-documents
Need service wrappers
But thats easily done
Updates!
Missed Level of Abstraction
(Common to many open data sets?)

                   Local
                   Copy




                    C
                     o
  Sourc             m      Processin
    e               pa         g
                    re


      NOSQL Like            ES Ideal for
      Mongo is ideal for    this
      this
Progress
Starting to extract code from existing services
 into a generic spatial app
https://github.com/ianibo/AnOpenGazetteerFramewo
Work progressing under aegis of GIST Mobile
 group / Open Data group
Workable Gaz now, but command line interface
 for importing.
Questions / Comments?
Some supporting info
Original Project – FOI request to DfE
                         Total costs - First 3 years

          7000000
                                           Local Authority         Consultation sem-
          6000000                          Revenue                 inars
                                           Local Authority Capi-   Methods Consulting
          5000000                          tal
                                           Central Office of In-   Engine Group
          4000000                          formation
                                           Qi Consulting           Digital Public
          3000000                          Redhouse                Tribal Education
                                           DfE Staff Costs
          2000000

          1000000

                0
               2008-09   2009-10            2010-11
First 3 years - Non LA costs

2500000

                                              Central Office of In-
2000000                                       formation
                                              Qi Consulting
                                              Redhouse
1500000
                                              DfE Staff Costs
                                              Consultation sem-
                                              inars
1000000
                                              Methods Consulting
                                              Engine Group
 500000                                       Digital Public
                                              Tribal Education

      0
     2008-09    2009-10           2010-11

Weitere ähnliche Inhalte

Ähnlich wie Open spatial processing

Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12 Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12 NatashaIRWS
 
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...BIWUG
 
Blockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech LabBlockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech LabSilverTouchTechLab
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Case Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless EnterpriseCase Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless EnterpriseAppirio
 
Broadband World Forum 2012 Highlights
Broadband World Forum 2012 HighlightsBroadband World Forum 2012 Highlights
Broadband World Forum 2012 HighlightsAlan Quayle
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyCalen Legaspi
 
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisatiesData Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisatiesMultiscope
 
1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...IBM
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02Data Science London
 
EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14Nagi Reddy B
 
Using Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy ManagementUsing Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy ManagementEdward Curry
 
Top Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwareTop Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwarePanorama Software
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBigDataExpo
 
Big Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech LabBig Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech LabSilverTouchTechLab
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...Jade Global
 
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12Eric Athas
 

Ähnlich wie Open spatial processing (20)

Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12 Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12
 
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
 
Blockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech LabBlockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech Lab
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Case Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless EnterpriseCase Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless Enterprise
 
Broadband World Forum 2012 Highlights
Broadband World Forum 2012 HighlightsBroadband World Forum 2012 Highlights
Broadband World Forum 2012 Highlights
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
 
5 2-nessi-stuart campbell
5 2-nessi-stuart campbell5 2-nessi-stuart campbell
5 2-nessi-stuart campbell
 
Resume_ASA
Resume_ASAResume_ASA
Resume_ASA
 
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisatiesData Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
 
1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02
 
EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14
 
Using Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy ManagementUsing Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy Management
 
Top Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwareTop Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama Software
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 
Big Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech LabBig Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech Lab
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
 
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
 

Mehr von ianibbo

Slides from Gangplank discussion
Slides from Gangplank discussionSlides from Gangplank discussion
Slides from Gangplank discussionianibbo
 
Ki presv2
Ki presv2Ki presv2
Ki presv2ianibbo
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8dianibbo
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8dianibbo
 
Aggregator appoverview
Aggregator appoverviewAggregator appoverview
Aggregator appoverviewianibbo
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011ianibbo
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011ianibbo
 
Setting up and open fidy dev environment
Setting up and open fidy dev environmentSetting up and open fidy dev environment
Setting up and open fidy dev environmentianibbo
 
OpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture OverviewOpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture Overviewianibbo
 
GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011ianibbo
 
GIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield AppGIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield Appianibbo
 
Making Agile Pay
Making Agile PayMaking Agile Pay
Making Agile Payianibbo
 
Geekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web PrimerGeekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web Primerianibbo
 

Mehr von ianibbo (13)

Slides from Gangplank discussion
Slides from Gangplank discussionSlides from Gangplank discussion
Slides from Gangplank discussion
 
Ki presv2
Ki presv2Ki presv2
Ki presv2
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
 
Aggregator appoverview
Aggregator appoverviewAggregator appoverview
Aggregator appoverview
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
 
Setting up and open fidy dev environment
Setting up and open fidy dev environmentSetting up and open fidy dev environment
Setting up and open fidy dev environment
 
OpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture OverviewOpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture Overview
 
GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011
 
GIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield AppGIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield App
 
Making Agile Pay
Making Agile PayMaking Agile Pay
Making Agile Pay
 
Geekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web PrimerGeekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web Primer
 

Kürzlich hochgeladen

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 

Kürzlich hochgeladen (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

Open spatial processing

  • 1. Open Spatial Data Progress towards a reusable gazetteer th Open Data Group – 16 April 2012 @ianibbo This work is licensed under a Creative Commons Attribution 3.0 Unported License.
  • 2.
  • 3. Overview Original Problem How to transition a central gov't funded aggregation of childcare and positive activities with a budget of >2m / year to an open data* model running on £60/ month hardware Retaining security (Of a certain level) Retaining functionality (See http://www.madwdata.org.uk/blog/id/394)
  • 4. 2 Major Costs To Mitigate Large cluster of proprietary OS hosts, ~12 front end web servers, hot backup sql server Migrated to 1*Pound Host server ~£60/month, server has 2 hard drives, hot backup, off site rsync Data costs – BPH Address-Point data – Used for geocoding incoming records and lookups on search terms. OS Boundary Line ???
  • 5. Some Noise Open Spatial Data Consultation......
  • 6. Open Spatial Data Ordnance Survey Open Data http://www.ordnancesurvey.co.uk/oswebsite/products/os-lo Code Point Open Postcodes to Northing/Easting OS Locator Gazetteer of road names (And other features) Obtained by registering on website, requesting, getting email, following link, …..
  • 7. The reality of CodePoint Open The core data is “Open” Missing the one vital link between CodePoint Open and OS Locator – PostCode → Road Names / Identifiers. If you're happy to display Postcodes without road names, it's ideal. Last Mile Problem. Finding an automated way to link the 2 is hard! Licensed data is now open, but out of date
  • 8. Address Point Still Licensed Expensive Probably not that useful anyway for most projects
  • 9. Problem with focus on “Open Data” Everyone ends up implementing their own gazetteer Large scale providers have rate limits and introduce external dependencies / Speed issues People want local geo-coding (for lots of different reasons). Having rolled your own gazetteer, you discover you need to handle updates (Full replacements) It's not an end in itself
  • 10. Vision A stand-alone gazetteer web app designed for local network use with features for importing updates from OS, reconciling multiple data sources and performing geo-coding lookups.
  • 11. Available Tools Apache SOLR Long-Standing stalwart of the open data and search community Schemas slightly clunky Several spatial options, all with different strengths / weaknesses. Multiple points a problem in some. ElasticSearch Schema Free, Apparently Solid Spatial, Multi Points Good integration with Mongo via Rivers
  • 12. Problems / Issues ES Spatial search hard to do directly via a COOL URL Spatial query syntax is expressive, but complex and needs JSON sub-documents Need service wrappers But thats easily done Updates!
  • 13. Missed Level of Abstraction (Common to many open data sets?) Local Copy C o Sourc m Processin e pa g re NOSQL Like ES Ideal for Mongo is ideal for this this
  • 14. Progress Starting to extract code from existing services into a generic spatial app https://github.com/ianibo/AnOpenGazetteerFramewo Work progressing under aegis of GIST Mobile group / Open Data group Workable Gaz now, but command line interface for importing.
  • 16. Some supporting info Original Project – FOI request to DfE Total costs - First 3 years 7000000 Local Authority Consultation sem- 6000000 Revenue inars Local Authority Capi- Methods Consulting 5000000 tal Central Office of In- Engine Group 4000000 formation Qi Consulting Digital Public 3000000 Redhouse Tribal Education DfE Staff Costs 2000000 1000000 0 2008-09 2009-10 2010-11
  • 17. First 3 years - Non LA costs 2500000 Central Office of In- 2000000 formation Qi Consulting Redhouse 1500000 DfE Staff Costs Consultation sem- inars 1000000 Methods Consulting Engine Group 500000 Digital Public Tribal Education 0 2008-09 2009-10 2010-11