SlideShare a Scribd company logo
1 of 27
OpenStreetMap history
    challenges – methods – reasons
Relevance of Historical Data

    What did the map look
     like two years ago?

      Who was involved in
     editing object X / region Y?

        What exactly happened
            in changeset Z?
Relevance of Historical Data


   Historical visualization


       Retracing vandalism


             Data analysis
OpenStreetMap consists of   • nodes
                            • ways
                            • relations
• nodes
• ways
• relations
              each of those have a version and a
              timestamp, among other things
each of those have a version and a
• nodes       timestamp, among other things
• ways
• relations
              So in theory it’s possible to recreate
              the full history of all objects in
              OpenStreetMap
in practice
this is more
complicated
in practice
this is more
complicated    ways and
               relations
               only have
               references
               to node IDs
in practice
this is more
complicated
ways and
relations      this poses two
only have      problems.
references
to node IDs
1. Moving a node does not
 cause a new way version
1. Moving a node does not
 cause a new way version




     2. References in ways and
    relations do not point to a
          specific version
way version 1


1

v1                               3
                2
                                 v1
                v1




     1. Moving a node does not
      cause a new way version
way version 1

                   2

                   v2
1

v1                               3

                                 v1




     1. Moving a node does not
      cause a new way version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
2. References in ways and
relations do not point to a
      specific version
Also, the data is not readily accessible
Also, the data is not readily accessible
        Full history dump is 451GB of XML,
      containing 1.7 billion node versions, 170
               million way versions, and
             4.5 million relation versions.
What is needed?
What is needed?


        • A suitable DB schema
        • PBF format support
        • importer (support)
What is needed?


         • A suitable DB schema
         • PBF format support
         • importer (support)


   .....and ideally, a history API
• A suitable DB schema
• PBF format support
• importer support (osmosis, imposm)
• A suitable DB schema
• PBF format support
• importer support (osmosis, imposm)


       History Hack Weekend
       http://bit.ly/osmhistoryhack
History Hack Weekend
http://bit.ly/osmhistoryhack


        June 10-12
        LinuxHotel
      Essen, Germany
thanks

  OSM > mvexel
twitter > @mvexel
Martijn van Exel


all photos from flickr commons

More Related Content

More from Martijn van Exel (6)

MapRoulette Cooperative Challenges
MapRoulette Cooperative ChallengesMapRoulette Cooperative Challenges
MapRoulette Cooperative Challenges
 
The Golden Age of GIS - Keynote for GIS day 2019
The Golden Age of GIS - Keynote for GIS day 2019The Golden Age of GIS - Keynote for GIS day 2019
The Golden Age of GIS - Keynote for GIS day 2019
 
OpenStreetMap
OpenStreetMapOpenStreetMap
OpenStreetMap
 
Open Places Platform
Open Places PlatformOpen Places Platform
Open Places Platform
 
OpenStreetMap voor IMG100k
OpenStreetMap voor IMG100kOpenStreetMap voor IMG100k
OpenStreetMap voor IMG100k
 
OpenStreetMap Netherlands Since AND import
OpenStreetMap Netherlands Since AND importOpenStreetMap Netherlands Since AND import
OpenStreetMap Netherlands Since AND import
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

Dealing with historical data in OpenStreetMap

  • 1. OpenStreetMap history challenges – methods – reasons
  • 2. Relevance of Historical Data What did the map look like two years ago? Who was involved in editing object X / region Y? What exactly happened in changeset Z?
  • 3. Relevance of Historical Data Historical visualization Retracing vandalism Data analysis
  • 4. OpenStreetMap consists of • nodes • ways • relations
  • 5. • nodes • ways • relations each of those have a version and a timestamp, among other things
  • 6. each of those have a version and a • nodes timestamp, among other things • ways • relations So in theory it’s possible to recreate the full history of all objects in OpenStreetMap
  • 7. in practice this is more complicated
  • 8. in practice this is more complicated ways and relations only have references to node IDs
  • 9. in practice this is more complicated ways and relations this poses two only have problems. references to node IDs
  • 10. 1. Moving a node does not cause a new way version
  • 11. 1. Moving a node does not cause a new way version 2. References in ways and relations do not point to a specific version
  • 12. way version 1 1 v1 3 2 v1 v1 1. Moving a node does not cause a new way version
  • 13. way version 1 2 v2 1 v1 3 v1 1. Moving a node does not cause a new way version
  • 14. 2. References in ways and relations do not point to a specific version
  • 15. 2. References in ways and relations do not point to a specific version
  • 16. 2. References in ways and relations do not point to a specific version
  • 17. 2. References in ways and relations do not point to a specific version
  • 18. Also, the data is not readily accessible
  • 19. Also, the data is not readily accessible Full history dump is 451GB of XML, containing 1.7 billion node versions, 170 million way versions, and 4.5 million relation versions.
  • 21. What is needed? • A suitable DB schema • PBF format support • importer (support)
  • 22. What is needed? • A suitable DB schema • PBF format support • importer (support) .....and ideally, a history API
  • 23. • A suitable DB schema • PBF format support • importer support (osmosis, imposm)
  • 24. • A suitable DB schema • PBF format support • importer support (osmosis, imposm) History Hack Weekend http://bit.ly/osmhistoryhack
  • 25.
  • 26. History Hack Weekend http://bit.ly/osmhistoryhack June 10-12 LinuxHotel Essen, Germany
  • 27. thanks OSM > mvexel twitter > @mvexel Martijn van Exel all photos from flickr commons

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n