SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Machine Learning for Better Maps
Zhuangfang Yi- Development Seed - @geonanayi
Workshop @Clark University, 01/07/2019
OpenStreetMap label quality and geodiversity for
machine learning applications
Deep Learning
Machine Learning + OpenStreetMap + Satellite Imagery
Urban = Yes
Image Classification Training Dataset
Object Detection training dataset
Segmentation training dataset
Label Maker
OpenStreetMap is an attractive label/tags database for machine learning
applications that holds repid updated mapped object daily by thousands of users.
Training Data Completeness Matters
- OSM tag/label info and
popularity
Tag info in France.
Landuse is one of tags that has
been frequently used by the
users.
Label Maker
OpenStreetMap is an attractive label/tags database for machine learning
applications that host rapidly update mapped object daily by thousands of users.
Training data
Completeness Matters
OpenStreetMap Label Quality for Machine Learning
Applications
ISO standard for geographic information data: positional accuracy,
completeness, and logical consistency.
Other data quality issues in OSM:
- Vandalism
- Missing details
- Completeness and accuracy
Training data Completeness Matters
Training Data Completeness Matters
Available tools for data quality
assessment:
- OSM analytics (OSM v.s.
Human Settlement Layer)
- OSM-lint (e.g. OSM v.s.
US census TIGER in USA)
Training Data Completeness Matters
Building classification in Vietnam with LeNet
on AWS SageMaker.
Individual building detection with Tensorflow Object
detection in Mexico
60% -> 84% from Vietnam to Mexico
- OSM label data + satellite images match
- OSM label data is not well-aligned with the paired satellite image
Training Data Completeness Matters
Training Data Completeness Matters
HOT Task Manager
Training Data Completeness Matters
Urchn for urban change detection with ML
Training Data
Geodiversity Matters
Applying Machine Learning Applications for Geospatial
Analysis
Training Data Geodiversity Matters
High-voltage grid detection
with deep learning in
Pakistan, Nigeria, and
Zambia
Training Data Geodiversity Matters
Training Data Geodiversity Matters
Training Data Geodiversity Matters
Urban settlement change detection in Ethiopia between 2000 - 2017 with random
Conclusions
When it comes to applying machine learning applications:
- Training data quality matters, and to use OSM label data for ML
applications, I recommend:
1. Do a proper label completeness assessment with currently
available tools;
2. Check OSM tag/label info and frequency for your area of interest;
3. For segmentation ML application, make sure the image tiles
align-well with your label dataset;
4. Prepare training dataset using: Label Maker, or RoboSat or other
data prep tools.
- Training data geodiversity matters, and recommend to do:
a. data/image feature similarity analysis;
Contacts
Twitter @geonanayi
GitHub @geoyi
Email nana@developmentseed.org
Data Completeness Matters
HOT Analytics for Health
With support of the Bill and Melinda Gates
Foundation and the Clinton Health Access
Initiative, we have designed an analysis tool
to evaluate the accuracy and precision of
OpenStreetMap field data.
Other data quality issues in OSM:
- Vandalism
- Missing details
- Completeness and accuracy
The results of this analysis found the
positional accuracy of OpenStreetMap data
to be very good in comparison to OS
MasterMap, with over 80% overlap
between most the road objects tested
between the two datasets. The results also
found there to be a positive correlation
between road name attribute
completeness and number of users per
area.
Training data Completeness Matters

Weitere ähnliche Inhalte

Was ist angesagt?

Big Data, Data and Information Mining for Earth Observation
Big Data, Data and Information Mining for Earth ObservationBig Data, Data and Information Mining for Earth Observation
Big Data, Data and Information Mining for Earth ObservationPier Giorgio Marchetti
 
A GIS Based Satellite Data Management Application
A GIS Based Satellite Data Management ApplicationA GIS Based Satellite Data Management Application
A GIS Based Satellite Data Management ApplicationCarlos Gabriel Asato
 
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of Earthquakes
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of EarthquakesBetter Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of Earthquakes
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of EarthquakesPRBETTER
 
Computer application in geography
Computer application in geographyComputer application in geography
Computer application in geographyShoukat Ali
 
Tianjin Case Study
Tianjin Case StudyTianjin Case Study
Tianjin Case StudyVisionMap
 
INTRODUCTION TO GIS
INTRODUCTION TO GISINTRODUCTION TO GIS
INTRODUCTION TO GISHamzaAhmad91
 
Gis Across Curriculum nh
Gis Across Curriculum nhGis Across Curriculum nh
Gis Across Curriculum nhVince123
 
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...Conferenceproceedings
 
Group 3 presentation
Group 3 presentationGroup 3 presentation
Group 3 presentationPhilwood
 
Thrive2055_UTCGIS_2015ii
Thrive2055_UTCGIS_2015iiThrive2055_UTCGIS_2015ii
Thrive2055_UTCGIS_2015iiAndy Carroll
 
Challenges to Large Scale Mapping: Can Data Geometry Help?
Challenges to Large Scale Mapping: Can Data Geometry Help?Challenges to Large Scale Mapping: Can Data Geometry Help?
Challenges to Large Scale Mapping: Can Data Geometry Help?Louisa Diggs
 

Was ist angesagt? (20)

Big Data, Data and Information Mining for Earth Observation
Big Data, Data and Information Mining for Earth ObservationBig Data, Data and Information Mining for Earth Observation
Big Data, Data and Information Mining for Earth Observation
 
A GIS Based Satellite Data Management Application
A GIS Based Satellite Data Management ApplicationA GIS Based Satellite Data Management Application
A GIS Based Satellite Data Management Application
 
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of Earthquakes
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of EarthquakesBetter Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of Earthquakes
Better Hackathon 2020 ETHZ - Comparing Static And Dynamic Effects Of Earthquakes
 
ICT in Geography
ICT in GeographyICT in Geography
ICT in Geography
 
Computer application in geography
Computer application in geographyComputer application in geography
Computer application in geography
 
Tianjin Case Study
Tianjin Case StudyTianjin Case Study
Tianjin Case Study
 
INTRODUCTION TO GIS
INTRODUCTION TO GISINTRODUCTION TO GIS
INTRODUCTION TO GIS
 
7 satida mistelbauer
7 satida mistelbauer7 satida mistelbauer
7 satida mistelbauer
 
Aerial Photography in Archaeology
Aerial Photography in ArchaeologyAerial Photography in Archaeology
Aerial Photography in Archaeology
 
Gis Across Curriculum nh
Gis Across Curriculum nhGis Across Curriculum nh
Gis Across Curriculum nh
 
Jeffrey Villaveces
Jeffrey VillavecesJeffrey Villaveces
Jeffrey Villaveces
 
FDL 2018 Virtual Briefing 1
FDL 2018 Virtual Briefing 1FDL 2018 Virtual Briefing 1
FDL 2018 Virtual Briefing 1
 
Geospatial_Center_Brochure_2016
Geospatial_Center_Brochure_2016Geospatial_Center_Brochure_2016
Geospatial_Center_Brochure_2016
 
Gis
GisGis
Gis
 
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...
Optimized Neural Networks Using Principal Component Analysis for Automatic Ro...
 
Group 3 presentation
Group 3 presentationGroup 3 presentation
Group 3 presentation
 
Thrive2055_UTCGIS_2015ii
Thrive2055_UTCGIS_2015iiThrive2055_UTCGIS_2015ii
Thrive2055_UTCGIS_2015ii
 
Irsposter
IrsposterIrsposter
Irsposter
 
Dip application 1
Dip application 1Dip application 1
Dip application 1
 
Challenges to Large Scale Mapping: Can Data Geometry Help?
Challenges to Large Scale Mapping: Can Data Geometry Help?Challenges to Large Scale Mapping: Can Data Geometry Help?
Challenges to Large Scale Mapping: Can Data Geometry Help?
 

Ähnlich wie Machine Learning for Better Maps

Brandon Barnett_Resume2015_Web
Brandon Barnett_Resume2015_WebBrandon Barnett_Resume2015_Web
Brandon Barnett_Resume2015_WebBrandon Barnett
 
Data science and visualization lab presentation
Data science and visualization lab presentationData science and visualization lab presentation
Data science and visualization lab presentationiHub Research
 
Using R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataUsing R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataIJCSIS Research Publications
 
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and AppsGIS in the Rockies
 
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...AdityaAllamraju1
 
Use of Technology for field data capture and compilation, and the implications
Use of Technology for field data capture and compilation, and the implications Use of Technology for field data capture and compilation, and the implications
Use of Technology for field data capture and compilation, and the implications FAO
 
IRJET- University Campus Event Navigation System
IRJET-  	  University Campus Event Navigation System   IRJET-  	  University Campus Event Navigation System
IRJET- University Campus Event Navigation System IRJET Journal
 
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET Journal
 
IRJET- Popularity based Recommender Sytsem for Google Maps
IRJET-  	  Popularity based Recommender Sytsem for Google MapsIRJET-  	  Popularity based Recommender Sytsem for Google Maps
IRJET- Popularity based Recommender Sytsem for Google MapsIRJET Journal
 
The State of GIS in Washington & Oregon The 2014 GMI Metric Survey
The State of GIS in Washington & Oregon  The 2014 GMI Metric SurveyThe State of GIS in Washington & Oregon  The 2014 GMI Metric Survey
The State of GIS in Washington & Oregon The 2014 GMI Metric SurveyGreg Babinski
 
Open trip planner status update may 2011
Open trip planner status update may 2011Open trip planner status update may 2011
Open trip planner status update may 2011bibianamchugh
 
Usability Engineering For OSM - SOTM 2007
Usability Engineering For OSM - SOTM 2007Usability Engineering For OSM - SOTM 2007
Usability Engineering For OSM - SOTM 2007Muki Haklay
 
Netspeed-GIS (1).ppt
Netspeed-GIS (1).pptNetspeed-GIS (1).ppt
Netspeed-GIS (1).pptRajashekhar L
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Toolsijsrd.com
 

Ähnlich wie Machine Learning for Better Maps (20)

Brandon Barnett_Resume2015_Web
Brandon Barnett_Resume2015_WebBrandon Barnett_Resume2015_Web
Brandon Barnett_Resume2015_Web
 
Data science and visualization lab presentation
Data science and visualization lab presentationData science and visualization lab presentation
Data science and visualization lab presentation
 
Using R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataUsing R for Classification of Large Social Network Data
Using R for Classification of Large Social Network Data
 
Routeguru
RouteguruRouteguru
Routeguru
 
Chapter3 application requirements
Chapter3 application requirementsChapter3 application requirements
Chapter3 application requirements
 
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps
2018 GIS in Recreation: The Latest Trail Technology Crowdsourcing Maps and Apps
 
Resume
ResumeResume
Resume
 
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...
IEEE SIGHT Bombay section webinar talk on GIS & Remote Sensing-Introduction t...
 
Use of Technology for field data capture and compilation, and the implications
Use of Technology for field data capture and compilation, and the implications Use of Technology for field data capture and compilation, and the implications
Use of Technology for field data capture and compilation, and the implications
 
IRJET- University Campus Event Navigation System
IRJET-  	  University Campus Event Navigation System   IRJET-  	  University Campus Event Navigation System
IRJET- University Campus Event Navigation System
 
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
 
Resume Diego Marinho de Oliveira
Resume Diego Marinho de OliveiraResume Diego Marinho de Oliveira
Resume Diego Marinho de Oliveira
 
IRJET- Popularity based Recommender Sytsem for Google Maps
IRJET-  	  Popularity based Recommender Sytsem for Google MapsIRJET-  	  Popularity based Recommender Sytsem for Google Maps
IRJET- Popularity based Recommender Sytsem for Google Maps
 
Symposium 2008
Symposium 2008Symposium 2008
Symposium 2008
 
SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"
 
The State of GIS in Washington & Oregon The 2014 GMI Metric Survey
The State of GIS in Washington & Oregon  The 2014 GMI Metric SurveyThe State of GIS in Washington & Oregon  The 2014 GMI Metric Survey
The State of GIS in Washington & Oregon The 2014 GMI Metric Survey
 
Open trip planner status update may 2011
Open trip planner status update may 2011Open trip planner status update may 2011
Open trip planner status update may 2011
 
Usability Engineering For OSM - SOTM 2007
Usability Engineering For OSM - SOTM 2007Usability Engineering For OSM - SOTM 2007
Usability Engineering For OSM - SOTM 2007
 
Netspeed-GIS (1).ppt
Netspeed-GIS (1).pptNetspeed-GIS (1).ppt
Netspeed-GIS (1).ppt
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Tools
 

Mehr von Louisa Diggs

Using Active Learning to Quantify how Training Data Errors Impact Classificat...
Using Active Learning to Quantify how Training Data Errors Impact Classificat...Using Active Learning to Quantify how Training Data Errors Impact Classificat...
Using Active Learning to Quantify how Training Data Errors Impact Classificat...Louisa Diggs
 
Generating Training Data from Noisy Measrements
Generating Training Data from Noisy MeasrementsGenerating Training Data from Noisy Measrements
Generating Training Data from Noisy MeasrementsLouisa Diggs
 
Cropped Field Boundaries, Food Systems, & Fire
Cropped Field Boundaries, Food Systems, & FireCropped Field Boundaries, Food Systems, & Fire
Cropped Field Boundaries, Food Systems, & FireLouisa Diggs
 
A Random Walk of Issues Related to Training Data and Land Cover Mapping
A Random Walk of Issues Related to Training Data and Land Cover MappingA Random Walk of Issues Related to Training Data and Land Cover Mapping
A Random Walk of Issues Related to Training Data and Land Cover MappingLouisa Diggs
 
Assessing Land Cover Change using Uncertain Data
Assessing Land Cover Change using Uncertain DataAssessing Land Cover Change using Uncertain Data
Assessing Land Cover Change using Uncertain DataLouisa Diggs
 
Informal Settlements and Cadastral Mapping
Informal Settlements and Cadastral MappingInformal Settlements and Cadastral Mapping
Informal Settlements and Cadastral MappingLouisa Diggs
 
Sources of Map Error in Public Health Activities and Operations Research
Sources of Map Error in Public Health Activities and Operations ResearchSources of Map Error in Public Health Activities and Operations Research
Sources of Map Error in Public Health Activities and Operations ResearchLouisa Diggs
 
Measuring the impact of label noise on semantic segmentation using rastervision
Measuring the impact of label noise on semantic segmentation using rastervisionMeasuring the impact of label noise on semantic segmentation using rastervision
Measuring the impact of label noise on semantic segmentation using rastervisionLouisa Diggs
 
Mapping Smallholder Yields Using Micro-Satellite Data
Mapping Smallholder Yields Using Micro-Satellite DataMapping Smallholder Yields Using Micro-Satellite Data
Mapping Smallholder Yields Using Micro-Satellite DataLouisa Diggs
 
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASACrowdsourcing Land Cover and Land Use Data: Experiences from IIASA
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASALouisa Diggs
 
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: The use of remote sensing, geostatistical and machine learning met...IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: The use of remote sensing, geostatistical and machine learning met...Louisa Diggs
 
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Predicting the environmental suitability of podoconiosis in EthiopiaIMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Predicting the environmental suitability of podoconiosis in EthiopiaLouisa Diggs
 
IMED 2018: Landcover/habitat
IMED 2018: Landcover/habitatIMED 2018: Landcover/habitat
IMED 2018: Landcover/habitatLouisa Diggs
 
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...Louisa Diggs
 
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...Louisa Diggs
 
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...Louisa Diggs
 

Mehr von Louisa Diggs (16)

Using Active Learning to Quantify how Training Data Errors Impact Classificat...
Using Active Learning to Quantify how Training Data Errors Impact Classificat...Using Active Learning to Quantify how Training Data Errors Impact Classificat...
Using Active Learning to Quantify how Training Data Errors Impact Classificat...
 
Generating Training Data from Noisy Measrements
Generating Training Data from Noisy MeasrementsGenerating Training Data from Noisy Measrements
Generating Training Data from Noisy Measrements
 
Cropped Field Boundaries, Food Systems, & Fire
Cropped Field Boundaries, Food Systems, & FireCropped Field Boundaries, Food Systems, & Fire
Cropped Field Boundaries, Food Systems, & Fire
 
A Random Walk of Issues Related to Training Data and Land Cover Mapping
A Random Walk of Issues Related to Training Data and Land Cover MappingA Random Walk of Issues Related to Training Data and Land Cover Mapping
A Random Walk of Issues Related to Training Data and Land Cover Mapping
 
Assessing Land Cover Change using Uncertain Data
Assessing Land Cover Change using Uncertain DataAssessing Land Cover Change using Uncertain Data
Assessing Land Cover Change using Uncertain Data
 
Informal Settlements and Cadastral Mapping
Informal Settlements and Cadastral MappingInformal Settlements and Cadastral Mapping
Informal Settlements and Cadastral Mapping
 
Sources of Map Error in Public Health Activities and Operations Research
Sources of Map Error in Public Health Activities and Operations ResearchSources of Map Error in Public Health Activities and Operations Research
Sources of Map Error in Public Health Activities and Operations Research
 
Measuring the impact of label noise on semantic segmentation using rastervision
Measuring the impact of label noise on semantic segmentation using rastervisionMeasuring the impact of label noise on semantic segmentation using rastervision
Measuring the impact of label noise on semantic segmentation using rastervision
 
Mapping Smallholder Yields Using Micro-Satellite Data
Mapping Smallholder Yields Using Micro-Satellite DataMapping Smallholder Yields Using Micro-Satellite Data
Mapping Smallholder Yields Using Micro-Satellite Data
 
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASACrowdsourcing Land Cover and Land Use Data: Experiences from IIASA
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
 
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: The use of remote sensing, geostatistical and machine learning met...IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
 
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Predicting the environmental suitability of podoconiosis in EthiopiaIMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
 
IMED 2018: Landcover/habitat
IMED 2018: Landcover/habitatIMED 2018: Landcover/habitat
IMED 2018: Landcover/habitat
 
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
 
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
 
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
 

Kürzlich hochgeladen

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Machine Learning for Better Maps

  • 1. Machine Learning for Better Maps Zhuangfang Yi- Development Seed - @geonanayi Workshop @Clark University, 01/07/2019 OpenStreetMap label quality and geodiversity for machine learning applications
  • 2. Deep Learning Machine Learning + OpenStreetMap + Satellite Imagery
  • 3. Urban = Yes Image Classification Training Dataset
  • 6. Label Maker OpenStreetMap is an attractive label/tags database for machine learning applications that holds repid updated mapped object daily by thousands of users.
  • 7.
  • 8. Training Data Completeness Matters - OSM tag/label info and popularity Tag info in France. Landuse is one of tags that has been frequently used by the users.
  • 9. Label Maker OpenStreetMap is an attractive label/tags database for machine learning applications that host rapidly update mapped object daily by thousands of users.
  • 10. Training data Completeness Matters OpenStreetMap Label Quality for Machine Learning Applications
  • 11. ISO standard for geographic information data: positional accuracy, completeness, and logical consistency. Other data quality issues in OSM: - Vandalism - Missing details - Completeness and accuracy Training data Completeness Matters
  • 12. Training Data Completeness Matters Available tools for data quality assessment: - OSM analytics (OSM v.s. Human Settlement Layer) - OSM-lint (e.g. OSM v.s. US census TIGER in USA)
  • 13. Training Data Completeness Matters Building classification in Vietnam with LeNet on AWS SageMaker. Individual building detection with Tensorflow Object detection in Mexico 60% -> 84% from Vietnam to Mexico
  • 14. - OSM label data + satellite images match - OSM label data is not well-aligned with the paired satellite image Training Data Completeness Matters
  • 15. Training Data Completeness Matters HOT Task Manager
  • 16. Training Data Completeness Matters Urchn for urban change detection with ML
  • 17. Training Data Geodiversity Matters Applying Machine Learning Applications for Geospatial Analysis
  • 18. Training Data Geodiversity Matters High-voltage grid detection with deep learning in Pakistan, Nigeria, and Zambia
  • 21. Training Data Geodiversity Matters Urban settlement change detection in Ethiopia between 2000 - 2017 with random
  • 22. Conclusions When it comes to applying machine learning applications: - Training data quality matters, and to use OSM label data for ML applications, I recommend: 1. Do a proper label completeness assessment with currently available tools; 2. Check OSM tag/label info and frequency for your area of interest; 3. For segmentation ML application, make sure the image tiles align-well with your label dataset; 4. Prepare training dataset using: Label Maker, or RoboSat or other data prep tools. - Training data geodiversity matters, and recommend to do: a. data/image feature similarity analysis;
  • 24. Data Completeness Matters HOT Analytics for Health With support of the Bill and Melinda Gates Foundation and the Clinton Health Access Initiative, we have designed an analysis tool to evaluate the accuracy and precision of OpenStreetMap field data.
  • 25. Other data quality issues in OSM: - Vandalism - Missing details - Completeness and accuracy The results of this analysis found the positional accuracy of OpenStreetMap data to be very good in comparison to OS MasterMap, with over 80% overlap between most the road objects tested between the two datasets. The results also found there to be a positive correlation between road name attribute completeness and number of users per area. Training data Completeness Matters