The Open Data 'Bazaar'
Crowdsourcing Solutions to Improve Data
     Accuracy and Re-use in Kenya

             Qiyang Xu
         qxu1@worldbank.org
Kenya Open Data Initiative
Three Questions about Open Data
• The government, as data collector, launches Open Data for citizens, companies, civil society, etc.
• Accuracy?
• What does inaccuracy mean for the efficacy of open data initiatives?
• Can crowdsourcing (‘crowds’) be leveraged to improve the validity?
Datasets
Location Datasets
• Kenya Primary School 2007
• Health Facilities Kenya
Both available on Kenya Open Data site (opendata.go.ke, 2012)

Geospatial Information Datasets
• Global Administrative Unit Layers (GAUL)
• DigitalGlobe Global Basemap
• Google Earth

Results
• Health Facilities
   Total: 8,232
   For validation: 4,867
   Actual valid: 4,644
      4,203 in DigitalGlobe
      441 in Google Earth
• Primary Schools
   Total: 31,229
   For validation: 110 (random sampling)
   Actual valid: 108
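
The counts on this slide can be turned into shares with a few lines of arithmetic. The sketch below is only illustrative: the variable and function names are assumptions, and it reads "actual valid" as the number of records that could be confirmed against a basemap, which the slide does not define.

```python
# Counts copied from the slide; all names here are illustrative assumptions.
HEALTH_TOTAL = 8232
HEALTH_FOR_VALIDATION = 4867
HEALTH_VALID = 4644
HEALTH_VALID_BY_BASEMAP = {"DigitalGlobe": 4203, "Google Earth": 441}

SCHOOLS_TOTAL = 31229
SCHOOLS_SAMPLED = 110  # drawn by random sampling, per the slide
SCHOOLS_VALID = 108


def share(part: int, whole: int) -> float:
    """Return part / whole as a fraction."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return part / whole


# The per-basemap counts should add up to the reported valid total.
basemap_sum = sum(HEALTH_VALID_BY_BASEMAP.values())

health_valid_share = share(HEALTH_VALID, HEALTH_FOR_VALIDATION)
schools_valid_share = share(SCHOOLS_VALID, SCHOOLS_SAMPLED)
schools_sampling_fraction = share(SCHOOLS_SAMPLED, SCHOOLS_TOTAL)

print(f"Health facilities valid: {health_valid_share:.1%}")
print(f"Primary schools valid (sample): {schools_valid_share:.1%}")
print(f"Primary schools sampled: {schools_sampling_fraction:.2%}")
```

Note that the primary school figure rests on a sample of 110 out of 31,229 records, so it describes the sample rather than the full dataset.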
[Figure: annotated screenshot with callouts for basemaps selection, track changes, school information, and school locations]
Connect available data to ‘crowds’ for more powerful feedback

Track changes
Color of Locator   Description
Green              Original location given in the dataset
Purple             New location added to the dataset
Red                Proposed correction to the original location
Yellow             Current location selected to be moved
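
One way to picture the color legend above is as a small set of marker states. This is a hypothetical sketch, not the tool's actual data model: `MarkerState`, `Marker`, and `propose_correction` are invented names, and the school name and coordinates are made-up placeholders.

```python
from dataclasses import dataclass
from enum import Enum


class MarkerState(Enum):
    """Locator colors and their meanings, as listed in the slide's legend."""
    GREEN = "Original location given in the dataset"
    PURPLE = "New location added to the dataset"
    RED = "Proposed correction to the original location"
    YELLOW = "Current location selected to be moved"


@dataclass(frozen=True)
class Marker:
    """Hypothetical record for one locator on the map."""
    name: str
    lat: float
    lon: float
    state: MarkerState


def propose_correction(original: Marker, lat: float, lon: float) -> Marker:
    """Return a red marker for a proposed fix; the original is left unchanged."""
    if original.state is not MarkerState.GREEN:
        raise ValueError("corrections are proposed against original locations")
    return Marker(original.name, lat, lon, MarkerState.RED)


# Placeholder record and coordinates, for illustration only.
school = Marker("Example Primary School", -1.2921, 36.8219, MarkerState.GREEN)
fix = propose_correction(school, -1.2930, 36.8225)
print(fix.state.name, "-", fix.state.value)
```

Keeping the original marker immutable and returning a new red one mirrors the idea of tracking changes: the dataset's location and the crowd's proposed correction both stay visible.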
Let’s play...
• Honor Board with highlighted contributors:
   • Last registered
   • Most active
   • Best contributor...
• Measure global progress over time
• Clear goals, easy to achieve
• Editing locations and user interactions
Future...
• Multi-language interface
• Community built on Points of Knowledge
• User communication
• Allow picture uploading and GPS position collection using mobile devices
• ...
• All datasets are fully open
• Open source solution preferred
• Collaborative development process
Unleashing the ‘wisdom of crowds’
  Innovation in Governance, World Bank Institute
