3. (Some of) What’s Broken
• Development Data
– Broken data formats, access, coverage, standards
– Ignored data sources
– Human vs Data disconnect
• Crisis Data
– Remote vs Ground disconnect
– Crisis vs Development disconnect
– Deployment lead overload
• Communities
– Stovepipes, fiefdoms, imperialism, finding…
5. Typical Workflow
[Workflow diagram; elements in layout order]
• Websites: HTML, XLS, CSV, APIs etc.
• Scrapers & APIs
• CSV stores (online and offline)
• Template Creator
• Partially filled spreadsheet
• Volunteer Researchers
• Completed DNA spreadsheet
• Analysts
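One step of this workflow, going from scraped records to a partially filled spreadsheet that volunteer researchers complete, might look roughly like the sketch below. It is illustrative only: the ordering of steps is inferred from the diagram layout, and the function names `write_template` and `blank_cells` and the use of CSV for the template are assumptions, not part of the original workflow.

```python
import csv
import io


def write_template(records, columns):
    """Write scraped records as CSV text, leaving blank cells where the
    scraper found no value (hypothetical helper)."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=columns,
        restval="",             # missing fields become empty cells
        extrasaction="ignore",  # drop scraped fields the template does not use
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


def blank_cells(csv_text):
    """Return (row_number, column) pairs that still need a researcher
    to fill them in; row numbers start at 1 for the first data row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        (row_number, column)
        for row_number, row in enumerate(reader, start=1)
        for column, value in row.items()
        if value == ""
    ]
```

A spreadsheet would count as completed, in this sketch, once `blank_cells` returns an empty list.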
6. Data Access
Online, under an open license
Structured (e.g. Excel, not PDF)
Non-proprietary (e.g. CSV, not ArcGIS)
URI / API (so people can point at it)
Linked to other data (to give context)
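The five criteria above can be treated as a checklist to run against any dataset. The sketch below is a minimal illustration; the class `DatasetAccess`, its field names and the function `unmet_criteria` are hypothetical names introduced here, not an existing tool.

```python
from dataclasses import dataclass

# The five access criteria from the slide, in the order listed.
CRITERIA = [
    ("online_open_license", "Online, under an open license"),
    ("structured", "Structured (e.g. Excel, not PDF)"),
    ("non_proprietary", "Non-proprietary (e.g. CSV, not ArcGIS)"),
    ("has_uri", "URI / API (so people can point at it)"),
    ("linked", "Linked to other data (to give context)"),
]


@dataclass
class DatasetAccess:
    """Hypothetical record of which access criteria a dataset meets."""
    online_open_license: bool = False
    structured: bool = False
    non_proprietary: bool = False
    has_uri: bool = False
    linked: bool = False


def unmet_criteria(dataset):
    """Return the labels of the criteria the dataset does not yet meet."""
    return [label for field, label in CRITERIA if not getattr(dataset, field)]
```

For example, a spreadsheet published online under an open license but only as an Excel file would be recorded as `DatasetAccess(online_open_license=True, structured=True)`, and `unmet_criteria` would list the remaining three items.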
8. Standardise
DR Congo in Data.UN.Org:
• “Congo, Democratic Republic of the”, “Congo Democratic”,
“Democratic Republic of the Congo”, “Congo (Democratic Republic of
the)”, “Congo, Dem. Rep.”, “Congo Dem. Rep.”, “Congo, Democratic
Republic of”, “Dem. Rep. of Congo”, “Dem. Rep. of the Congo”
DR Congo in common standards:
• “Democratic Republic of the Congo” (UN Stats), “Congo, The
Democratic Republic of the” (ISO 3166), “Congo, Democratic Republic
of the” (FIPS 10, STANAG), “180” (UN Stats), “COD” (ISO 3166, STANAG),
“CG” (FIPS 10)