Report sampling the 20,000 open data datasets, 103 open data portals, most of the portal managers and the 400 data-driven services. The results show a misalignment between the features of published data and the data needed to create data-driven businesses. Authors: Alberto Abella, Marta Ortiz de Urbina and Carmen de Pablos, with funds and support from the COTEC Foundation.
The reuse of open data: an opportunity
1. The reuse of open data: an opportunity for Spain
PRESENTATION
The reuse of open data:
an opportunity for Spain
Project selected within the 2016 call
of the Open Innovation Program (PIA)
of the Cotec Foundation for Innovation.
This project has received technical support
from the Department of Economy of Cotec.
Dr. Alberto Abella
Dra. Marta Ortiz de Urbina Criado
Dra. Carmen de Pablos Heredero
WWW.COTEC.ES
4. Universe → Sampling
Datasets: 20,026* → 103* → 10%
Open data portals: 153 → 103*
Portal managers: 84 → 27 answers
Services: 470 → 59 services → 12%
* March 2017
WORK DONE (MARCH-APRIL 2017)
The reuse of open data: an opportunity for Spain WWW.COTEC.ES
5. 5% of the datasets published in
open data portals in Spain are not
reusable because of the technical
format in which they are published.
RESULTS: DATA REUSABILITY
6. More than half of the published
datasets do not contain any kind of
geographic information, and only
16% have coordinates.
RESULTS: DATA REUSABILITY
7. Less than 1% of the datasets
published in open data portals
in Spain are published in real time
(updated more than once a
minute).
RESULTS: DATA REUSABILITY
8. More than two-thirds (68%)
of all published datasets only
allow basic reuse according
to the MELODA metric (http://meloda.org).
On the other hand, only 8% are
considered inadequate for
data reuse.
RESULTS: DATA REUSABILITY
9. RESULTS: DATA TOPICS
Topic distribution according to the Spanish NTI-RISP regulation
(1st: employment; 2nd: economy and taxes)
10. Weight → Concept
20% → More than 30 datasets in the portal catalog
10% → Catalog updates available
15% → Use of a Data Management Solution (CKAN, Socrata, OpenDataSoft, ESRI open data)
25% → Availability of an API
30% → Applications section (based on open data) in the portal
RESULTS: DATA PORTAL MATURITY
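The weights above add up to 100%, so a portal's maturity can be read as the sum of the weights of the criteria it meets. The following is a minimal sketch of that reading, not the authors' published procedure: the criterion keys and the example portal are hypothetical names chosen for illustration, and the slides do not state the score above which a portal counts as "high maturity", so no threshold is applied.

```python
# Weights (in percentage points) as listed on the slide.
# The dictionary keys are hypothetical identifiers, not names used by the authors.
MATURITY_WEIGHTS = {
    "more_than_30_datasets": 20,
    "catalog_updates_available": 10,
    "uses_dms": 15,
    "has_api": 25,
    "has_applications_section": 30,
}


def maturity_score(features):
    """Return the sum of the weights (0 to 100) of the criteria a portal meets."""
    unknown = set(features) - set(MATURITY_WEIGHTS)
    if unknown:
        raise ValueError(f"unknown criteria: {sorted(unknown)}")
    return sum(
        weight
        for name, weight in MATURITY_WEIGHTS.items()
        if features.get(name, False)
    )


# Hypothetical portal: large catalog, a DMS and an API, but no update feed
# and no applications section.
example_portal = {
    "more_than_30_datasets": True,
    "uses_dms": True,
    "has_api": True,
}
print(maturity_score(example_portal))  # 20 + 15 + 25 = 60
```

Under this reading, the applications section and the API together account for more than half of the score, which matches the report's emphasis on reuse rather than on catalog size.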
11. RESULTS: DATA PORTALS MATURITY
>74% of all Spanish datasets
are published in portals with
high maturity
40% use a DMS tool
12. Almost 1 in 5 portal owners do not know of
any reuser of their data.
Public administrations themselves are the main
consumers of the data.
Data portal managers are mostly
unaware of the product or process
innovations generated with the
data they publish.
40% of the portal managers confirm that they DO NOT keep logs of
accesses to the published data.
No portal manager reported undertaking any systematic activity to
promote the use of open data.
RESULTS: DATA PORTAL MANAGERS
13. Public administrations generate
43% of the sampled services.
On the other hand, 30% are
created by for-profit organizations.
72% provide geolocalized
services.
35% provide services in real time.
Most popular topics: Other,
Tourism and Geography (maps
and guides)
RESULTS: OPENDATA-DRIVEN SERVICES
14. Of the services with a for-profit
business model, 66% are real-time
and 87% are geolocalized.
Most of the services generated
are dedicated to transport (47%)
and meteorology (27%) *
* The share of data published in Spanish
open data portals is 4% for transport
and 7% for meteorology (within the
environment category)
RESULTS: OPENDATA-DRIVEN SERVICES
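One way to make the gap on this slide concrete is to divide each topic's share of services by its share of published data. The sketch below does only that arithmetic with the four percentages quoted on the slide; the name "demand-to-supply ratio" is an illustrative label introduced here, not a metric defined in the report.

```python
# Percentages quoted on the slide.
service_share = {"transport": 47, "meteorology": 27}  # % of services generated
dataset_share = {"transport": 4, "meteorology": 7}    # % of published data


def demand_to_supply_ratio(topic):
    """Share of services on a topic divided by the share of published data on it."""
    return service_share[topic] / dataset_share[topic]


for topic in service_share:
    print(f"{topic}: {demand_to_supply_ratio(topic):.1f}x")
```

A ratio well above 1 means that services concentrate on a topic far more than publishers do, which is the topic misalignment the report points to.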
15. Business models identified:
● Exploiting users' data
● Advertisements
● Freemium
● Contextual recommendations
RESULTS: DATA-DRIVEN BUSINESS-MODELS
16. Limited knowledge of reusers, their value creation and the innovations generated
Non-systematic promotion of data reuse
Misalignment in:
Published data
Topics
Update frequency (<1% vs 66%)
Geolocalization (16% vs 87%)
Low level of DMS use
% of sustainable services (50%) is higher than expected.
Those with a for-profit business model are also high (25%)
CONCLUSIONS: DATA ECONOMY
17. City of Vitoria:
Organico, Funcional, Economico, Denominación Orgánica Nivel1, Denominacion Organica General, Denominacion Aplicacion 2017
City of Madrid:
Centro, Descripcion Centro, Seccion, Descripcion Seccion, Programa, Descripcion Programa, Capitulo, Descripcion Capitulo, Economico, Descripcion Economico, CRED_INICIAL, MOD_CREDITO, C_DEFINITIVO, C_AUTORIZADO, C_DISPUESTO, OBL_RECONOC
City of Cáceres:
uri, om_ejecucionPresupuestoFormadoPor, rdfs_label, om_presupuestoEjecutado, om_trimestreEjecucionPresupuesto,
om_ejecucionPresupuestoFormadoPor_reintegroDeGastos,
om_ejecucionPresupuestoFormadoPor_pagosRealizados,
om_ejecucionPresupuestoFormadoPor_numeroCapitulo,
om_ejecucionPresupuestoFormadoPor_modificacionesCapitulo,
om_ejecucionPresupuestoFormadoPor_estadoDeLaEjecucion,
om_ejecucionPresupuestoFormadoPor_obligacionesReconocidas
CONCLUSIONS: OTHER CONCLUSIONS
Lists of the fields published for the budget of three different cities. No field is shared by all three.
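The comparison of the three field lists can be sketched as a set intersection. This is an illustrative check, not the authors' method. It assumes each comma-separated token on the slide is one field name, and that the names running across line breaks (`C_DISPUESTO` / `OBL_RECONOC`, `rdfs_label` / `om_presupuestoEjecutado`) are separate fields. The `normalise` helper is a hypothetical addition that ignores case and accents.

```python
import unicodedata


def normalise(name):
    """Casefold a field name and strip accents so trivial variants compare equal."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return without_accents.casefold().strip()


# Field names as listed on the slide (one entry per comma-separated token).
budget_fields = {
    "Vitoria": [
        "Organico", "Funcional", "Economico", "Denominación Orgánica Nivel1",
        "Denominacion Organica General", "Denominacion Aplicacion 2017",
    ],
    "Madrid": [
        "Centro", "Descripcion Centro", "Seccion", "Descripcion Seccion",
        "Programa", "Descripcion Programa", "Capitulo", "Descripcion Capitulo",
        "Economico", "Descripcion Economico", "CRED_INICIAL", "MOD_CREDITO",
        "C_DEFINITIVO", "C_AUTORIZADO", "C_DISPUESTO", "OBL_RECONOC",
    ],
    "Cáceres": [
        "uri", "om_ejecucionPresupuestoFormadoPor", "rdfs_label",
        "om_presupuestoEjecutado", "om_trimestreEjecucionPresupuesto",
        "om_ejecucionPresupuestoFormadoPor_reintegroDeGastos",
        "om_ejecucionPresupuestoFormadoPor_pagosRealizados",
        "om_ejecucionPresupuestoFormadoPor_numeroCapitulo",
        "om_ejecucionPresupuestoFormadoPor_modificacionesCapitulo",
        "om_ejecucionPresupuestoFormadoPor_estadoDeLaEjecucion",
        "om_ejecucionPresupuestoFormadoPor_obligacionesReconocidas",
    ],
}

normalised = {
    city: {normalise(field) for field in fields}
    for city, fields in budget_fields.items()
}

# Fields present in every city's budget dataset.
common_to_all = set.intersection(*normalised.values())
print(common_to_all)

# Pairwise overlap, to see how close any two publishers are.
vitoria_madrid = normalised["Vitoria"] & normalised["Madrid"]
print(vitoria_madrid)
```

Under these assumptions the three-way intersection is empty. The only pairwise overlap is `Economico`, which appears in both the Vitoria and Madrid lists, so a reuser still needs a separate mapping for each city.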
18. Poorly documented and inhomogeneous data models increase the cost of reuse
Scattered sources
DMS performance and features are improving
Sustainability of business models?
Progressive improvement, but slower than required
CONCLUSIONS: OTHER CONCLUSIONS
19. Can current standardisation mechanisms
standardise data publishing?
Spain 2009: 0 datasets in open data portals
Spain 2017: 20,000 datasets
At least 400 different types of datasets (own estimate: more than 2,000)
50 standardisations per year (250 with the higher estimate)
4 standardisations per month, for Spain alone (20 with the higher estimate)
This exceeds the current capacity of standardisation bodies
PROPOSAL: EUROPEAN ASSOCIATION OF DATA PUBLISHERS
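The yearly and monthly figures on this slide appear to follow from spreading the number of dataset types over the 2009 to 2017 period. The sketch below reproduces that arithmetic under two assumptions that the slide does not state: the period is taken as 8 years, and results are rounded down. The function name is hypothetical.

```python
YEARS = 2017 - 2009  # period covered by the slide: 8 years


def standardisations_needed(dataset_types, years=YEARS):
    """Return (per_year, per_month) standardisations, both rounded down."""
    per_year = dataset_types // years
    per_month = per_year // 12
    return per_year, per_month


print(standardisations_needed(400))   # minimum on the slide: (50, 4)
print(standardisations_needed(2000))  # authors' own estimate: (250, 20)
```

Both results match the figures on the slide, including the values in parentheses, which supports reading those as the rates implied by the higher estimate of 2,000 dataset types.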
20. How many accesses do current data portals receive?
E.g. the most downloaded dataset in open data format for the
city of XXXXXX gets 2 downloads/day, and it is one of the most
successful portals in Spain.
What value and what innovation is this publication of data
actually generating for society?
Can data be published differently to increase the number of
businesses created and to support data-driven innovation?
PROPOSAL: EUROPEAN ASSOCIATION OF DATA PUBLISHERS
21. Association of people (working for data publishers)
Non-profit
European scope
Public and private publishers
DISCUSSION AND DEBATE
Web: https://www.opendatapublishers.eu
Support: Open Data Initiative (secretariat)
http://iniciativabarcelonaopendata.cat/
PROPOSAL: EUROPEAN ASSOCIATION OF DATA PUBLISHERS