Introduction to Prompt Engineering (Focusing on ChatGPT)
20141030 LinDA Workshop echallenges2014 - State of the art in open data infrastructure
1. State of the Art in Open Data Infrastructures for Public Sector
Information
ENGAGE & LinDA Project
Dr. Spiros Mouzakitis
ENGAGE Project Manager
National Technical University
of Athens
Decision Support Systems
Laboratory
eChallenges 2014
2. An overview of Open Data Sources and Data
Management software that we can use today
4. Open Data Landscape
• National Statistical Offices / Eurostat
• Data Gov Initiatives
• European
• Country
• City
• Ministries / Public bodies / Organizations
• World Data Banks / Indicators
• Research data repositories
• Data Marketplaces
• Data aggregators / Catalogs
• Linked Open Data Cloud
EM-DAT
5. Open Data Types
• Catalog - File based
Non-processable format (PDF,images,flash)
Processable format (Excel)
Processable and open format (CSV)
Standard / Tool specific format (SPSS,SDMX,GML)
• Relational Database
• Data sources are already processed - streamlined allowing
advanced services on them (visualizations, aggregations,etc)
• Linked Data (RDF, turtle)
• APIs
• Static / Archived information / Snapshots
• Real-time information
Data provision structure
Data provision structure
Data creation
• Harvest from others
• Backoffice / Service operations
• Survey
• Internal Research / Experiments
6. Standards
• W3C - DCAT
• EC - DCAT Application Profile
• Linked Open Data Vocabularies
• Schema.org
• SDMX
• DDI
• INSPIRE
• ……..
Domain-specific examples
Data catalogue
Vocabularies
Varietions of CKAN metadata, DC, UK
eGovernment Metadata Schema,
7. Obstacles of utilizing data in
everyday business
• Number of datasets vs Quality
• Re-usability
• A directory of offices / phones has no re-usability / research interest
• Aggregated vs Microdata
• Vital context around published data
o Methodology
o Time span
o Internal / External conditions
o Sample
• Linked Data has huge potential but needs
• Commercial focus
• Maintainance
• Trust
8. Open Data Platforms
• CKAN
• DKAN
• Junar
• Socrata
• NESSTAR
• DataPublic
• OpenColibri / ENGAGE
Under development
9. • Catalog system
• API
• Data Store API
• Python-based
framework (pylons)
• Faceted Search
(SOLR)
• Visualizations
Features
Data.gov.uk, publicdata.eu and many other data.gov initiatives including data.gov (US)
10. • Build upon CKAN
features
• Native Drupal
Features
11. • Focus on easing the publication workflow
• Visualizations tables, charts, and maps. Ability for real-time data
• Analytics
Features
13. • Find, browse,
visualize and
analyse data online
• Social sciences
• Surveys
• DDI
Features
• Publisher – freeware with no support, Server - Commercial
Norwegian Social Science Data Services
14. • Drupal Profile
• Open Data
catalogue
• Charts
• Microsoft OGDI
Features