This document outlines an agenda and presentation on CKAN, an open-source data management system. The presentation covers an introduction to CKAN, a tour of its features for publishing, finding, and managing data, how it supports open data principles, examples of CKAN instances, issues, and installation and harvesting topics.
1. CKAN 2 Introduction
Presenter: 李承錱 Cheng-Jen Lee (Sol)
Email: cjlee AT iis.sinica.edu.tw
This work is licensed under a
Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
2. Agenda
● About CKAN
● Feature Tour
– Publish & Find Datasets
– Store & Manage Data
– Engage with Users & Others
– Customise & Extend
● CKAN and 5 ★ Open Data
● Showcase
● Issues
● Extended Topic 1: Installation
● Extended Topic 2: Harvesters
2014/5/22 2
4. About CKAN
● The Comprehensive Knowledge Archive Network
● A powerful data management system
● Publishing
● Sharing
● Finding
● Using Data
5. About CKAN
83 instances around the world in May 2014
17. Feature Tour (3)
Engage with Users & Others
Organization
18. Feature Tour (3)
Engage with Users & Others
Manage Users of an Organization
19. Feature Tour (3)
Engage with Users & Others
Manage Role of Members
Admin: edit datasets & members
Editor: edit datasets
Viewer: view (private) datasets
Note: Public datasets are visible to everyone
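The role list above can be read as a small permission table. The following is a minimal sketch in plain Python, not CKAN's own authorization code. The names `ROLE_PERMISSIONS` and `can` are hypothetical, and the sketch assumes that Admin and Editor can also view private datasets, which the slide implies but does not state.

```python
# Hypothetical permission table modeled on the three roles listed above.
# Illustration only; this is not CKAN's actual authorization implementation.
ROLE_PERMISSIONS = {
    # Assumption: higher roles include the Viewer's right to see private datasets.
    "admin": {"view_dataset", "edit_dataset", "edit_members"},
    "editor": {"view_dataset", "edit_dataset"},
    "viewer": {"view_dataset"},
}


def can(role, action, private=True):
    """Return True if a user with `role` in the organization may do `action`.

    `role` is None for users who are not members of the organization.
    """
    # Public datasets are visible to everyone, member or not.
    if action == "view_dataset" and not private:
        return True
    return action in ROLE_PERMISSIONS.get(role, set())
```

For example, `can("editor", "edit_members")` is false, while `can(None, "view_dataset", private=False)` is true because public datasets need no membership.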
20. Feature Tour (3)
Engage with Users & Others
Harvest and Federation
21. Feature Tour (3)
Engage with Users & Others
History
22. Feature Tour (4)
Customise & Extend
● RESTful JSON APIs
– The Action API
– The DataStore API
– The FileStore API...
● Extensions (over 60)
– ckanext-harvest
– ckanext-spatial
Open source is good!
● Themable
● Integrates with other CMSs (e.g., Drupal)
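As an illustration of the Action API listed above, here is a minimal sketch of building a `package_search` call and unwrapping its JSON envelope. It is written in Python 3 with the standard library only. The helper names `action_url` and `unwrap` are hypothetical, the site `http://demo.ckan.org` is an assumed example instance, and the response is a canned string rather than a live reply.

```python
import json
from urllib.parse import urlencode


def action_url(site_url, action, **params):
    """Build the URL of a CKAN Action API call (API version 3)."""
    url = "{}/api/3/action/{}".format(site_url.rstrip("/"), action)
    if params:
        # Sort the parameters so the resulting URL is deterministic.
        url += "?" + urlencode(sorted(params.items()))
    return url


def unwrap(response_text):
    """Return the "result" field of an Action API response, or raise on failure."""
    body = json.loads(response_text)
    if not body.get("success"):
        raise RuntimeError(body.get("error"))
    return body["result"]


# Assumed example site and a canned response instead of a live HTTP request.
url = action_url("http://demo.ckan.org", "package_search", q="gold", rows=5)
canned = '{"success": true, "result": {"count": 1, "results": [{"name": "gold-prices"}]}}'
names = [pkg["name"] for pkg in unwrap(canned)["results"]]
```

Every Action API call answers with the same `success`/`result` envelope, so one `unwrap` helper can serve calls such as `package_show` or `organization_list` as well.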
24. CKAN and 5 ★ Open Data
● ★ Make your stuff available on the Web (whatever format) under an open license
25. CKAN and 5 ★ Open Data
● ★★ Make it available as structured data (e.g., Excel instead of an image scan of a table)
● ★★★ Use non-proprietary formats (e.g., CSV instead of Excel)
– Accept any data format
– Beautiful data demonstration
– DataStore: indexing for structured data
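Once structured data is indexed in the DataStore, it can be queried row by row through the `datastore_search` action. The sketch below only constructs the request and does not send it. The function name `datastore_search_request`, the site URL, the resource id `RESOURCE-ID`, and the `year` column are all placeholders assumed for illustration.

```python
import json
from urllib.request import Request


def datastore_search_request(site_url, resource_id, filters=None, limit=10):
    """Build (but do not send) a POST request for the datastore_search action."""
    payload = {"resource_id": resource_id, "limit": limit}
    if filters:
        # filters is a dict of column name -> required value.
        payload["filters"] = filters
    return Request(
        site_url.rstrip("/") + "/api/3/action/datastore_search",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )


# Placeholder site, resource id and column name; substitute real values.
req = datastore_search_request(
    "http://demo.ckan.org", "RESOURCE-ID", filters={"year": 2013}, limit=5
)
```

Passing the request to `urllib.request.urlopen` would perform the query; the matching rows come back under `result.records` in the JSON response.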
26. CKAN and 5 ★ Open Data
● ★★★★ Use URIs to denote things, so that people can point at your stuff
– Permanent link for each dataset
– Get Dataset URI through API
● ★★★★★ Link your data to other data to provide context
– Linked data and RDF for metadata
27. CKAN and 5 ★ Open Data
● RDF for metadata
– DCAT and Dublin Core
– curl -L -H "Accept: application/rdf+xml" http://thedatahub.org/dataset/gold-prices
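The curl command above asks for the RDF form of a dataset page through HTTP content negotiation. A rough Python 3 equivalent using only the standard library might look like the sketch below. The helper names `rdf_request` and `fetch_rdf` are hypothetical, and the sketch does not check whether the URL from the slide still serves RDF.

```python
from urllib.request import Request, urlopen


def rdf_request(dataset_url):
    """Build a request that asks for RDF/XML instead of the HTML page."""
    return Request(dataset_url, headers={"Accept": "application/rdf+xml"})


def fetch_rdf(dataset_url):
    """Send the request and return the response body as text."""
    # urlopen follows HTTP redirects by default, like curl's -L option.
    with urlopen(rdf_request(dataset_url)) as resp:
        return resp.read().decode("utf-8")


# Same URL as on the slide; the request is only built here, not sent.
req = rdf_request("http://thedatahub.org/dataset/gold-prices")
```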
54. Harvesters
Custom harvester
● Implement the harvester interface to perform harvesting operations
● Three stages
– gather: get the identifiers of the remote records
– fetch: fetch the contents of each record
– import: create a CKAN package (dataset)
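The three stages above can be pictured without any CKAN code at all. The sketch below is a framework-free simulation in plain Python: the `REMOTE` dictionary stands in for a remote catalogue, and the functions borrow the stage names but are not the ckanext-harvest interface. All record ids, titles and URLs are made up for illustration.

```python
# Toy in-memory "remote catalogue"; it stands in for a real remote source.
REMOTE = {
    "a1": {"title": "Survey A", "url": "http://example.org/a1.csv"},
    "b2": {"title": "Survey B", "url": "http://example.org/b2.csv"},
}


def gather_stage(source):
    """gather: return the identifiers of the records to harvest."""
    return sorted(source.keys())


def fetch_stage(source, record_id):
    """fetch: retrieve the content of one record and remember its id."""
    return dict(source[record_id], id=record_id)


def import_stage(content):
    """import: turn the fetched content into a CKAN-style package dict."""
    return {
        "name": "harvested-" + content["id"],
        "title": content["title"],
        "resources": [{"url": content["url"]}],
    }


def run_harvest(source):
    """Run gather, then fetch and import for every gathered identifier."""
    return [import_stage(fetch_stage(source, rid)) for rid in gather_stage(source)]


packages = run_harvest(REMOTE)
```

Splitting the work this way means a failure while fetching or importing one record does not have to abort the whole job, since each identifier is processed on its own.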
55. Harvesters
The harvesting interface
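from base import HarvesterBase  # ckanext-harvest: ckanext/harvest/harvesters/base.py

class SRDAHarvester(HarvesterBase):
    def _set_config(self, config_str):
        pass  # parse the harvest source's configuration string
    def info(self):
        pass  # return the harvester's name, title and description
    # ...
    def gather_stage(self, harvest_job):
        pass  # gather: get the identifiers of the remote records
    def fetch_stage(self, harvest_object):
        pass  # fetch: fetch the contents of one record
    def import_stage(self, harvest_object):
        pass  # import: create the CKAN package (dataset)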
See the extension site for details
An example (SRDA): http://goo.gl/ZMnND7
Editor's notes
Supported by the Open Knowledge Foundation
Publication URL
Data.gov.uk is exactly Drupal + CKAN
Store the raw data and metadata. Visualise structured data with interactive tables, graphs and maps.