SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
“Open Data Web” –
A Linked Open Data Repository Built with CKAN
Cheng-Jen Lee
Andrea Wei-Ching Huang
Tyng-Ruey Chuang
Institute of Information Science, Academia Sinica, Taiwan
CKANCon 2016@Madrid
2016/10/04
Slide and Transcript
• (Slide URL&QR code)
2
Transcript
https://hackmd.io/s/rJIcV6Op
Outline
• Data Source
• Linked Data
• From Archive Catalog to Linked Data
• Linked Open Data Repository: Open Data Web
• System Architecture
• Implementation
• Limitations
• Future Work
3
Data Source
• Union Catalog of Digital Archives Taiwan
• http://catalog.digitalarchives.tw
• Web catalog for digitized archives in 14 domains
from many institutions.
• Part of the catalog is released under CC licenses
• About 840,000 catalog records.
• Free to copy and redistribute.
• Represent resources in a linked data format
• Provide semantic query for time, place, object, etc.
• Enrich resources by linking them to third-party datasets.
4
Linked Data
• Linked Data (from Wikipedia)
• A method of publishing structured data.
• It can be interlinked and become more useful
through semantic queries.
• Linked Open Data is linked data that is open content.
• Mostly in the form of RDF.
• RDF (from W3C RDF 1.1 Primer)
• Resource Description Framework
• A framework for expressing information about resources.
• RDF can enrich a dataset by linking it to third-party datasets.
• Ex. Enrich a dataset about paintings by linking them to the
corresponding artists in Wikidata.
5
RDF Data Model
• A Triple: <subject> <predicate> <object>
• <Bob> <is a> <person>.
• <Bob> <is interested in> <the Mona Lisa>.
• <the Mona Lisa> <was created by> <Leonardo da Vinci>.
Source: https://www.w3.org/TR/2014/NOTE-rdf11-primer-20140624/#section-triple
6
From Archive Catalog to Linked
Data
• We converted archive catalog to two versions of linked data.
• Version D: triples with just Dublin Core descriptions from
the catalog
• D means Dublin Core
• Version R: mapping column values in the catalog to external
datasets (with domain vocabularies) to give enriched
semantics
• R means Refined
• Extract place names from "Coverage" column (dc:coverage) in the
catalog and map them to place IDs on geonames.org.
• Normalize values in "Date" column (dc:date) to ISO8601 format, or
map them to Wikidata IDs.
• Map titles of biology archives to entries on Encyclopedia of Life.
7
Archive
Catalog
XML&CSV
txn:hasEOLPage
<http://eol.org/pages/1134120> ;
--------------------------------------------
skos:editorialNote "採集日期" ;
dwc:eventDate "1993-04-25" ;
RDF-like
CSV
Step 1:
Mapping
column
values to
vocabularies
• "採集日期” means date collected in English.
Step 2:
Converting
CSV data to
linked data
Original Data
Results
After Vocabulary Mapping
Linked Data (RDF)
Title 台灣一葉蘭
Date::field 採集日期
Date 1993-04-25
txn:hasEOLPage eol:1134120
rdf:type schema:CreateAction
skos:editorialNote 採集日期
dwc:eventDate 1993-04-25
Vocabulary Mapping and Data
Conversion Python Scripts: https://gitlab.com/iislod/dat2ld
8
Linked Open Data Repository:
Open Data Web (beta)
http://data.odw.tw
Ontology* for Open Data Web (Draft)
http://voc.odw.tw
* Definitions of the vocabularies used to describe objects in RDF.
9
Feature (1): Linked Data Browsing
Main Menu
Records: D version
Refined: R version (still uploading)
http://data.odw.tw/record/
10
Feature (1): Linked Data Browsing
http://data.odw.tw/record/
List of Resources
Filters
11
Feature (1): Linked Data Browsing
http://data.odw.tw/record/
Get D or R version of
the same resource
12
Example: “Girl Lost in Thought”
linked data
(triples)
http://data.odw.tw/record/d4502674
13
Example: “Girl Lost in Thought”
Export single
resource in linked
data format
http://data.odw.tw/record/d4502674
14
• Spatial indexing based on geo:lat and geo:long values.
Resources about
Tainan City
Feature (2): Spatial Query
15
• Temporal indexing based on dct:W3CDTF, xsd:date, and xsd:gYear values.
Resources in 19th century
Feature (3): Temporal Query
16
Feature (4): SPARQL Endpoint
http://data.odw.tw/sparql/ (For testing)
http://sparql.odw.tw/ (For machine access)
17
Feature (5):
Spatial
Representation
• Only for R version (still uploading).
• Only shows geonames
information in the gn:locatedIn
property.
http://data.odw.tw/r1/r1-r4502674
18
System
Architecture
SPARQL
Query Page
HTML
for individual
record
RDF
for individual
record
ckanext-scheming&
ckanext-repeating
template
ckanext-dcat
output profile
User
Access
individual
resource
SPARQL
(testing)
Computer
SPARQL
Linked Data
(Turtle format)
ImportHarvest
Icon made by SimpleIcon
(http://www.flaticon.com/aut
hors/simpleicon) and Freepik
(http://www.flaticon.com/aut
hors/freepik)
19
Implementation (1/3)
• Custom fields
• ckanext-scheming and ckanext-repeating extension
• Define CKAN custom fields for a data type in a JSON file
• Each data type has its own directory.
• Ex. record.json is for D ver. (http://data.odw.tw/record/)
• A field is defined by a JSON object, for example:
{
"field_name": "dc:format",
"label": "dc:format",
"display_property": "dc:format",
"preset": "repeating_text_modified”
},
20
Implementation (2/3)
• Import linked data
• ckanext-dcat extension for linked data import/export
• CKAN harvesting mechanism by ckanext-harvest extension
• Extend DCATRDFHarvester in ckanext.dcat.harvesters.rdf
• Extend RDFProfile in ckanext.dcat.profiles
• def parse_dataset(self, dataset_dict, dataset_ref):
• (Import) Parse dataset_ref from loaded linked data to CKAN’s
dataset_dict
• def graph_from_dataset(self, dataset_dict, dataset_ref):
• (Export) Generate a graph (linked data) dataset_ref from CKAN’s
dataset_dict
• Modify ckanext-dcat itself
• To support more namespace (ckanext-dcat is originally designed
for DCAT vocabularies.)
21
22
Implementation (3/3)
• Virtuoso SPARQL endpoint integration
• ckanext-sparql extension
• Spatial indexing and searching
• ckanext-spatial extension
• Time indexing and searching
• We developed the ckanext-tempsearch extension.
• Source code available on GitLab.
• https://gitlab.com/iislod/
23
Limitations
• Maintaining two triple stores (CKAN & Virtuoso).
• They may be inconsistent since we do not sync them for
now.
• Slow harvesting speed on CKAN.
• 4 hrs+ for harvesting 20,000 records on a Core i7-2600
3.4 GHz machine (still uploading now).
24
Future Work
• Provide native SPARQL queries in CKAN.
• Then we do not need Virtuoso anymore.
• Harvest multiple resources as a CKAN dataset
• To improve import speed.
• Time and place names mappings to third-party
datasets
• Still remains further verification.
25
Open Data Web (http://data.odw.tw)
E-mail: ask AT odw.tw
We welcome your valuable
comments & suggestions!
26
Acknowledgement: Hsin-Ping Chen (k26021409 AT gmail.com)
for processing geonames data.
Find me at @u10313335, http://about.me/SolLee, cjlee AT iis.sinica.edu.tw

Weitere ähnliche Inhalte

Was ist angesagt?

Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013Luis Daniel Ibáñez
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataEUCLID project
 
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersAlphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersNew York University
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online NewsBernardo Najlis
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark GreavesMediabistro
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...DuraSpace
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordMark Wilkinson
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Dag Endresen
 
Introduction | Categories for Description of Works of Art | CDWA-LITE
Introduction | Categories for Description of Works of Art | CDWA-LITE Introduction | Categories for Description of Works of Art | CDWA-LITE
Introduction | Categories for Description of Works of Art | CDWA-LITE Kymberly Keeton
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsNikolaos Konstantinou
 
Interaction with Linked Data
Interaction with Linked DataInteraction with Linked Data
Interaction with Linked DataEUCLID project
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesResearch Data Alliance
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Fabrizio Orlandi
 

Was ist angesagt? (20)

Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked Data
 
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersAlphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
Tese phd
Tese phdTese phd
Tese phd
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
 
Timbuctoo 2 EASY
Timbuctoo 2 EASYTimbuctoo 2 EASY
Timbuctoo 2 EASY
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
Introduction | Categories for Description of Works of Art | CDWA-LITE
Introduction | Categories for Description of Works of Art | CDWA-LITE Introduction | Categories for Description of Works of Art | CDWA-LITE
Introduction | Categories for Description of Works of Art | CDWA-LITE
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF Graphs
 
Querying Linked Data
Querying Linked DataQuerying Linked Data
Querying Linked Data
 
Interaction with Linked Data
Interaction with Linked DataInteraction with Linked Data
Interaction with Linked Data
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
 
Providing Linked Data
Providing Linked DataProviding Linked Data
Providing Linked Data
 
Linked data life cycles
Linked data life cyclesLinked data life cycles
Linked data life cycles
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021
 

Andere mochten auch

20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結andrea huang
 
A preliminary study on Wikipedia Dbpdeia and Wikidata
A preliminary study on Wikipedia Dbpdeia and WikidataA preliminary study on Wikipedia Dbpdeia and Wikidata
A preliminary study on Wikipedia Dbpdeia and Wikidataandrea huang
 
defilarea perfecta ...
defilarea perfecta ...defilarea perfecta ...
defilarea perfecta ...sokoban
 
Let us build a story
Let us build a storyLet us build a story
Let us build a storygouree
 
1er Tumblr Art Meetup Paris by ARTchipel
1er Tumblr Art Meetup Paris by ARTchipel1er Tumblr Art Meetup Paris by ARTchipel
1er Tumblr Art Meetup Paris by ARTchipelDRUBAY Diane
 
What gets into Software code after some time
What gets into Software code  after some timeWhat gets into Software code  after some time
What gets into Software code after some timePrasad Narasimhan
 
Sam's_Story
Sam's_StorySam's_Story
Sam's_Storysokoban
 
Lovemewithallyourheart
LovemewithallyourheartLovemewithallyourheart
Lovemewithallyourheartsokoban
 
Muito Bem! ¡Muy Bien!
Muito Bem! ¡Muy Bien!Muito Bem! ¡Muy Bien!
Muito Bem! ¡Muy Bien!Aster
 
Information + insight = action
Information + insight = actionInformation + insight = action
Information + insight = actionPrasad Narasimhan
 
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...Will Richmond
 
Eduard weston
Eduard westonEduard weston
Eduard westonjuan paez
 
051207 Commonsense Geography Meets Web Technology
051207 Commonsense Geography Meets Web Technology 051207 Commonsense Geography Meets Web Technology
051207 Commonsense Geography Meets Web Technology andrea huang
 
Curiozitati
CuriozitatiCuriozitati
Curiozitatisokoban
 
Media And Money
Media And MoneyMedia And Money
Media And MoneyJohn Healy
 

Andere mochten auch (20)

20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結
 
A preliminary study on Wikipedia Dbpdeia and Wikidata
A preliminary study on Wikipedia Dbpdeia and WikidataA preliminary study on Wikipedia Dbpdeia and Wikidata
A preliminary study on Wikipedia Dbpdeia and Wikidata
 
defilarea perfecta ...
defilarea perfecta ...defilarea perfecta ...
defilarea perfecta ...
 
Let us build a story
Let us build a storyLet us build a story
Let us build a story
 
Planttheseedandletitgrow...
Planttheseedandletitgrow...Planttheseedandletitgrow...
Planttheseedandletitgrow...
 
1er Tumblr Art Meetup Paris by ARTchipel
1er Tumblr Art Meetup Paris by ARTchipel1er Tumblr Art Meetup Paris by ARTchipel
1er Tumblr Art Meetup Paris by ARTchipel
 
romania
romaniaromania
romania
 
What gets into Software code after some time
What gets into Software code  after some timeWhat gets into Software code  after some time
What gets into Software code after some time
 
Sam's_Story
Sam's_StorySam's_Story
Sam's_Story
 
Lovemewithallyourheart
LovemewithallyourheartLovemewithallyourheart
Lovemewithallyourheart
 
Muito Bem! ¡Muy Bien!
Muito Bem! ¡Muy Bien!Muito Bem! ¡Muy Bien!
Muito Bem! ¡Muy Bien!
 
Information + insight = action
Information + insight = actionInformation + insight = action
Information + insight = action
 
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...
Kelly McEttrick, VideoHub and Andrew Feigenson, Nielsen | Integrating GRPs in...
 
Eduard weston
Eduard westonEduard weston
Eduard weston
 
051207 Commonsense Geography Meets Web Technology
051207 Commonsense Geography Meets Web Technology 051207 Commonsense Geography Meets Web Technology
051207 Commonsense Geography Meets Web Technology
 
Curiozitati
CuriozitatiCuriozitati
Curiozitati
 
Young Wave
Young WaveYoung Wave
Young Wave
 
Antarctica
AntarcticaAntarctica
Antarctica
 
Media And Money
Media And MoneyMedia And Money
Media And Money
 
Mt Everest Pp
Mt Everest PpMt Everest Pp
Mt Everest Pp
 

Ähnlich wie 20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN

“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKANChengjen Lee
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And VisualizationIvan Ermilov
 
Putting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM4Science
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS WebinarBen Blaiszik
 
Data standardization process for social sciences and humanities
Data standardization process for social sciences and humanitiesData standardization process for social sciences and humanities
Data standardization process for social sciences and humanitiesvty
 
Ontology-based multi-domain metadata for research data management using tripl...
Ontology-based multi-domain metadata for research data management using tripl...Ontology-based multi-domain metadata for research data management using tripl...
Ontology-based multi-domain metadata for research data management using tripl...João Rocha da Silva
 
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...Jennifer Bowen
 
Azure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare IntegrationAzure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare IntegrationBizTalk360
 
Bingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationBingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationWARCnet
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Ricard de la Vega
 
Research data catalogues and data interoperability in life sciences
Research data catalogues and data interoperability in life sciencesResearch data catalogues and data interoperability in life sciences
Research data catalogues and data interoperability in life sciencesBlue BRIDGE
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Cataloguesdavid-read
 
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...Karen R
 
Best Practices for Descriptive Metadata
Best Practices for Descriptive MetadataBest Practices for Descriptive Metadata
Best Practices for Descriptive MetadataOCLC
 

Ähnlich wie 20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN (20)

“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Putting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar
 
Data standardization process for social sciences and humanities
Data standardization process for social sciences and humanitiesData standardization process for social sciences and humanities
Data standardization process for social sciences and humanities
 
Ontology-based multi-domain metadata for research data management using tripl...
Ontology-based multi-domain metadata for research data management using tripl...Ontology-based multi-domain metadata for research data management using tripl...
Ontology-based multi-domain metadata for research data management using tripl...
 
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
 
Azure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare IntegrationAzure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare Integration
 
Azure DocumentDB
Azure DocumentDBAzure DocumentDB
Azure DocumentDB
 
Bingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationBingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman Presentation
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
Research data catalogues and data interoperability in life sciences
Research data catalogues and data interoperability in life sciencesResearch data catalogues and data interoperability in life sciences
Research data catalogues and data interoperability in life sciences
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
 
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource f...
 
Best Practices for Descriptive Metadata
Best Practices for Descriptive MetadataBest Practices for Descriptive Metadata
Best Practices for Descriptive Metadata
 

Mehr von andrea huang

Reuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and RealizationReuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and Realizationandrea huang
 
結構資料的再次使用:語意、連結與實作
結構資料的再次使用:語意、連結與實作結構資料的再次使用:語意、連結與實作
結構資料的再次使用:語意、連結與實作andrea huang
 
Metadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data RepositoriesMetadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data Repositoriesandrea huang
 
20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museumsandrea huang
 
101203 An event ontology for crisis-disaster information
101203 An event ontology for crisis-disaster information101203 An event ontology for crisis-disaster information
101203 An event ontology for crisis-disaster informationandrea huang
 
081016 Social Tagging, Online Communication, and Peircean Semiotics
081016 Social Tagging, Online Communication, and Peircean Semiotics081016 Social Tagging, Online Communication, and Peircean Semiotics
081016 Social Tagging, Online Communication, and Peircean Semioticsandrea huang
 
060817 Participation Collaboration Mapping
060817 Participation Collaboration Mapping060817 Participation Collaboration Mapping
060817 Participation Collaboration Mappingandrea huang
 
070928 Collaborative Geospatial Mapping And Data Authorization
070928 Collaborative Geospatial Mapping And Data Authorization070928 Collaborative Geospatial Mapping And Data Authorization
070928 Collaborative Geospatial Mapping And Data Authorizationandrea huang
 
041018 Community Gis
041018 Community Gis041018 Community Gis
041018 Community Gisandrea huang
 
051102 Online Community Mapping
051102 Online Community Mapping051102 Online Community Mapping
051102 Online Community Mappingandrea huang
 

Mehr von andrea huang (10)

Reuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and RealizationReuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and Realization
 
結構資料的再次使用:語意、連結與實作
結構資料的再次使用:語意、連結與實作結構資料的再次使用:語意、連結與實作
結構資料的再次使用:語意、連結與實作
 
Metadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data RepositoriesMetadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data Repositories
 
20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums
 
101203 An event ontology for crisis-disaster information
101203 An event ontology for crisis-disaster information101203 An event ontology for crisis-disaster information
101203 An event ontology for crisis-disaster information
 
081016 Social Tagging, Online Communication, and Peircean Semiotics
081016 Social Tagging, Online Communication, and Peircean Semiotics081016 Social Tagging, Online Communication, and Peircean Semiotics
081016 Social Tagging, Online Communication, and Peircean Semiotics
 
060817 Participation Collaboration Mapping
060817 Participation Collaboration Mapping060817 Participation Collaboration Mapping
060817 Participation Collaboration Mapping
 
070928 Collaborative Geospatial Mapping And Data Authorization
070928 Collaborative Geospatial Mapping And Data Authorization070928 Collaborative Geospatial Mapping And Data Authorization
070928 Collaborative Geospatial Mapping And Data Authorization
 
041018 Community Gis
041018 Community Gis041018 Community Gis
041018 Community Gis
 
051102 Online Community Mapping
051102 Online Community Mapping051102 Online Community Mapping
051102 Online Community Mapping
 

Kürzlich hochgeladen

UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfDianaGray10
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureEric D. Schabell
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioChristian Posta
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfinfogdgmi
 
Things you didn't know you can use in your Salesforce
Things you didn't know you can use in your SalesforceThings you didn't know you can use in your Salesforce
Things you didn't know you can use in your SalesforceMartin Humpolec
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Adtran
 
Spring24-Release Overview - Wellingtion User Group-1.pdf
Spring24-Release Overview - Wellingtion User Group-1.pdfSpring24-Release Overview - Wellingtion User Group-1.pdf
Spring24-Release Overview - Wellingtion User Group-1.pdfAnna Loughnan Colquhoun
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...Aggregage
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationIES VE
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsSeth Reyes
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URLRuncy Oommen
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
PicPay - GenAI Finance Assistant - ChatGPT for Customer Service
PicPay - GenAI Finance Assistant - ChatGPT for Customer ServicePicPay - GenAI Finance Assistant - ChatGPT for Customer Service
PicPay - GenAI Finance Assistant - ChatGPT for Customer ServiceRenan Moreira de Oliveira
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.YounusS2
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataSafe Software
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...DianaGray10
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1DianaGray10
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 

Kürzlich hochgeladen (20)

UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdf
 
Things you didn't know you can use in your Salesforce
Things you didn't know you can use in your SalesforceThings you didn't know you can use in your Salesforce
Things you didn't know you can use in your Salesforce
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™
 
Spring24-Release Overview - Wellingtion User Group-1.pdf
Spring24-Release Overview - Wellingtion User Group-1.pdfSpring24-Release Overview - Wellingtion User Group-1.pdf
Spring24-Release Overview - Wellingtion User Group-1.pdf
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and Hazards
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URL
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
PicPay - GenAI Finance Assistant - ChatGPT for Customer Service
PicPay - GenAI Finance Assistant - ChatGPT for Customer ServicePicPay - GenAI Finance Assistant - ChatGPT for Customer Service
PicPay - GenAI Finance Assistant - ChatGPT for Customer Service
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 

20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN

  • 1. “Open Data Web” – A Linked Open Data Repository Built with CKAN Cheng-Jen Lee Andrea Wei-Ching Huang Tyng-Ruey Chuang Institute of Information Science, Academia Sinica, Taiwan CKANCon 2016@Madrid 2016/10/04
  • 2. Slide and Transcript • (Slide URL&QR code) 2 Transcript https://hackmd.io/s/rJIcV6Op
  • 3. Outline • Data Source • Linked Data • From Archive Catalog to Linked Data • Linked Open Data Repository: Open Data Web • System Architecture • Implementation • Limitations • Future Work 3
  • 4. Data Source • Union Catalog of Digital Archives Taiwan • http://catalog.digitalarchives.tw • Web catalog for digitized archives in 14 domains from many institutions. • Part of the catalog is released under CC licenses • About 840,000 catalog records. • Free to copy and redistribute. • Represent resources in a linked data format • Provide semantic query for time, place, object, etc. • Enrich resources by linking them to third-party datasets. 4
  • 5. Linked Data • Linked Data (from Wikipedia) • A method of publishing structured data. • It can be interlinked and become more useful through semantic queries. • Linked Open Data is linked data that is open content. • Mostly in the form of RDF. • RDF (from W3C RDF 1.1 Primer) • Resource Description Framework • A framework for expressing information about resources. • RDF can enrich a dataset by linking it to third-party datasets. • Ex. Enrich a dataset about paintings by linking them to the corresponding artists in Wikidata. 5
  • 6. RDF Data Model • A Triple: <subject> <predicate> <object> • <Bob> <is a> <person>. • <Bob> <is interested in> <the Mona Lisa>. • <the Mona Lisa> <was created by> <Leonardo da Vinci>. Source: https://www.w3.org/TR/2014/NOTE-rdf11-primer-20140624/#section-triple 6
  • 7. From Archive Catalog to Linked Data • We converted archive catalog to two versions of linked data. • Version D: triples with just Dublin Core descriptions from the catalog • D means Dublin Core • Version R: mapping column values in the catalog to external datasets (with domain vocabularies) to give enriched semantics • R means Refined • Extract place names from "Coverage" column (dc:coverage) in the catalog and map them to place IDs on geonames.org. • Normalize values in "Date" column (dc:date) to ISO8601 format, or map them to Wikidata IDs. • Map titles of biology archives to entries on Encyclopedia of Life. 7
  • 8. Archive Catalog XML&CSV txn:hasEOLPage <http://eol.org/pages/1134120> ; -------------------------------------------- skos:editorialNote "採集日期" ; dwc:eventDate "1993-04-25" ; RDF-like CSV Step 1: Mapping column values to vocabularies • "採集日期” means date collected in English. Step 2: Converting CSV data to linked data Original Data Results After Vocabulary Mapping Linked Data (RDF) Title 台灣一葉蘭 Date::field 採集日期 Date 1993-04-25 txn:hasEOLPage eol:1134120 rdf:type schema:CreateAction skos:editorialNote 採集日期 dwc:eventDate 1993-04-25 Vocabulary Mapping and Data Conversion Python Scripts: https://gitlab.com/iislod/dat2ld 8
  • 9. Linked Open Data Repository: Open Data Web (beta) http://data.odw.tw Ontology* for Open Data Web (Draft) http://voc.odw.tw * Definitions of the vocabularies used to describe objects in RDF. 9
  • 10. Feature (1): Linked Data Browsing Main Menu Records: D version Refined: R version (still uploading) http://data.odw.tw/record/ 10
  • 11. Feature (1): Linked Data Browsing http://data.odw.tw/record/ List of Resources Filters 11
  • 12. Feature (1): Linked Data Browsing http://data.odw.tw/record/ Get D or R version of the same resource 12
  • 13. Example: “Girl Lost in Thought” linked data (triples) http://data.odw.tw/record/d4502674 13
  • 14. Example: “Girl Lost in Thought” Export single resource in linked data format http://data.odw.tw/record/d4502674 14
  • 15. • Spatial indexing based on geo:lat and geo:long values. Resources about Tainan City Feature (2): Spatial Query 15
  • 16. • Temporal indexing based on dct:W3CDTF, xsd:date, and xsd:gYear values. Resources in 19th century Feature (3): Temporal Query 16
  • 17. Feature (4): SPARQL Endpoint http://data.odw.tw/sparql/ (For testing) http://sparql.odw.tw/ (For machine access) 17
  • 18. Feature (5): Spatial Representation • Only for R version (still uploading). • Only shows geonames information in the gn:locatedIn property. http://data.odw.tw/r1/r1-r4502674 18
  • 19. System Architecture SPARQL Query Page HTML for individual record RDF for individual record ckanext-scheming& ckanext-repeating template ckanext-dcat output profile User Access individual resource SPARQL (testing) Computer SPARQL Linked Data (Turtle format) ImportHarvest Icon made by SimpleIcon (http://www.flaticon.com/aut hors/simpleicon) and Freepik (http://www.flaticon.com/aut hors/freepik) 19
  • 20. Implementation (1/3) • Custom fields • ckanext-scheming and ckanext-repeating extension • Define CKAN custom fields for a data type in a JSON file • Each data type has its own directory. • Ex. record.json is for D ver. (http://data.odw.tw/record/) • A field is defined by a JSON object, for example: { "field_name": "dc:format", "label": "dc:format", "display_property": "dc:format", "preset": "repeating_text_modified” }, 20
  • 21. Implementation (2/3) • Import linked data • ckanext-dcat extension for linked data import/export • CKAN harvesting mechanism by ckanext-harvest extension • Extend DCATRDFHarvester in ckanext.dcat.harvesters.rdf • Extend RDFProfile in ckanext.dcat.profiles • def parse_dataset(self, dataset_dict, dataset_ref): • (Import) Parse dataset_ref from loaded linked data to CKAN’s dataset_dict • def graph_from_dataset(self, dataset_dict, dataset_ref): • (Export) Generate a graph (linked data) dataset_ref from CKAN’s dataset_dict • Modify ckanext-dcat itself • To support more namespace (ckanext-dcat is originally designed for DCAT vocabularies.) 21
  • 22. 22
  • 23. Implementation (3/3) • Virtuoso SPARQL endpoint integration • ckanext-sparql extension • Spatial indexing and searching • ckanext-spatial extension • Time indexing and searching • We developed the ckanext-tempsearch extension. • Source code available on GitLab. • https://gitlab.com/iislod/ 23
  • 24. Limitations • Maintaining two triple stores (CKAN & Virtuoso). • They may be inconsistent since we do not sync them for now. • Slow harvesting speed on CKAN. • 4 hrs+ for harvesting 20,000 records on a Core i7-2600 3.4 GHz machine (still uploading now). 24
  • 25. Future Work • Provide native SPARQL queries in CKAN. • Then we do not need Virtuoso anymore. • Harvest multiple resources as a CKAN dataset • To improve import speed. • Time and place names mappings to third-party datasets • Still remains further verification. 25
  • 26. Open Data Web (http://data.odw.tw) E-mail: ask AT odw.tw We welcome your valuable comments & suggestions! 26 Acknowledgement: Hsin-Ping Chen (k26021409 AT gmail.com) for processing geonames data. Find me at @u10313335, http://about.me/SolLee, cjlee AT iis.sinica.edu.tw