This document summarizes Corey Harper's presentation on Linked Open Data at the Penn Humanities Forum in 2014. The presentation introduced key concepts of the semantic web such as using URIs to identify resources and linking data through relationships. It provided examples of large linked open data projects including DBpedia and the Google Knowledge Graph. The presentation also discussed using linked data to provide additional context and narratives about cultural heritage collections through users' stories and scholars' interactions with archival materials. Harper envisioned linked open data interfaces that aggregate data from multiple sources to provide richer discovery experiences for users.
1. Linked Open Data
Presented for the Penn Humanities Forum
by Corey A Harper
2013-04-11
Tools and techniques for putting
Metadata, Context, & Narrative
On the Web
@chrpr–[Slideshare]
2. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 2
The Web Becomes Semantic
●
Originally:
●
Metadata about “Web” things (documents)
●
Eventually:
●
Metadata about all sorts of things
●
And about relationships between things
●
TBL’s original vision (Weaving the Web – 1999)
●
Then: Focus on Machine Reasoning
●
Scientific American Article
●
Now: Focus on things & links
●
Reasoning & Inferencing less central
3. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 3
Semantic Web Terminology
●
Resource: Any “thing”
●
Class: Abstraction of a type of thing
●
Individual: An instance of a class
●
Property: An attribute of an individual
●Statement/Triple:
●
A Resource (subject)
●
A Property (predicate / verb)
●
A Value (object) - Nodes
●
Graph: Visual Representation of statements
●
Ontology/Vocabulary: A domain specific collection
of classes and properties
5. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 5
Linked Open Data
●
Use URIs as names for things
●
Use HTTP URIs so that people can look up those
names.
●
When someone looks up a URI, provide useful
information.
●
Include links to other URIs. so that they can
discover more things.
http://www.w3.org/DesignIssues/LinkedData.html
6. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 6
Linked Data
●
Metadata as a Graph
●
Typed “things”, named by URIs
●
The relationships between those things,
also built on URIs
●
Ease of integration *across* data
sources – “merging graphs”
7. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 7
Publish Publish Publish!
http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
9. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 9
DBpedia
Structured Wikipedia Data
●
Partial basis in data entry conventions
●
InfoBox’s, and InfoBox Templates
●
Metadata Entry Format
●
Partial source of Ontology
●
Class Structure
●
Vocabulary Design
10. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 10
DBpedia
●
3.4 Million “things” described
●
Ontology based on “infoboxes”
●
1.5 million things classified
●
http://wiki.dbpedia.org/Ontology
●
Approx. 50,000 “Properties”
●
Approx. 1,200 defined in ontology
14. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 14
http://thinkbase.cs.auckland.ac.nz/start.jsp
15. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 15
Google Knowledge Graph
16. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 16
RelFinder
http://www.visualdataweb.org/relfinder.php
17. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 17
RelFinder
http://www.visualdataweb.org/relfinder.php
18. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 18
Linked Jazz
http://linkedjazz.org/network/
19. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 19
Social Networks of Archival Context
Image From: http://inkdroid.org/journal/2010/08/12/archival-context-on-the-web/
20. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 20
Linking Lives – Screenshots from P. Johnston
21. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 21
Linking Lives – Screenshots from P. Johnston
24. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 24
Pundit
Imagefrom:http://summit2013.lodlam.net/2013/04/03/pundit/
25. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 25
Annotations
●
The Pundit (thepund.it)
●
Hypothesis: Peer Review for the Web (hypthos.is)
●
Open Annotation Collaboration
●
Distributed bibliographic control environment
●
Focus on identification over description
“In short, by treating values as non-literal resources and
assigning URIs to them we give ourselves (and others) the
hooks on which to hang further descriptions.” - Andy Powell
26. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 26
Context
Narrative
Story telling
The Library's story,
and the Archives story,
but also…
27. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 27
Users’ stories
Scholars' stories
Adding context through recombinant metadata
28. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 28
Scholars & Users Stories – Tim Sherratt
(@wragge)
Also: http://discontents.com.au/a-map-and-some-pins-open-data-and-unlimited-horizons/
29. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 29
Linked Data Based UI Design
For Boutique Collections
33. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 33
FuzzyWuzzy & SeatGeek!
FuzzyWuzzy–AwesomeLibraryfromSeatGeek
https://github.com/seatgeek/fuzzywuzzy
http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python
34. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 34
Slide courtesy of Doug Oard
University of Maryland
35. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 35
Reconciliation & NER
http://freeyourmetadata.org/
Watch for their book
http://www.amazon.com/Linked-Data-Libraries-Archives-Museums/dp/1856049647/
39. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 40
And onward...
• Rethinking UI Design
Aggregate data from more sources
Provide more context
• Building proofs of concept
• Improving Search Engine Optimization
(JSON-LD, Schema.org, RDFa, &c.)
• Use cases!
• Experimentation!
40. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 41
Thanks!
corey.harper@nyu.edu
212.998.2479
@chrpr