SlideShare a Scribd company logo
1 of 20
Download to read offline
Data Culture 
with 
Culture Data 
Dr. Barry Norton, 
Development Manager, ResearchSpace* 
* Funded by the Andrew W. Mellon Foundation 
* Hosted by the Curatorial Directorate, British Museum
Culture Data 
collection.britishmuseum.org 
>2M objects described 
RDF & public SPARQL 
researchspace.org 
musicbrainz.org 
>1M albums described 
RDF & public SPARQL 
linkedbrainz.org
Not the Starting Point 
1. Use URIs as names for 
things 
2. Use HTTP URIs so that 
people can look up those 
names. 
3. When someone looks up 
a URI, provide useful 
information, using the 
standards (RDF*, SPARQL) 
4. Include links to other URIs. 
so that they can discover 
more things. 
Tim Berners-Lee 
Date: 2006-07-27
Not the Starting Point 
1. Use URIs as names for 
things 
2. Use HTTP URIs so that 
people can look up those 
names. 
3. When someone looks up 
a URI, provide useful 
information, using the 
standards (RDF*, SPARQL) 
4. Include links to other URIs. 
so that they can discover 
more things. 
Tim Berners-Lee 
Why? Date: 2006-07-27
Why not CSVs? 
• There are many of them 
• They’re easy to import into a 
spreadsheet (if they ever left) 
• They’re line-by-line 
processable 
– resurgence of GNU tools 
– powerful scripting languages 
– chunkable for Hadoop 
• W3C CSV on the Web
CSV Simple Music 
Artist Album 
The Beatles Rubber Soul 
The Beatles Revolver 
The Beatles Sgt. Pepper’s Lonely Hearts Club Band 
The Rolling Stones Their Satanic Majesties Request 
The Rolling Stones Beggars Banquet 
The Rolling Stones Let It Bleed
CSV Simple 
Denormalisation 
Artist Album Track 
The Beatles Rubber Soul Drive My Car 
The Beatles Rubber Soul Norwegian Wood 
The Beatles Rubber Soul … 
The Beatles Revolver … 
The Beatles Sgt. Pepper’s … … 
The Rolling 
Their Satanic Majesties … … 
Stones 
The Rolling 
Stones 
Beggars Banquet … 
• Redundant but feasible because this is a 
hierarchy
CSV Denormalisation 
Artist Member Album Track 
The Beatles John Rubber Soul ? 
The Beatles Paul Rubber Soul ? 
The Beatles George Rubber Soul ? 
The Beatles Ringo Rubber Soul ? 
The Beatles Sgt. Pepper’s … ? 
The Rolling 
Their Satanic 
… 
Stones 
Majesties … 
The Rolling 
Stones 
Beggars Banquet … 
• Orthogonal (arguably) hierarchies – either 
deeply redundant or very sparse…
Spreadsheets 
• But this is just a relational database 
without the rigour…
Why not RDBMS? 
• Not as unlikely as you might 
think
RDBMS on Web? 
• Tricky to install and update, but feasible 
– MusicBrainz, for instance, distributes the 
postgresql index, but also a VM and 
postgresql replication 
• Actually the basis of most CSV 
– project and denormalise 
– equally feasible with SPARQL (see later) 
• Difficult to extend and integrate new 
data…
MusicBrainz 
‘Advanced Relationships’ 
• link_types 
(rows) are 
extensible 
binary 
relationships 
• Each pair of 
entities (e.g. 
artist_artist) 
are paired and 
then links typed
Why not XML/SOAP? 
• XML is the ‘data equivalent’ 
to HTML Web documents 
• XML is communicable via the 
Web protocol, HTTP, in 
SOAP 
• So didn’t we solve all this in 
the 90s?
HTML versus XML 
HTML: XML (example): 
<html> 
<body> 
<h1>title</h1> 
<p>Some 
text 
referring 
to:</p> 
<img 
src="..."> 
<ul> 
<li>one</li> 
<li>two</li> 
<li>three></li> 
</ul> 
</body> 
</html> 
<artists> 
<artist 
name="The 
Beatles"> 
<member>John</member> 
<member>Paul</member> 
... 
<album 
name="Rubber 
Soul"> 
<track>Drive 
My 
Car</track> 
<track>Norwegian 
Wood 
... 
</album> 
<album 
name="Revolver"> 
... 
</artist> 
<artist 
name="The 
Rolling 
Stones"> 
...
HTTP versus Services 
HTTP: SOAP: 
GET 
/artist/b10bbbfc-­‐ 
cf9e-­‐42e0-­‐be17-­‐ 
e2c3e1d2600d 
HTTP/1.1 
Host: 
musicbrainz.org 
POST 
/ws 
Host: 
www.example.com
Why not JSON/REST? 
• JSON removes a lot of the 
ambiguous structure of XML 
• JSON is a lot more flexible in 
terms of schema 
(expectations) 
• REST argues that resources 
have URIs and all 
representations be resolved 
there
JSON

More Related Content

What's hot

Aplicații Web Semantice - Descriere Proiect
Aplicații Web Semantice - Descriere ProiectAplicații Web Semantice - Descriere Proiect
Aplicații Web Semantice - Descriere Proiect
Vlad Posea
 

What's hot (7)

Linked Data Challenge and Opportunity
Linked Data Challenge and OpportunityLinked Data Challenge and Opportunity
Linked Data Challenge and Opportunity
 
when the link makes sense
when the link makes sensewhen the link makes sense
when the link makes sense
 
Building Blocks for the Future: Making Controlled Vocabularies Available for ...
Building Blocks for the Future: Making Controlled Vocabularies Available for ...Building Blocks for the Future: Making Controlled Vocabularies Available for ...
Building Blocks for the Future: Making Controlled Vocabularies Available for ...
 
Best Genealogy Websites of 2012 - Part 1
Best Genealogy Websites of 2012 - Part 1Best Genealogy Websites of 2012 - Part 1
Best Genealogy Websites of 2012 - Part 1
 
Aplicații Web Semantice - Descriere Proiect
Aplicații Web Semantice - Descriere ProiectAplicații Web Semantice - Descriere Proiect
Aplicații Web Semantice - Descriere Proiect
 
DHWI Linked Open Data - Show and Tell
DHWI Linked Open Data - Show and TellDHWI Linked Open Data - Show and Tell
DHWI Linked Open Data - Show and Tell
 
Why schema.org
Why schema.orgWhy schema.org
Why schema.org
 

Viewers also liked (8)

ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Wib 12 13 newsletter
Wib 12 13 newsletterWib 12 13 newsletter
Wib 12 13 newsletter
 
Webinar 2013 12-10-presentazione_jm
Webinar 2013 12-10-presentazione_jmWebinar 2013 12-10-presentazione_jm
Webinar 2013 12-10-presentazione_jm
 
Mon domingue key_introduction to semantic
Mon domingue key_introduction to semanticMon domingue key_introduction to semantic
Mon domingue key_introduction to semantic
 
Thu bernstein key_warp_speed
Thu bernstein key_warp_speedThu bernstein key_warp_speed
Thu bernstein key_warp_speed
 
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projectFIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
 
TEORIA DE LA CIENCIA
TEORIA DE LA CIENCIATEORIA DE LA CIENCIA
TEORIA DE LA CIENCIA
 
ESWC SS 2013 - Tuesday Keynote Steffen Staab: Programming the Semantic Web
ESWC SS 2013 - Tuesday Keynote Steffen Staab: Programming the Semantic WebESWC SS 2013 - Tuesday Keynote Steffen Staab: Programming the Semantic Web
ESWC SS 2013 - Tuesday Keynote Steffen Staab: Programming the Semantic Web
 

Similar to Mon norton tut_publishing01

It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011
Ross Singer
 
Linked data radical change
Linked data   radical changeLinked data   radical change
Linked data radical change
Richard Wallis
 
WTF is the Semantic Web
WTF is the Semantic WebWTF is the Semantic Web
WTF is the Semantic Web
Juan Sequeda
 

Similar to Mon norton tut_publishing01 (20)

It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and Practice
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your Library
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discovery
 
Lita national forum 2012
Lita national forum 2012Lita national forum 2012
Lita national forum 2012
 
WTF is Semantic Web?
WTF is Semantic Web?WTF is Semantic Web?
WTF is Semantic Web?
 
Metadata - Linked Data
Metadata - Linked DataMetadata - Linked Data
Metadata - Linked Data
 
Linked Data - Exposing what we have
Linked Data - Exposing what we haveLinked Data - Exposing what we have
Linked Data - Exposing what we have
 
Why libraries should embrace Linked Data
Why libraries should embrace Linked DataWhy libraries should embrace Linked Data
Why libraries should embrace Linked Data
 
Linked data radical change
Linked data   radical changeLinked data   radical change
Linked data radical change
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked Library
 
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
 
WTF is the Semantic Web
WTF is the Semantic WebWTF is the Semantic Web
WTF is the Semantic Web
 
Why link?
Why link?Why link?
Why link?
 
Why Link?
Why Link?Why Link?
Why Link?
 
Thinking of Linking
Thinking of LinkingThinking of Linking
Thinking of Linking
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Linked data - A radical change?
Linked data - A radical change?Linked data - A radical change?
Linked data - A radical change?
 
Webofdata
WebofdataWebofdata
Webofdata
 
Metadata / Linked Data
Metadata / Linked DataMetadata / Linked Data
Metadata / Linked Data
 

More from eswcsummerschool

Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
eswcsummerschool
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the school
eswcsummerschool
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage data
eswcsummerschool
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddata
eswcsummerschool
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
eswcsummerschool
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
eswcsummerschool
 
Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked data
eswcsummerschool
 
Tue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddataTue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddata
eswcsummerschool
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
eswcsummerschool
 
Wed garcia hands_on_d_bpedia preservation
Wed garcia hands_on_d_bpedia preservationWed garcia hands_on_d_bpedia preservation
Wed garcia hands_on_d_bpedia preservation
eswcsummerschool
 

More from eswcsummerschool (20)

Semantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projectSemantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student project
 
Syrtaki - ESWC SSchool 14 - Student project
Syrtaki  - ESWC SSchool 14 - Student projectSyrtaki  - ESWC SSchool 14 - Student project
Syrtaki - ESWC SSchool 14 - Student project
 
Keep fit (a bit) - ESWC SSchool 14 - Student project
Keep fit (a bit)  - ESWC SSchool 14 - Student projectKeep fit (a bit)  - ESWC SSchool 14 - Student project
Keep fit (a bit) - ESWC SSchool 14 - Student project
 
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projectArabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
 
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
Personal Tours at the British Museum  - ESWC SSchool 14 - Student projectPersonal Tours at the British Museum  - ESWC SSchool 14 - Student project
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
 
Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...
 
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
 
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
 
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the school
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage data
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddata
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked data
 
Tue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddataTue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddata
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
 
Wed garcia hands_on_d_bpedia preservation
Wed garcia hands_on_d_bpedia preservationWed garcia hands_on_d_bpedia preservation
Wed garcia hands_on_d_bpedia preservation
 

Mon norton tut_publishing01

  • 1. Data Culture with Culture Data Dr. Barry Norton, Development Manager, ResearchSpace* * Funded by the Andrew W. Mellon Foundation * Hosted by the Curatorial Directorate, British Museum
  • 2. Culture Data collection.britishmuseum.org >2M objects described RDF & public SPARQL researchspace.org musicbrainz.org >1M albums described RDF & public SPARQL linkedbrainz.org
  • 3. Not the Starting Point 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. Tim Berners-Lee Date: 2006-07-27
  • 4. Not the Starting Point 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. Tim Berners-Lee Why? Date: 2006-07-27
  • 5. Why not CSVs? • There are many of them • They’re easy to import into a spreadsheet (if they ever left) • They’re line-by-line processable – resurgence of GNU tools – powerful scripting languages – chunkable for Hadoop • W3C CSV on the Web
  • 6.
  • 7. CSV Simple Music Artist Album The Beatles Rubber Soul The Beatles Revolver The Beatles Sgt. Pepper’s Lonely Hearts Club Band The Rolling Stones Their Satanic Majesties Request The Rolling Stones Beggars Banquet The Rolling Stones Let It Bleed
  • 8. CSV Simple Denormalisation Artist Album Track The Beatles Rubber Soul Drive My Car The Beatles Rubber Soul Norwegian Wood The Beatles Rubber Soul … The Beatles Revolver … The Beatles Sgt. Pepper’s … … The Rolling Their Satanic Majesties … … Stones The Rolling Stones Beggars Banquet … • Redundant but feasible because this is a hierarchy
  • 9. CSV Denormalisation Artist Member Album Track The Beatles John Rubber Soul ? The Beatles Paul Rubber Soul ? The Beatles George Rubber Soul ? The Beatles Ringo Rubber Soul ? The Beatles Sgt. Pepper’s … ? The Rolling Their Satanic … Stones Majesties … The Rolling Stones Beggars Banquet … • Orthogonal (arguably) hierarchies – either deeply redundant or very sparse…
  • 10. Spreadsheets • But this is just a relational database without the rigour…
  • 11. Why not RDBMS? • Not as unlikely as you might think
  • 12. RDBMS on Web? • Tricky to install and update, but feasible – MusicBrainz, for instance, distributes the postgresql index, but also a VM and postgresql replication • Actually the basis of most CSV – project and denormalise – equally feasible with SPARQL (see later) • Difficult to extend and integrate new data…
  • 13. MusicBrainz ‘Advanced Relationships’ • link_types (rows) are extensible binary relationships • Each pair of entities (e.g. artist_artist) are paired and then links typed
  • 14. Why not XML/SOAP? • XML is the ‘data equivalent’ to HTML Web documents • XML is communicable via the Web protocol, HTTP, in SOAP • So didn’t we solve all this in the 90s?
  • 15.
  • 16. HTML versus XML HTML: XML (example): <html> <body> <h1>title</h1> <p>Some text referring to:</p> <img src="..."> <ul> <li>one</li> <li>two</li> <li>three></li> </ul> </body> </html> <artists> <artist name="The Beatles"> <member>John</member> <member>Paul</member> ... <album name="Rubber Soul"> <track>Drive My Car</track> <track>Norwegian Wood ... </album> <album name="Revolver"> ... </artist> <artist name="The Rolling Stones"> ...
  • 17. HTTP versus Services HTTP: SOAP: GET /artist/b10bbbfc-­‐ cf9e-­‐42e0-­‐be17-­‐ e2c3e1d2600d HTTP/1.1 Host: musicbrainz.org POST /ws Host: www.example.com
  • 18. Why not JSON/REST? • JSON removes a lot of the ambiguous structure of XML • JSON is a lot more flexible in terms of schema (expectations) • REST argues that resources have URIs and all representations be resolved there
  • 19.
  • 20. JSON