SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Supporting Linked Data Production
  for Cultural Heritage institutes:
   The Amsterdam Museum Case Study

   Victor de Boer, Jan Wielemaker, Judith van
    Gent, Michiel Hildebrand, Antoine Isaac,
    Jacco van Ossenbruggen, Guus Schreiber




                     EuropeanaConnect
Aggregator




             2
Europeana

 “Europeana enables people to explore the digital
 resources of Europe's museums, libraries, archives and
 audio-visual collections.’’
                                      www.europeana.eu




From portal…                              …to data aggregator.



                                                                 3
data.europeana.eu

2.4 Million objects exposed as
Linked Data.

8 aggregators, 200
institutions, 15 countries

Europeana Semantic Elements
converted to RDF Europeana
Data Model (EDM)


                                     4
Linked data-ify




Aggregate and convert




                        5
Linked data-ify




                        Convert to
                        Linked data
                        Mapped to EDM
Aggregate and convert




                                  6
Methodology and tool stack
• Focus on transparency and interactivity
  – Reproducability
  – Both in conversion and alignment


• Maintain detail and complexity of original data

• Interoperability through schema mapping


                                                7
Methods                                                   Tools
                                                             ClioPatria
1. XML ingestion (OAI)

2. Direct transformation to ‘crude’ RDF
                                                             XMLRDF
3. Interactive RDF restructuring

4. Create a metadata mapping schema

5. Align vocabularies with external sources                  Amalgame

6. Publish as Linked Data

                                                                          8
                      cliopatria.swi-prolog.org powered by
Case study:
             Amsterdam Museum
• Formerly Amsterdam Historic
  Museum
  – “The rich collection of works of art,
    objects and archaeological finds brings
    to life the fortunes of Amsterdammers
    of days gone by and today.”

• In March 2010 published their whole
  collection online
  – 73.000 objects
  – CC license

                                              9
Methods                                    Tools
                                              ClioPatria
1. XML ingestion

2. Direct transformation to ‘crude’ RDF
                                              XMLRDF

3. Interactive RDF restructuring

4. Create a metadata mapping schema
                                              Amalgame
5. Align vocabularies with external sources

6. Publish as Linked Data

                                                           10
Ingested AM metadata
                                                    <record priref="10541“ >
• Adlib database XML API                               <acquisition.date>1997</acquisition.date>
                                                       <dimension>
                                                         <dimension.type>hoogte</dimension.type>
                                                         <dimension.unit>cm</dimension.unit>
• Object metadata                                        <dimension.value>6</dimension.value>
                                                       </dimension>
       • 73.000 objects, 256MB                      …
       • Nested XML                                 </record>



• Concept Thesaurus                           <record priref="28024“ >
                                                 <term>Kalverstraat 124</term>
       • 27.000, 9MB                             <broader_term>Kalverstraat</broader_term>
                                                 <term.type>GEOKEYW </term.type>
       • Different types (geo,motif, event)   </record>


• Person Authority File                                  <record priref="6" >
       • 67.000 persons, 10MB                               <biography>boekverkoper en uitgever van
       • Consolidated from object metadata fields        cartografie</biography>
                                                            <birth.date.start>1659</birth.date.start>
       • Creators, annotators, reproduction                 <death.date.start>1733</death.date.start>
         creators, institutions,                            <name>Aa, Pieter van der</name>
                                                            <nationality>Nederlands</nationality>
                                                            <use>Aa, Pieter van der (I)</use>
                                                          </record>
                                                                                                  11
XMLRDF (1)
                      Syntactic RDF conversion
<record priref="19319 “ >
   <date>1651</date>
   <maker>Rembrandt (1606-1669)</maker>
   <object.type>etsplaat</object.type>              priref          “19319 ”
…                                                            date
</record>
                                                                    “1651”
                                                  am:Record
                                                    _:bn1              “Rembrandt (1606-1669)”

                                                      object.type      “etsplaat”



    XML-Element is attributes + content
         Map to RDF blank-node + attributes
               Attributes → Literals (+xml:lang)
               Content
                     If plain → Literal (+xml:lang)
                     Otherwise → RDF blank node (recursive)                          12
ClioPatria:
Intermediate Statistics




                          13
Methods                                    Tools
                                              ClioPatria
1. XML ingestion

2. Direct transformation to ‘crude’ RDF
                                              XMLRDF

3. Interactive RDF restructuring

4. Create a metadata mapping schema
                                              Amalgame
5. Align vocabularies with external sources

6. Publish as Linked Data

                                                           14
XMLRDF Graph rewrite rule
                language

   Declarative committed-choice language
    based on CHR (Constraint Handling
    Rules)
       Triples <=> Guard, NewTriples
       Keep  Triples <=> Guard, NewTriples




                                               15
Example
AM rewriting rules examples




                              17
RDF rewriting conversion
<record priref="19319 “ >
   <date>1651</date>
   <maker>Rembrandt (1606-1669)</maker>
   <object.type>etsplaat</object.type>                          priref                “19319 ”
…                                                                        date
</record>
                                                                                      “1651”
                                                             am:Record
                                                               _:bn1                         “Rembrandt (1606-1669)”

                                                                  object.type                “etsplaat”

                            “19319 ”
                  am:date          “1651”
                                                                                     “1234”
                                                                 am:priref
        am:Record                                                        am:birthdate
                             am:maker                    am:Person
      am:proxy-19319                                                            “1606”
                                                         am:p-1234
                                                                    rda:name             “Rembrandt”


                                          skos:Concept
                                           am:etsplaat
                                                                                “etsplaat”
                                                               skos:prefLabel                               18
Some statistics
               Amsterdam Museum
                   Rules Resources       Predicaes Triples
                                         used
Object metadata        58        73,447          100 5,700,371
                               (Proxies)
Thesaurus              23        28,000           13   601,819
                            (Concepts)
Person Auth List        2        66,966           21   301,143
                              (Persons)

                   558,161 Proxy-Concept relations
                    80,432 Proxy-Person relations
                    243,532 Proxy-Proxy relations           19
Methods                                    Tools
                                              ClioPatria
1. XML ingestion

2. Direct transformation to ‘crude’ RDF
                                              XMLRDF

3. Interactive RDF restructuring

4. Create a metadata mapping schema
                                              Amalgame
5. Align vocabularies with external sources

6. Publish as Linked Data

                                                           20
Mapping to EDM

                   dcterms:subject




                 rdfs:subPropertyOf




                 am:contentPersonName
am:proxy_22093                          “Job Cohen”



                                                      21
Europeana Data Model (EDM)

• Dublin Core for metadata representation
   – creator, date, title etc.

• SKOS for vocabularies
   – preferredLabel, hasBroader, etc.

• RDA Group 2 elements for persons
   – dateOfBirth, name etc.

• OAI-ORE to allow for aggregations etc.

• Some EDM-specific properties
   – edm:wasPresentAt, …

                                            22
Methods                                    Tools
                                              ClioPatria
1. XML ingestion

2. Direct transformation to ‘crude’ RDF
                                              XMLRDF

3. Interactive RDF restructuring

4. Create a metadata mapping schema
                                              Amalgame
5. Align vocabularies with external sources

6. Publish as Linked Data

                                                           23
Amalgame Alignment Platform




                                      24
      semanticweb.cs.vu.nl/amalgame
AM Alignments
• 3500+ links put in RDF
  – 143 places linked to
    GeoNames
  – 1076 persons linked to
    ULAN (VIAF)
  – 34 persons linked to
    DBPedia
  – 2498 concepts
    AATNed.


                                25
Methods                                    Tools
                                              ClioPatria
1. XML ingestion

2. Direct transformation to ‘crude’ RDF
                                              XMLRDF

3. Interactive RDF restructuring

4. Create a metadata mapping schema
                                              Amalgame
5. Align vocabularies with external sources

6. Publish as Linked Data

                                                           26
Architecture
 SPARQL-app                        Browser


                                                Purl.org
                                                redirect


   SPARQL                       Web interface

                 HTTP server


RDF(s) storage                      Logic


                   Prolog


                                                               27
                               http://semanticweb.cs.vu.nl/europeana/
Content negotiation
         @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
         @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
         @prefix ore: <http://www.openarchives.org/ore/terms/> .
         @prefix ens: <http://www.europeana.eu/schemas/edm/> .
         @prefix ahm: <http://purl.org/collections/nl/am/>


         ahm:proxy-66970
                      a ore:Proxy ;
                      ahm:title "Zegelstempel Felix Meritis"@nl ;
                      ahm:material ahm:t-12463 ,
                                        ahm:t-5447 ;
                      ahm:objectCategory ahm:t-5504 ;
                      ahm:objectName ahm:t-13817 ,
                                           ahm:t-8489 ;
                      ahm:objectNumber "KA 7653.1" ;
                      ahm:priref "66970" .

         ahm:proxy-66972
                      a ore:Proxy ;
                      ahm:acquisitionDate "0000" ;
                      ahm:title "Zegelstempel mogelijk van
         familiewapen"@nl .




                                                               28
“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
Wrapping up
                          Methodology
•   Stay as close as possible to original (XML) metadata
•   Separate syntactic transformation, semantic interpretations
•   Interactive workflow, simple steps
•   Use rdf schema to map to interoperability layer
•   Keep provenance, reproducability

                                Tools
•   XMLRDF Realised clean workflow for RDF production.
•   Amalgame: Interactive and transparent vocablary alignment
•   ClioPatria Semantic server: statistics at any moment + Full
    expressivity of Some Prolog


                                                                  31
Issues
• Validate with real collection managers
  – Making good rules is sometimes hard
  – Graphical tools can help

• Integrate in normal collection workflow (tools)
  – LD as another view on the data
  – Live updates

• RDFS reasoning needed to have interoperability
                                                    32
http://semanticweb.cs.vu.nl/lod/am/
              v.de.boer@vu.nl

           amsterdammuseum.nl




                  ?
   ClioPatria: the SWI-Prolog RDF toolkit
(includes XMLRDF and Amalgame packages)
       http://cliopatria.swi-prolog.org

Weitere ähnliche Inhalte

Was ist angesagt?

Linking Roman Coins: CAA2012
Linking Roman Coins: CAA2012Linking Roman Coins: CAA2012
Linking Roman Coins: CAA2012
ewg118
 

Was ist angesagt? (7)

HyperGraphQL
HyperGraphQLHyperGraphQL
HyperGraphQL
 
History and Background of the USEWOD Data Challenge
History and Background of the  USEWOD Data ChallengeHistory and Background of the  USEWOD Data Challenge
History and Background of the USEWOD Data Challenge
 
Linking Roman Coins: CAA2012
Linking Roman Coins: CAA2012Linking Roman Coins: CAA2012
Linking Roman Coins: CAA2012
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
 
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
 
Linking UK Government Data, John Sheridan
Linking UK Government Data, John SheridanLinking UK Government Data, John Sheridan
Linking UK Government Data, John Sheridan
 
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
 

Ähnlich wie Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritage institutes: The Amsterdam Museum Case Study

OpenAIRE schirrwagen
OpenAIRE schirrwagenOpenAIRE schirrwagen
OpenAIRE schirrwagen
Talis Consulting
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Cory Lampert
 
Semantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business IntelligenceSemantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business Intelligence
Marin Dimitrov
 

Ähnlich wie Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritage institutes: The Amsterdam Museum Case Study (20)

Linked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam MuseumLinked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam Museum
 
Amsterdam museum as five star linked data
Amsterdam museum as five star linked dataAmsterdam museum as five star linked data
Amsterdam museum as five star linked data
 
Amsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open DataAmsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open Data
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
Radically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebRadically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the Web
 
KM Lecture 7 LOD
KM Lecture 7 LODKM Lecture 7 LOD
KM Lecture 7 LOD
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
 
A Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic WebA Framework for Improved Access to Museum Databases in the Semantic Web
A Framework for Improved Access to Museum Databases in the Semantic Web
 
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
 
OpenAIRE schirrwagen
OpenAIRE schirrwagenOpenAIRE schirrwagen
OpenAIRE schirrwagen
 
Integration of collection data - A case study from the Oxford Museums and Lib...
Integration of collection data - A case study from the Oxford Museums and Lib...Integration of collection data - A case study from the Oxford Museums and Lib...
Integration of collection data - A case study from the Oxford Museums and Lib...
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
Semantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business IntelligenceSemantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business Intelligence
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
Integrating Heterogeneous Data Sources in the Web of Data
Integrating Heterogeneous Data Sources in the Web of DataIntegrating Heterogeneous Data Sources in the Web of Data
Integrating Heterogeneous Data Sources in the Web of Data
 
LibreCat::Catmandu
LibreCat::CatmanduLibreCat::Catmandu
LibreCat::Catmandu
 

Mehr von Victor de Boer

Mehr von Victor de Boer (20)

One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic Web
 
Linked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media ArchivesLinked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media Archives
 
The Benefits of Linking Metadata for Internal and External users of an Audiov...
The Benefits of Linking Metadata for Internal and External users of an Audiov...The Benefits of Linking Metadata for Internal and External users of an Audiov...
The Benefits of Linking Metadata for Internal and External users of an Audiov...
 
UX Challenges of Information Organisation: Assessment of Language Impairment ...
UX Challenges of Information Organisation: Assessment of Language Impairment ...UX Challenges of Information Organisation: Assessment of Language Impairment ...
UX Challenges of Information Organisation: Assessment of Language Impairment ...
 
Interactive Dance Choreography Assistance presentation for ACE entertainment ...
Interactive Dance Choreography Assistance presentation for ACE entertainment ...Interactive Dance Choreography Assistance presentation for ACE entertainment ...
Interactive Dance Choreography Assistance presentation for ACE entertainment ...
 
Fahad Ali's slides for Machine to-machine communication in rural conditions ...
Fahad Ali's slides for Machine to-machine communication in rural conditions  ...Fahad Ali's slides for Machine to-machine communication in rural conditions  ...
Fahad Ali's slides for Machine to-machine communication in rural conditions ...
 
Linking African Traditional Medicine Knowledge - by Gossa Lo
Linking African Traditional Medicine Knowledge - by Gossa LoLinking African Traditional Medicine Knowledge - by Gossa Lo
Linking African Traditional Medicine Knowledge - by Gossa Lo
 
Enriching Media Collections for Event-based Exploration
Enriching Media Collections for Event-based ExplorationEnriching Media Collections for Event-based Exploration
Enriching Media Collections for Event-based Exploration
 
New Life for Old Media (NEM presentation)
New Life for Old Media  (NEM presentation)New Life for Old Media  (NEM presentation)
New Life for Old Media (NEM presentation)
 
User-centered Data Science for Digital Humanities
User-centered Data Science for Digital HumanitiesUser-centered Data Science for Digital Humanities
User-centered Data Science for Digital Humanities
 
Linked Data for Audiovisual Archives (Guest lecture at NISV)
Linked Data for Audiovisual Archives (Guest lecture at NISV)Linked Data for Audiovisual Archives (Guest lecture at NISV)
Linked Data for Audiovisual Archives (Guest lecture at NISV)
 
Semantic Technology for Development: Semantic Web without the Web?
Semantic Technology for Development: Semantic Web without the Web?Semantic Technology for Development: Semantic Web without the Web?
Semantic Technology for Development: Semantic Web without the Web?
 
DIVE+ and Events at EVENTS2017
DIVE+ and Events at EVENTS2017DIVE+ and Events at EVENTS2017
DIVE+ and Events at EVENTS2017
 
About Cultuurlink
About CultuurlinkAbout Cultuurlink
About Cultuurlink
 
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
 
Kasadaka and ICT4D at VU
Kasadaka and ICT4D at VUKasadaka and ICT4D at VU
Kasadaka and ICT4D at VU
 
VU ICT4D symposium 2017 Francis Dittoh Mr. Meteo
VU ICT4D symposium 2017 Francis Dittoh  Mr. MeteoVU ICT4D symposium 2017 Francis Dittoh  Mr. Meteo
VU ICT4D symposium 2017 Francis Dittoh Mr. Meteo
 
VU ICT4D symposium 2017 Chris van Aart
VU ICT4D symposium 2017 Chris van AartVU ICT4D symposium 2017 Chris van Aart
VU ICT4D symposium 2017 Chris van Aart
 
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
 
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
 

KĂźrzlich hochgeladen

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

KĂźrzlich hochgeladen (20)

Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 

Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritage institutes: The Amsterdam Museum Case Study

  • 1. Supporting Linked Data Production for Cultural Heritage institutes: The Amsterdam Museum Case Study Victor de Boer, Jan Wielemaker, Judith van Gent, Michiel Hildebrand, Antoine Isaac, Jacco van Ossenbruggen, Guus Schreiber EuropeanaConnect
  • 3. Europeana “Europeana enables people to explore the digital resources of Europe's museums, libraries, archives and audio-visual collections.’’ www.europeana.eu From portal… …to data aggregator. 3
  • 4. data.europeana.eu 2.4 Million objects exposed as Linked Data. 8 aggregators, 200 institutions, 15 countries Europeana Semantic Elements converted to RDF Europeana Data Model (EDM) 4
  • 6. Linked data-ify Convert to Linked data Mapped to EDM Aggregate and convert 6
  • 7. Methodology and tool stack • Focus on transparency and interactivity – Reproducability – Both in conversion and alignment • Maintain detail and complexity of original data • Interoperability through schema mapping 7
  • 8. Methods Tools ClioPatria 1. XML ingestion (OAI) 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema 5. Align vocabularies with external sources Amalgame 6. Publish as Linked Data 8 cliopatria.swi-prolog.org powered by
  • 9. Case study: Amsterdam Museum • Formerly Amsterdam Historic Museum – “The rich collection of works of art, objects and archaeological finds brings to life the fortunes of Amsterdammers of days gone by and today.” • In March 2010 published their whole collection online – 73.000 objects – CC license 9
  • 10. Methods Tools ClioPatria 1. XML ingestion 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema Amalgame 5. Align vocabularies with external sources 6. Publish as Linked Data 10
  • 11. Ingested AM metadata <record priref="10541“ > • Adlib database XML API <acquisition.date>1997</acquisition.date> <dimension> <dimension.type>hoogte</dimension.type> <dimension.unit>cm</dimension.unit> • Object metadata <dimension.value>6</dimension.value> </dimension> • 73.000 objects, 256MB … • Nested XML </record> • Concept Thesaurus <record priref="28024“ > <term>Kalverstraat 124</term> • 27.000, 9MB <broader_term>Kalverstraat</broader_term> <term.type>GEOKEYW </term.type> • Different types (geo,motif, event) </record> • Person Authority File <record priref="6" > • 67.000 persons, 10MB <biography>boekverkoper en uitgever van • Consolidated from object metadata fields cartografie</biography> <birth.date.start>1659</birth.date.start> • Creators, annotators, reproduction <death.date.start>1733</death.date.start> creators, institutions, <name>Aa, Pieter van der</name> <nationality>Nederlands</nationality> <use>Aa, Pieter van der (I)</use> </record> 11
  • 12. XMLRDF (1) Syntactic RDF conversion <record priref="19319 “ > <date>1651</date> <maker>Rembrandt (1606-1669)</maker> <object.type>etsplaat</object.type> priref “19319 ” … date </record> “1651” am:Record _:bn1 “Rembrandt (1606-1669)” object.type “etsplaat”  XML-Element is attributes + content  Map to RDF blank-node + attributes Attributes → Literals (+xml:lang) Content  If plain → Literal (+xml:lang)  Otherwise → RDF blank node (recursive) 12
  • 14. Methods Tools ClioPatria 1. XML ingestion 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema Amalgame 5. Align vocabularies with external sources 6. Publish as Linked Data 14
  • 15. XMLRDF Graph rewrite rule language  Declarative committed-choice language based on CHR (Constraint Handling Rules)  Triples <=> Guard, NewTriples  Keep Triples <=> Guard, NewTriples 15
  • 17. AM rewriting rules examples 17
  • 18. RDF rewriting conversion <record priref="19319 “ > <date>1651</date> <maker>Rembrandt (1606-1669)</maker> <object.type>etsplaat</object.type> priref “19319 ” … date </record> “1651” am:Record _:bn1 “Rembrandt (1606-1669)” object.type “etsplaat” “19319 ” am:date “1651” “1234” am:priref am:Record am:birthdate am:maker am:Person am:proxy-19319 “1606” am:p-1234 rda:name “Rembrandt” skos:Concept am:etsplaat “etsplaat” skos:prefLabel 18
  • 19. Some statistics Amsterdam Museum Rules Resources Predicaes Triples used Object metadata 58 73,447 100 5,700,371 (Proxies) Thesaurus 23 28,000 13 601,819 (Concepts) Person Auth List 2 66,966 21 301,143 (Persons) 558,161 Proxy-Concept relations 80,432 Proxy-Person relations 243,532 Proxy-Proxy relations 19
  • 20. Methods Tools ClioPatria 1. XML ingestion 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema Amalgame 5. Align vocabularies with external sources 6. Publish as Linked Data 20
  • 21. Mapping to EDM dcterms:subject rdfs:subPropertyOf am:contentPersonName am:proxy_22093 “Job Cohen” 21
  • 22. Europeana Data Model (EDM) • Dublin Core for metadata representation – creator, date, title etc. • SKOS for vocabularies – preferredLabel, hasBroader, etc. • RDA Group 2 elements for persons – dateOfBirth, name etc. • OAI-ORE to allow for aggregations etc. • Some EDM-specific properties – edm:wasPresentAt, … 22
  • 23. Methods Tools ClioPatria 1. XML ingestion 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema Amalgame 5. Align vocabularies with external sources 6. Publish as Linked Data 23
  • 24. Amalgame Alignment Platform 24 semanticweb.cs.vu.nl/amalgame
  • 25. AM Alignments • 3500+ links put in RDF – 143 places linked to GeoNames – 1076 persons linked to ULAN (VIAF) – 34 persons linked to DBPedia – 2498 concepts AATNed. 25
  • 26. Methods Tools ClioPatria 1. XML ingestion 2. Direct transformation to ‘crude’ RDF XMLRDF 3. Interactive RDF restructuring 4. Create a metadata mapping schema Amalgame 5. Align vocabularies with external sources 6. Publish as Linked Data 26
  • 27. Architecture SPARQL-app Browser Purl.org redirect SPARQL Web interface HTTP server RDF(s) storage Logic Prolog 27 http://semanticweb.cs.vu.nl/europeana/
  • 28. Content negotiation @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix ore: <http://www.openarchives.org/ore/terms/> . @prefix ens: <http://www.europeana.eu/schemas/edm/> . @prefix ahm: <http://purl.org/collections/nl/am/> ahm:proxy-66970 a ore:Proxy ; ahm:title "Zegelstempel Felix Meritis"@nl ; ahm:material ahm:t-12463 , ahm:t-5447 ; ahm:objectCategory ahm:t-5504 ; ahm:objectName ahm:t-13817 , ahm:t-8489 ; ahm:objectNumber "KA 7653.1" ; ahm:priref "66970" . ahm:proxy-66972 a ore:Proxy ; ahm:acquisitionDate "0000" ; ahm:title "Zegelstempel mogelijk van familiewapen"@nl . 28
  • 29. “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 30.
  • 31. Wrapping up Methodology • Stay as close as possible to original (XML) metadata • Separate syntactic transformation, semantic interpretations • Interactive workflow, simple steps • Use rdf schema to map to interoperability layer • Keep provenance, reproducability Tools • XMLRDF Realised clean workflow for RDF production. • Amalgame: Interactive and transparent vocablary alignment • ClioPatria Semantic server: statistics at any moment + Full expressivity of Some Prolog 31
  • 32. Issues • Validate with real collection managers – Making good rules is sometimes hard – Graphical tools can help • Integrate in normal collection workflow (tools) – LD as another view on the data – Live updates • RDFS reasoning needed to have interoperability 32
  • 33. http://semanticweb.cs.vu.nl/lod/am/ v.de.boer@vu.nl amsterdammuseum.nl ? ClioPatria: the SWI-Prolog RDF toolkit (includes XMLRDF and Amalgame packages) http://cliopatria.swi-prolog.org

Hinweis der Redaktion

  1. Rather than having Linked Data ingestion being done automatically by large aggregators,we present a methodology that is both transparent and interactive. The methodologycovers data ingestion, conversion, alignment and Linked Data publication. It ishighly modular with clearly recognizable data transformation steps, which can be evaluatedand adapted based on these evaluations. This design allows the institute’s collectionmanagers, who are most knowledgeable about their own data, to perform or overseethe process themselves. We describe a stack of tools that allow collection managers toproduce a Linked Data version of their metadata that maintains the richness of the originaldata including the institute-specific metadata classes and properties. By providinga mapping to a common schema interoperability is achieved.Flickr:  givingnot@rocketmail.com, aoppelaar,  hhesterr,  Grufnik,  moria, Banjaxx, Paradasos
  2. 2.4 million texts, images, videos and sounds gathered by Europeana. These objects come from data providers who have reacted early and positively to Europeana&apos;s initiative of promoting more open data and new data exchange agreements. These collections come from 8 direct Europeana providers encompassing over 200 cultural institutions from 15 countries. 
  3. Rather than having Linked Data ingestion being done automatically by large aggregators,we present a methodology that is both transparent and interactive. The methodologycovers data ingestion, conversion, alignment and Linked Data publication. It ishighly modular with clearly recognizable data transformation steps, which can be evaluatedand adapted based on these evaluations. This design allows the institute’s collectionmanagers, who are most knowledgeable about their own data, to perform or overseethe process themselves. We describe a stack of tools that allow collection managers toproduce a Linked Data version of their metadata that maintains the richness of the originaldata including the institute-specific metadata classes and properties. By providinga mapping to a common schema interoperability is achieved.Flickr:  givingnot@rocketmail.com, aoppelaar,  hhesterr,  Grufnik,  moria, Banjaxx, Paradasos
  4. Rather than having Linked Data ingestion being done automatically by large aggregators,we present a methodology that is both transparent and interactive. The methodologycovers data ingestion, conversion, alignment and Linked Data publication. It ishighly modular with clearly recognizable data transformation steps, which can be evaluatedand adapted based on these evaluations. This design allows the institute’s collectionmanagers, who are most knowledgeable about their own data, to perform or overseethe process themselves. We describe a stack of tools that allow collection managers toproduce a Linked Data version of their metadata that maintains the richness of the originaldata including the institute-specific metadata classes and properties. By providinga mapping to a common schema interoperability is achieved.Flickr:  givingnot@rocketmail.com, aoppelaar,  hhesterr,  Grufnik,  moria, Banjaxx, Paradasos
  5. - Not completely straightforward xml (nestedness)
  6. XMLRDF tool: clean up, link to resources etc.
  7. XMLRDF tool: clean up, link to resources etc.58 XMLRDF rewrite rules23 rewriting rules2 rules
  8. Apps for AmsterdamPlaatsen van Betekenis