Services for Digital Cultural Heritage
         Hennie Brugman
         Technical coordinator CATCHPlus
         Max-Planck-Institute for Psycholinguistics
         Netherlands Institute for Sound and Vision




BigGrid/CLARIN Infrastructure Landscape Workshop - March 8, 2010
Overview
     • CATCH and CATCHPlus
     • CATCHPlus and infrastructure for
       Digital Cultural Heritage
     • Case: Vocabulary and Alignment
       Service
     • Concluding remarks
CATCH & CATCHPlus
     •   CATCH research program by NWO (14 projects)
     •   CATCHPlus valorisation project
          – 8 subprojects at large CH institutions
              • Deliver (re)usable tools and services
          – Connected by common services concerning
              • terminology
              • annotations
              • metadata (collection catalogs)
              • content
     •   CATCHPlus project bureau hosted by Netherlands Institute for
         Sound and Vision
     •   www.catchplus.nl
CATCHPlus and infrastructure for digital cultural heritage
CATCHPlus service landscape
     [Diagram built up over slides 5 to 10; only its text labels survive]
     • Slide 5: REST services; Annotations; Vocabularies; OAI-PMH data providers;
       several Content and Catalog (metadata) boxes
     • Slide 6 adds: Index; "harvesting"; Persistent Identifier services;
       "resolve"; "create, manage, search"
     • Slide 7 adds: Workspace services; text services; recomm. services;
       handwriting services; speech services; music services
     • Slide 8 adds: User Profile Repository; Identity services; "user id"
     • Slide 9 adds the label: Status
     • Slide 10 adds the labels: CLARIN (twice); NED!; EPIC;
       "Potentially of wider interest"
Case: Vocabulary and Alignment Service




VAS aims
     • Standard format and access methods
          – SKOS, SKOS-based REST API
     • Web publication of vocabularies
          – As searchable and browsable dataset → REST API
          – As Linked Data
          – Usable for sustainable references to concepts → PIDs
     • Improve semantic interoperability by supporting
       alignments
     • Centralised arrangements for licensing
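
As a rough illustration of the aims above, the sketch below serialises one SKOS concept with an alignment link as Turtle. The skos: properties used (skos:Concept, skos:prefLabel, skos:exactMatch) are standard SKOS; the example.org URIs, the label, and the helper concept_to_turtle are assumptions made for illustration and are not taken from VAS or its API.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"


def concept_to_turtle(uri, pref_label, lang, exact_matches=()):
    """Serialise one SKOS concept as a Turtle snippet (hypothetical helper).

    Labels are inserted as-is; a real serialiser would escape quotes.
    """
    lines = [
        f"@prefix skos: <{SKOS}> .",
        "",
        f"<{uri}> a skos:Concept ;",
        f'    skos:prefLabel "{pref_label}"@{lang}',
    ]
    # An alignment is expressed here as skos:exactMatch to a concept
    # in another vocabulary.
    for target in exact_matches:
        lines[-1] += " ;"
        lines.append(f"    skos:exactMatch <{target}>")
    lines[-1] += " ."
    return "\n".join(lines)


turtle = concept_to_turtle(
    "http://example.org/vocab-a/windmill",  # assumed concept URI
    "windmolen",
    "nl",
    exact_matches=["http://example.org/vocab-b/c42"],  # assumed target
)
print(turtle)
```

The concept URI plays the role of the stable reference; a persistent identifier service would sit in front of such URIs rather than replace them.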

Use cases
     • Use cases from CATCHPlus and Cultural Heritage
          – Publish your thesaurus: import SKOS vocabulary, then
            get REST access, tool support and Linked Data for free.
          – Use for resource description: concept selection
          – Use for browse and search (both terminology and
            collections)
              • VAS Repository as topic map for CH collections
          – Use for thesaurus maintenance by online communities
          – Query translation, expansion, refinement
          – Etc.
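
The query expansion use case can be sketched as follows: a search term is looked up in a thesaurus and replaced by the labels of the matching concept and of all its narrower concepts. This is a minimal sketch under stated assumptions; the toy THESAURUS data, its dictionary layout, and the function expand_query are invented for illustration and do not reflect how VAS stores or serves its data.

```python
# Toy thesaurus: concept id -> labels and narrower concepts (all data invented).
THESAURUS = {
    "c1": {"pref": "vessel", "alt": ["ship"], "narrower": ["c2", "c3"]},
    "c2": {"pref": "sailing ship", "alt": [], "narrower": []},
    "c3": {"pref": "steamship", "alt": ["steamer"], "narrower": []},
}


def expand_query(term, thesaurus):
    """Return labels of the concepts matching `term` and of their narrower concepts."""
    term = term.lower()
    # Start from concepts whose preferred or alternative label equals the term.
    start = [
        cid
        for cid, c in thesaurus.items()
        if term == c["pref"].lower() or term in (a.lower() for a in c["alt"])
    ]
    seen, terms, stack = set(), [], list(start)
    while stack:
        cid = stack.pop()
        if cid in seen:
            continue  # guard against cycles in the hierarchy
        seen.add(cid)
        concept = thesaurus[cid]
        terms.extend([concept["pref"], *concept["alt"]])
        stack.extend(concept["narrower"])
    # Fall back to the original term when the thesaurus does not know it.
    return sorted(set(terms)) or [term]


print(expand_query("ship", THESAURUS))
```

Query refinement would walk the hierarchy in the same way but offer the narrower labels to the user as choices instead of adding them all to the query.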

What is it?
     • Repository for SKOS data (including alignment
       data)
          – RDF store (Virtuoso)
     • REST API on top (search, autocomplete, upload,
       download), based on SKOS data model
     • Linked Data interface
     • Both persistent identifiers and stable URIs
     • Future functionality:
          – Distributed operation
          – “live connections” with thesaurus databases → automatic
            updates
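
The slides name an autocomplete operation but do not show its interface, so the following is only a sketch of what such an operation might compute over SKOS labels: a case-insensitive prefix match on preferred and alternative labels that returns each hit with its concept URI. The CONCEPTS data, the example.org URIs, and the autocomplete function are assumptions for illustration, not the actual VAS REST API.

```python
# Invented concepts: (URI, preferred label, alternative labels).
CONCEPTS = [
    ("http://example.org/c/1", "painting", ["paintings"]),
    ("http://example.org/c/2", "painter", []),
    ("http://example.org/c/3", "photograph", ["photo"]),
]


def autocomplete(prefix, concepts, limit=10):
    """Return (label, uri) pairs whose label starts with `prefix`, ignoring case."""
    prefix = prefix.lower()
    hits = []
    for uri, pref, alts in concepts:
        # Both preferred and alternative labels are candidates.
        for label in [pref, *alts]:
            if label.lower().startswith(prefix):
                hits.append((label, uri))
    # Sort alphabetically by label and cap the number of suggestions.
    return sorted(hits)[:limit]


print(autocomplete("pain", CONCEPTS))
```

Returning the URI with every suggestion lets a client tool store a reference to the concept rather than a copy of its label.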
[Diagram; only its text labels survive]
     • Top row: CATCHPlus Tools and Services; Browse/Search; Commercial;
       Linked Data tools
     • Middle: REST API and LoD on an RDF Store, with "upload/harvest"
     • Bottom row: REST API and LoD on an RDF Store (twice); REST API on an
       Alternative Store
Client tools and services
     • CATCHPlus cases (semantic annotation,
       ranking, art recommender, …)
     • Commercial collection management
       software builder uses API to include
       thesaurus information
     • Generic browse and search web
       application (using the REST API)
Status
     • Currently contains 12 thesauri (most are not yet licensed)
     • Browse/search tool (version 1) is ready
     • Attracting interest from
          – Thesaurus providers
              • VU, Wageningen SemWeb group, RKD, CLARIN-NL
          – Tool builders
              • collection management software builders
          – Opportunity for API and/or technology harmonisation
     • Used for collaboration of Beeld en Geluid and National
       Archive on their GTAA thesaurus
     • Candidate for Open Source development?


Concluding remarks
     • Many services that CATCHPlus builds or needs are quite
       generic
          – We have services to offer and services to ask for
     • Cultural Heritage ICT departments are interested in
       infrastructural services
     • Harmonisation of APIs
     • We started with REST (+mashups). Additional need for
       SOAP (+service bus)?
          – Current CATCHPlus answer: no.
     • Most CATCHPlus services need to be reliable and
       performant. Storage capacity is less of an issue.


Thank you. Questions?



