SlideShare ist ein Scribd-Unternehmen logo
1 von 16
1
Daan Broeder
Menzo Windhouwer
Meertens Instituut
2
• Create an interoperable domain of Language
Resources (LR)
– Interoperable formats for LR content
– Persistent identification (and citation) of LRs
– Use of SAML based AAI for access to LRs
– Use of the Component Metadata Infrastructure (CMDI) for describing
LRs
3
• Created as a response to a fragmented situation of LR metadata
• Flexible
– Not a single schema, but supports different metadata schema
– Different schema for different situations
– Semantic Interoperability via linking to semantic registries
• Community driven
– communities can model their own metadata schema
– know their data and can create the right schema
– know the right terminology
• Sharing
– Concepts, Terminology, Vocabularies
• CLARIN Concept Registry for linguistic concepts,
• ISO 368 and other relevant vocabularies
• CLAVAS for organisation names
– Components & profiles via the CLARIN metadata component registry
4
• A Component groups together metadata
Elements, which naturally belong together
to describe a property of the resources
– The Location where a SpeechRecording took place
– The Location of an Actor
– A Location is described by an address a/o region a/o
country a/o continent
• Components can be nested
– The Language a specific Actor speaks
– An Actor who takes part in a SpeechRecording for a
specific Project
• A Profile is a specific collection of
Components for a specific type of
resources, e.g., speech recordings
SpeechRecordingP
ActorC
LocationC
- addressE
- regionE
- countryE
- continentE
LocationC
ProjectC
LanguageC
LanguageC
Technical
MetadataC
5
OAI-PMH
Provider
OAI-PMH
Harvester
Local
metadata
repository
Joint
metadata
repository
metadata
modeler
metadata
user
metadata
creator
component
registry &
editor
metadata
editor
metadata
curator
metadata
curator
metadata
catalogue
Relation
Registry
search &
semantic
mapping
Resources
Concept
Registry
6
• Started in 2010, version 1.2 released in 2016 supporting
remote vocabularies
• Actively supported by CLARIN ERIC and several national CLARIN
consortia
• Many supporting tools:
– VLO, COMEDI, ARBIL, CMDI maker, Virtual Collection Registry …
• Link to the Linked (open) Data world: CMDI2RDF
CMDI LODCMDI2RDF
7
• Started as a 2014 CLARIN NL project by TLA/MPI and DANS
• Now a service supported by CLARIAH WP2 (X11.400)
• Linking also to other ‘linguistic’ LoD information sources:
– WALS for linguistic typology information
– CLAVAS organization names
– DBpedia (currently only used as glue)
• Automatic synchronization CMDI metadata
• Simplification of the RDFs CMDI model
8
• CMD is classic W3C schema constrained XML
• To map a CMD record to RDF we need
– A mapping for the basic component model to RDFS
• Basic classes and properties to represent profiles, components,
elements, attributes and their relationships and values
– A mapping for a specific profile or component to RDFS
• A specific subclass or subproperty of the basic component model
– A mapping for specific metadata records to RDF instances of RDFS
• Instances of profile or component
– Additionaly there is a generic CMD envelop that is mapped using
common LOD vocabularies
9
 Basic CMD model is described by ISO/DIS 24622-1
 1st part of ISO TC 37 SC 4 3 CMD standards family
 Natural mapping to RDF would be:
 Profiles/components to RDF Classes
 Elements to RDF Properties
 Complication
 CLARIN’s CMDI allows attributes on both Components and Elements
 So elements have to be RDF Classes as well
10
• Nevertheless introduces extra hierarchy
• CMDI is already a hierarchical metadata schema
• Human readability decreases
• Other solutions welcome!
R 14
Age
<Description URI= …. >
<Age>14</Age>
…
</Person
<Description…. >
<Age status=‘U’>14</Age>
…
</Description> R
Age
14
U
Simplified example
status
11
OAI
harvester
CLARIN
joint
metadata
domain
CMD2RDF
• conversion
• enrichment
Virtuoso
caching
CMD-RDF
• SPARQL
• REST
• browse
(L)L(O)D cloud
Component
Registry
CLAVAS
WALS
Technology:
• Virtuoso RDF store
• Elda as browser
• Tomcat as application server
• Conversion pipeline in Java
• Core transforms in XSLT
• All source code on GitHub,
• Docker build file & images available
12
13
• Offers LoD for different LR
metadata infrastructures
– LRE Map (LREC)
– META-SHARE
– CLARIN
– DataHub (linguistic part)
• However
– Wrt. CLARIN only data with DC
profiles
• Just a small part of CLARIN
– Seems partly based on static old
data dumps
14
• Goals:
– Find metadata type of information about LRs in LD format
– Translate that into a ‘suitable’ CMDI profile based metadata record
• Is there such LD that is not already available direct in another
format: OLAC, CLARIN, DC, META-SHARE
– If so, useful to have this metadata in the CLARIN VLO metadata catalogue
– Humanities data archives will have mostly DC, (inventory available from
different projects: e.g. DASISH) and frequently offer LD
– Easier ways exist to translate DC into CMDI (e.g. the CMDI DC profile)
– But LD can be a pivot set for many such translations
• Still in exploratory phase
– Would like to use a general strategy,
– Its very labor intensive to craft specific transformations for every LD set.
15
• Useful for CLARIN?
– Enriching existing CMDI metadata and
recycling them
– Relations to sources already known as:
• WALS, DBpedia, CLAVAS, GlotoLog, …
• Relations to CLARIAH LD sources ?
– Enable the VLO (or an alternative browser)
for visualizing this information
– Increasing metadata quality:
• Use CLAVAS to repair errors
• Include preferred labels
– Some CMDI adaptations required
• Foreign namespace support in CMDI
payload
A
VLO
B
C
RDF2CMD
CLARIN CENTRES
CLARIAH?
Enriched
CMDI
CMDI
DPpedia Glotolog
RDFstore
16
http://cmdi2rdf.meertens.knaw.nl/cmd2rdf/

Weitere ähnliche Inhalte

Was ist angesagt?

DC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open sessionDC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open sessionMikael Nilsson
 
Deploying RDF Linked Data via Virtuoso Universal Server
Deploying RDF Linked Data via Virtuoso Universal ServerDeploying RDF Linked Data via Virtuoso Universal Server
Deploying RDF Linked Data via Virtuoso Universal Serverrumito
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchChimezie Ogbuji
 
Semantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business IntelligenceSemantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business IntelligenceMarin Dimitrov
 
CLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesCLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesVyacheslav Tykhonov
 
MLA crosswalk
MLA crosswalkMLA crosswalk
MLA crosswalksol613
 
Solving Real Problems Using Linked Data
Solving Real Problems Using Linked DataSolving Real Problems Using Linked Data
Solving Real Problems Using Linked Datarumito
 
RDF_API_Java_Stefan_Apostoaie
RDF_API_Java_Stefan_ApostoaieRDF_API_Java_Stefan_Apostoaie
RDF_API_Java_Stefan_Apostoaieiosstef
 
Virtuoso Universal Server Overview
Virtuoso Universal Server OverviewVirtuoso Universal Server Overview
Virtuoso Universal Server Overviewrumito
 
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Menzo Windhouwer
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...Vyacheslav Tykhonov
 
DC Architecture WG Meeting - DC-2006, Mexico
DC Architecture WG Meeting - DC-2006, MexicoDC Architecture WG Meeting - DC-2006, Mexico
DC Architecture WG Meeting - DC-2006, MexicoEduserv Foundation
 
Ldap system administration
Ldap system administrationLdap system administration
Ldap system administrationAli Abdo
 
IASLIC's 23rd National Seminar, Kolkata by Goutam Biswas
IASLIC's 23rd National Seminar, Kolkata by Goutam BiswasIASLIC's 23rd National Seminar, Kolkata by Goutam Biswas
IASLIC's 23rd National Seminar, Kolkata by Goutam BiswasGoutam Biswas
 
Open Source Library Automation Software - NewGenLib
Open Source Library Automation Software - NewGenLibOpen Source Library Automation Software - NewGenLib
Open Source Library Automation Software - NewGenLibVerus Solutions Pvt ltd
 
Expressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDLExpressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDLCredential Engine
 

Was ist angesagt? (19)

DC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open sessionDC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open session
 
Deploying RDF Linked Data via Virtuoso Universal Server
Deploying RDF Linked Data via Virtuoso Universal ServerDeploying RDF Linked Data via Virtuoso Universal Server
Deploying RDF Linked Data via Virtuoso Universal Server
 
Semantic web
Semantic webSemantic web
Semantic web
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Semantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business IntelligenceSemantic Technologies and Triplestores for Business Intelligence
Semantic Technologies and Triplestores for Business Intelligence
 
CLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemesCLARIAH CMDI use case and flexible metadata schemes
CLARIAH CMDI use case and flexible metadata schemes
 
MLA crosswalk
MLA crosswalkMLA crosswalk
MLA crosswalk
 
Metadata lecture 5 part 2
Metadata lecture 5 part 2Metadata lecture 5 part 2
Metadata lecture 5 part 2
 
Intro
IntroIntro
Intro
 
Solving Real Problems Using Linked Data
Solving Real Problems Using Linked DataSolving Real Problems Using Linked Data
Solving Real Problems Using Linked Data
 
RDF_API_Java_Stefan_Apostoaie
RDF_API_Java_Stefan_ApostoaieRDF_API_Java_Stefan_Apostoaie
RDF_API_Java_Stefan_Apostoaie
 
Virtuoso Universal Server Overview
Virtuoso Universal Server OverviewVirtuoso Universal Server Overview
Virtuoso Universal Server Overview
 
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...
 
DC Architecture WG Meeting - DC-2006, Mexico
DC Architecture WG Meeting - DC-2006, MexicoDC Architecture WG Meeting - DC-2006, Mexico
DC Architecture WG Meeting - DC-2006, Mexico
 
Ldap system administration
Ldap system administrationLdap system administration
Ldap system administration
 
IASLIC's 23rd National Seminar, Kolkata by Goutam Biswas
IASLIC's 23rd National Seminar, Kolkata by Goutam BiswasIASLIC's 23rd National Seminar, Kolkata by Goutam Biswas
IASLIC's 23rd National Seminar, Kolkata by Goutam Biswas
 
Open Source Library Automation Software - NewGenLib
Open Source Library Automation Software - NewGenLibOpen Source Library Automation Software - NewGenLib
Open Source Library Automation Software - NewGenLib
 
Expressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDLExpressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDL
 

Andere mochten auch

Uso del internet rivaldo
Uso del internet  rivaldoUso del internet  rivaldo
Uso del internet rivaldoalevehe11
 
Certificate-MERAJ
Certificate-MERAJCertificate-MERAJ
Certificate-MERAJMohammed Meraj
 
Hamza CV (1)
Hamza CV (1)Hamza CV (1)
Hamza CV (1)Hamza Mian
 
Insects [training]
Insects  [training]Insects  [training]
Insects [training]Trong1903
 
Peningkatan mutu kompetensi pendidik dan tenaga kependidikan serta
Peningkatan mutu kompetensi  pendidik dan tenaga kependidikan sertaPeningkatan mutu kompetensi  pendidik dan tenaga kependidikan serta
Peningkatan mutu kompetensi pendidik dan tenaga kependidikan sertaKank Hari
 
RevoluciĂŁÂłn mexicana presentacion.esp5
RevoluciĂŁÂłn mexicana presentacion.esp5RevoluciĂŁÂłn mexicana presentacion.esp5
RevoluciĂŁÂłn mexicana presentacion.esp5hobbitgirl23
 
Buku kebijakan spmi
Buku kebijakan spmiBuku kebijakan spmi
Buku kebijakan spmispmi
 
OSS ERP iDempiereの共通基本操作
OSS ERP iDempiereの共通基本操作OSS ERP iDempiereの共通基本操作
OSS ERP iDempiereの共通基本操作Hideaki Hagiwara
 
TeorĂ­a de la comunicaciĂłn masiva
TeorĂ­a de la comunicaciĂłn masivaTeorĂ­a de la comunicaciĂłn masiva
TeorĂ­a de la comunicaciĂłn masivaJosĂŠ Luis LĂłpez
 
20160124_GPL勉強会
20160124_GPL勉強会20160124_GPL勉強会
20160124_GPL勉強会rie05
 
MĂşsica contemporĂĄnea 3ÂşESO
MĂşsica contemporĂĄnea 3ÂşESOMĂşsica contemporĂĄnea 3ÂşESO
MĂşsica contemporĂĄnea 3ÂşESOInma Montesinos
 
Ejercicio guiado de Inkscape
Ejercicio guiado de InkscapeEjercicio guiado de Inkscape
Ejercicio guiado de InkscapeAinara PĂŠrez
 
Presidential helpers (7.4)
Presidential helpers (7.4)Presidential helpers (7.4)
Presidential helpers (7.4)Matthew Caggia
 

Andere mochten auch (20)

Sem tĂ­tulo 1
Sem tĂ­tulo 1Sem tĂ­tulo 1
Sem tĂ­tulo 1
 
Uso del internet rivaldo
Uso del internet  rivaldoUso del internet  rivaldo
Uso del internet rivaldo
 
Certificate-MERAJ
Certificate-MERAJCertificate-MERAJ
Certificate-MERAJ
 
Integrating nutrition to systems research: through Nutrition sensitive landsc...
Integrating nutrition to systems research: through Nutrition sensitive landsc...Integrating nutrition to systems research: through Nutrition sensitive landsc...
Integrating nutrition to systems research: through Nutrition sensitive landsc...
 
Testing
TestingTesting
Testing
 
Marlene delgado
Marlene delgadoMarlene delgado
Marlene delgado
 
PREHISTORY
PREHISTORYPREHISTORY
PREHISTORY
 
Hamza CV (1)
Hamza CV (1)Hamza CV (1)
Hamza CV (1)
 
Gender Awareness and Dynamics in the Workplace
Gender Awareness and Dynamics in the WorkplaceGender Awareness and Dynamics in the Workplace
Gender Awareness and Dynamics in the Workplace
 
Insects [training]
Insects  [training]Insects  [training]
Insects [training]
 
So Sánh phần mềm SugarCRM với MisaCRM, genCRM, vTiger (V.2016)
So Sánh phần mềm SugarCRM với MisaCRM, genCRM,  vTiger (V.2016)So Sánh phần mềm SugarCRM với MisaCRM, genCRM,  vTiger (V.2016)
So Sánh phần mềm SugarCRM với MisaCRM, genCRM, vTiger (V.2016)
 
Peningkatan mutu kompetensi pendidik dan tenaga kependidikan serta
Peningkatan mutu kompetensi  pendidik dan tenaga kependidikan sertaPeningkatan mutu kompetensi  pendidik dan tenaga kependidikan serta
Peningkatan mutu kompetensi pendidik dan tenaga kependidikan serta
 
RevoluciĂŁÂłn mexicana presentacion.esp5
RevoluciĂŁÂłn mexicana presentacion.esp5RevoluciĂŁÂłn mexicana presentacion.esp5
RevoluciĂŁÂłn mexicana presentacion.esp5
 
Buku kebijakan spmi
Buku kebijakan spmiBuku kebijakan spmi
Buku kebijakan spmi
 
OSS ERP iDempiereの共通基本操作
OSS ERP iDempiereの共通基本操作OSS ERP iDempiereの共通基本操作
OSS ERP iDempiereの共通基本操作
 
TeorĂ­a de la comunicaciĂłn masiva
TeorĂ­a de la comunicaciĂłn masivaTeorĂ­a de la comunicaciĂłn masiva
TeorĂ­a de la comunicaciĂłn masiva
 
20160124_GPL勉強会
20160124_GPL勉強会20160124_GPL勉強会
20160124_GPL勉強会
 
MĂşsica contemporĂĄnea 3ÂşESO
MĂşsica contemporĂĄnea 3ÂşESOMĂşsica contemporĂĄnea 3ÂşESO
MĂşsica contemporĂĄnea 3ÂşESO
 
Ejercicio guiado de Inkscape
Ejercicio guiado de InkscapeEjercicio guiado de Inkscape
Ejercicio guiado de Inkscape
 
Presidential helpers (7.4)
Presidential helpers (7.4)Presidential helpers (7.4)
Presidential helpers (7.4)
 

Ähnlich wie CMDI2RDF

Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core MetadataHannes Ebner
 
Dublin Core Metadata Initiative Abstract Model
Dublin Core Metadata Initiative Abstract ModelDublin Core Metadata Initiative Abstract Model
Dublin Core Metadata Initiative Abstract ModelJenn Riley
 
CS6010 Social Network Analysis Unit II
CS6010 Social Network Analysis   Unit IICS6010 Social Network Analysis   Unit II
CS6010 Social Network Analysis Unit IIpkaviya
 
Providing Linked Data
Providing Linked DataProviding Linked Data
Providing Linked DataEUCLID project
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLChimezie Ogbuji
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminologyTenforce
 
Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureFedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureMenzo Windhouwer
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic WebSerendipity Seraph
 
20080917 Rev
20080917 Rev20080917 Rev
20080917 Revcharper
 
Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) robin fay
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
RDA-DCAM and Application Profiles
RDA-DCAM and Application ProfilesRDA-DCAM and Application Profiles
RDA-DCAM and Application ProfilesMikael Nilsson
 
Linked Open Data and DANS
Linked Open Data and DANSLinked Open Data and DANS
Linked Open Data and DANSvty
 
Comparative study on the processing of RDF in PHP
Comparative study on the processing of RDF in PHPComparative study on the processing of RDF in PHP
Comparative study on the processing of RDF in PHPMSGUNC
 
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...CONUL Conference
 

Ähnlich wie CMDI2RDF (20)

Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core Metadata
 
CMD2RDF
CMD2RDFCMD2RDF
CMD2RDF
 
Dublin Core Metadata Initiative Abstract Model
Dublin Core Metadata Initiative Abstract ModelDublin Core Metadata Initiative Abstract Model
Dublin Core Metadata Initiative Abstract Model
 
CS6010 Social Network Analysis Unit II
CS6010 Social Network Analysis   Unit IICS6010 Social Network Analysis   Unit II
CS6010 Social Network Analysis Unit II
 
Providing Linked Data
Providing Linked DataProviding Linked Data
Providing Linked Data
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminology
 
Fedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN InfrastructureFedora Commons in the CLARIN Infrastructure
Fedora Commons in the CLARIN Infrastructure
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic Web
 
20080917 Rev
20080917 Rev20080917 Rev
20080917 Rev
 
Knowledge mangement
Knowledge mangementKnowledge mangement
Knowledge mangement
 
Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries)
 
Best of Marketing
Best of MarketingBest of Marketing
Best of Marketing
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
RDA-DCAM and Application Profiles
RDA-DCAM and Application ProfilesRDA-DCAM and Application Profiles
RDA-DCAM and Application Profiles
 
2015 CIC: #EdTech Forum - LRMI
2015 CIC: #EdTech Forum - LRMI2015 CIC: #EdTech Forum - LRMI
2015 CIC: #EdTech Forum - LRMI
 
Semantics
SemanticsSemantics
Semantics
 
Linked Open Data and DANS
Linked Open Data and DANSLinked Open Data and DANS
Linked Open Data and DANS
 
Comparative study on the processing of RDF in PHP
Comparative study on the processing of RDF in PHPComparative study on the processing of RDF in PHP
Comparative study on the processing of RDF in PHP
 
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...
‘Development of a MODS-RDF Cataloguing Tool for the Digital Resources and Ima...
 

Mehr von CLARIAH

ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018
ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018
ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018CLARIAH
 
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018CLARIAH
 
Masterclass innosurance 2018
Masterclass innosurance 2018Masterclass innosurance 2018
Masterclass innosurance 2018CLARIAH
 
Flat TLA
Flat TLAFlat TLA
Flat TLACLARIAH
 
QB'er demonstration
QB'er demonstrationQB'er demonstration
QB'er demonstrationCLARIAH
 
Collection registration for the CLARIAH Media Suite.
Collection registration for the CLARIAH Media Suite.Collection registration for the CLARIAH Media Suite.
Collection registration for the CLARIAH Media Suite.CLARIAH
 
2016 05-20-clariah-wp4
2016 05-20-clariah-wp42016 05-20-clariah-wp4
2016 05-20-clariah-wp4CLARIAH
 
2016 05-20-clariah-wp3
2016 05-20-clariah-wp32016 05-20-clariah-wp3
2016 05-20-clariah-wp3CLARIAH
 
2016 05-20-clariah-wp2
2016 05-20-clariah-wp22016 05-20-clariah-wp2
2016 05-20-clariah-wp2CLARIAH
 
2016 05-20-clariah-wp5
2016 05-20-clariah-wp52016 05-20-clariah-wp5
2016 05-20-clariah-wp5CLARIAH
 
MTAS Henny Brugman
MTAS Henny BrugmanMTAS Henny Brugman
MTAS Henny BrugmanCLARIAH
 
LREC Ton vd Wouden
LREC Ton vd WoudenLREC Ton vd Wouden
LREC Ton vd WoudenCLARIAH
 
Paqu Gertjan van Noord en Jan Odijk
Paqu Gertjan van Noord en Jan OdijkPaqu Gertjan van Noord en Jan Odijk
Paqu Gertjan van Noord en Jan OdijkCLARIAH
 
Open sonar martinreynaert
Open sonar martinreynaertOpen sonar martinreynaert
Open sonar martinreynaertCLARIAH
 
Struc data Auke Rijpma
Struc data Auke RijpmaStruc data Auke Rijpma
Struc data Auke RijpmaCLARIAH
 
Diachronous conceptuallexicons Marieke van Erp / Piek Vossen
Diachronous conceptuallexicons Marieke van Erp / Piek VossenDiachronous conceptuallexicons Marieke van Erp / Piek Vossen
Diachronous conceptuallexicons Marieke van Erp / Piek VossenCLARIAH
 
Corpus studio Erwin Komen
Corpus studio Erwin KomenCorpus studio Erwin Komen
Corpus studio Erwin KomenCLARIAH
 
Athena richard zijdeman
Athena richard zijdemanAthena richard zijdeman
Athena richard zijdemanCLARIAH
 
Struc data aukerijpma
Struc data aukerijpmaStruc data aukerijpma
Struc data aukerijpmaCLARIAH
 
Anansi jauco noordzij
Anansi jauco noordzijAnansi jauco noordzij
Anansi jauco noordzijCLARIAH
 

Mehr von CLARIAH (20)

ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018
ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018
ACAD Presentation by Wilbert Spooren, CLARIAH Toogdag 19-10-2018
 
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018
DB:CCC Presentation of Karin Hofmeester, CLARIAH Toogdag 19-10-2018
 
Masterclass innosurance 2018
Masterclass innosurance 2018Masterclass innosurance 2018
Masterclass innosurance 2018
 
Flat TLA
Flat TLAFlat TLA
Flat TLA
 
QB'er demonstration
QB'er demonstrationQB'er demonstration
QB'er demonstration
 
Collection registration for the CLARIAH Media Suite.
Collection registration for the CLARIAH Media Suite.Collection registration for the CLARIAH Media Suite.
Collection registration for the CLARIAH Media Suite.
 
2016 05-20-clariah-wp4
2016 05-20-clariah-wp42016 05-20-clariah-wp4
2016 05-20-clariah-wp4
 
2016 05-20-clariah-wp3
2016 05-20-clariah-wp32016 05-20-clariah-wp3
2016 05-20-clariah-wp3
 
2016 05-20-clariah-wp2
2016 05-20-clariah-wp22016 05-20-clariah-wp2
2016 05-20-clariah-wp2
 
2016 05-20-clariah-wp5
2016 05-20-clariah-wp52016 05-20-clariah-wp5
2016 05-20-clariah-wp5
 
MTAS Henny Brugman
MTAS Henny BrugmanMTAS Henny Brugman
MTAS Henny Brugman
 
LREC Ton vd Wouden
LREC Ton vd WoudenLREC Ton vd Wouden
LREC Ton vd Wouden
 
Paqu Gertjan van Noord en Jan Odijk
Paqu Gertjan van Noord en Jan OdijkPaqu Gertjan van Noord en Jan Odijk
Paqu Gertjan van Noord en Jan Odijk
 
Open sonar martinreynaert
Open sonar martinreynaertOpen sonar martinreynaert
Open sonar martinreynaert
 
Struc data Auke Rijpma
Struc data Auke RijpmaStruc data Auke Rijpma
Struc data Auke Rijpma
 
Diachronous conceptuallexicons Marieke van Erp / Piek Vossen
Diachronous conceptuallexicons Marieke van Erp / Piek VossenDiachronous conceptuallexicons Marieke van Erp / Piek Vossen
Diachronous conceptuallexicons Marieke van Erp / Piek Vossen
 
Corpus studio Erwin Komen
Corpus studio Erwin KomenCorpus studio Erwin Komen
Corpus studio Erwin Komen
 
Athena richard zijdeman
Athena richard zijdemanAthena richard zijdeman
Athena richard zijdeman
 
Struc data aukerijpma
Struc data aukerijpmaStruc data aukerijpma
Struc data aukerijpma
 
Anansi jauco noordzij
Anansi jauco noordzijAnansi jauco noordzij
Anansi jauco noordzij
 

KĂźrzlich hochgeladen

%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durbanmasabamasaba
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp KrisztiĂĄn
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfayushiqss
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
Generic or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisionsGeneric or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisionsBert Jan Schrijver
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburgmasabamasaba
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...masabamasaba
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park masabamasaba
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 

KĂźrzlich hochgeladen (20)

%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Generic or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisionsGeneric or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisions
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 

CMDI2RDF

  • 2. 2 • Create an interoperable domain of Language Resources (LR) – Interoperable formats for LR content – Persistent identification (and citation) of LRs – Use of SAML based AAI for access to LRs – Use of the Component Metadata Infrastructure (CMDI) for describing LRs
  • 3. 3 • Created as a response to a fragmented situation of LR metadata • Flexible – Not a single schema, but supports different metadata schema – Different schema for different situations – Semantic Interoperability via linking to semantic registries • Community driven – communities can model their own metadata schema – know their data and can create the right schema – know the right terminology • Sharing – Concepts, Terminology, Vocabularies • CLARIN Concept Registry for linguistic concepts, • ISO 368 and other relevant vocabularies • CLAVAS for organisation names – Components & profiles via the CLARIN metadata component registry
  • 4. 4 • A Component groups together metadata Elements, which naturally belong together to describe a property of the resources – The Location where a SpeechRecording took place – The Location of an Actor – A Location is described by an address a/o region a/o country a/o continent • Components can be nested – The Language a specific Actor speaks – An Actor who takes part in a SpeechRecording for a specific Project • A Profile is a specific collection of Components for a specific type of resources, e.g., speech recordings SpeechRecordingP ActorC LocationC - addressE - regionE - countryE - continentE LocationC ProjectC LanguageC LanguageC Technical MetadataC
  • 6. 6 • Started in 2010, version 1.2 released in 2016 supporting remote vocabularies • Actively supported by CLARIN ERIC and several national CLARIN consortia • Many supporting tools: – VLO, COMEDI, ARBIL, CMDI maker, Virtual Collection Registry … • Link to the Linked (open) Data world: CMDI2RDF CMDI LODCMDI2RDF
  • 7. 7 • Started as a 2014 CLARIN NL project by TLA/MPI and DANS • Now a service supported by CLARIAH WP2 (X11.400) • Linking also to other ‘linguistic’ LoD information sources: – WALS for linguistic typology information – CLAVAS organization names – DBpedia (currently only used as glue) • Automatic synchronization CMDI metadata • Simplification of the RDFs CMDI model
  • 8. 8 • CMD is classic W3C schema constrained XML • To map a CMD record to RDF we need – A mapping for the basic component model to RDFS • Basic classes and properties to represent profiles, components, elements, attributes and their relationships and values – A mapping for a specific profile or component to RDFS • A specific subclass or subproperty of the basic component model – A mapping for specific metadata records to RDF instances of RDFS • Instances of profile or component – Additionaly there is a generic CMD envelop that is mapped using common LOD vocabularies
  • 9. 9  Basic CMD model is described by ISO/DIS 24622-1  1st part of ISO TC 37 SC 4 3 CMD standards family  Natural mapping to RDF would be:  Profiles/components to RDF Classes  Elements to RDF Properties  Complication  CLARIN’s CMDI allows attributes on both Components and Elements  So elements have to be RDF Classes as well
  • 10. 10 • Nevertheless introduces extra hierarchy • CMDI is already a hierarchical metadata schema • Human readability decreases • Other solutions welcome! R 14 Age <Description URI= …. > <Age>14</Age> … </Person <Description…. > <Age status=‘U’>14</Age> … </Description> R Age 14 U Simplified example status
  • 11. 11 OAI harvester CLARIN joint metadata domain CMD2RDF • conversion • enrichment Virtuoso caching CMD-RDF • SPARQL • REST • browse (L)L(O)D cloud Component Registry CLAVAS WALS Technology: • Virtuoso RDF store • Elda as browser • Tomcat as application server • Conversion pipeline in Java • Core transforms in XSLT • All source code on GitHub, • Docker build file & images available
  • 12. 12
  • 13. 13 • Offers LoD for different LR metadata infrastructures – LRE Map (LREC) – META-SHARE – CLARIN – DataHub (linguistic part) • However – Wrt. CLARIN only data with DC profiles • Just a small part of CLARIN – Seems partly based on static old data dumps
  • 14. 14 • Goals: – Find metadata type of information about LRs in LD format – Translate that into a ‘suitable’ CMDI profile based metadata record • Is there such LD that is not already available direct in another format: OLAC, CLARIN, DC, META-SHARE – If so, useful to have this metadata in the CLARIN VLO metadata catalogue – Humanities data archives will have mostly DC, (inventory available from different projects: e.g. DASISH) and frequently offer LD – Easier ways exist to translate DC into CMDI (e.g. the CMDI DC profile) – But LD can be a pivot set for many such translations • Still in exploratory phase – Would like to use a general strategy, – Its very labor intensive to craft specific transformations for every LD set.
  • 15. 15 • Useful for CLARIN? – Enriching existing CMDI metadata and recycling them – Relations to sources already known as: • WALS, DBpedia, CLAVAS, GlotoLog, … • Relations to CLARIAH LD sources ? – Enable the VLO (or an alternative browser) for visualizing this information – Increasing metadata quality: • Use CLAVAS to repair errors • Include preferred labels – Some CMDI adaptations required • Foreign namespace support in CMDI payload A VLO B C RDF2CMD CLARIN CENTRES CLARIAH? Enriched CMDI CMDI DPpedia Glotolog RDFstore

Hinweis der Redaktion

  1. Virtuoso as a tripelstore Tomcat as application server Elda as browser Conversion pipeline in Java core transforms in XSLT all in a Docker package Code all on GitHub: