SlideShare ist ein Scribd-Unternehmen logo
1 von 21
www.isocat.org

ISOcat and RELcat:
2 cooperating Semantic Registries

Menzo Windhouwer
menzo.windhouwer@dans.knaw.nl
The Language Archive – DANS

Ineke Schuurman
ineke@ccl.kuleuven.be
KU Leuven, CLARIN-NL – Utrecht University
17 January 2014

CLIN 24

1
www.isocat.org

Outline

• The need for explicit semantics
– ISOcat

• Mapping issues
– Languages, theoretical frameworks
– Granularity levels
– RELcat

• CGN case study
• Conclusions and future work
17 January 2014

CLIN 24

2
www.isocat.org

Typological Database Nijmegen
TOP NOTION tds:Noun GROUPS{
NOTION tdn:GrammaticalDistinctions
LABEL "Grammatical distinctions for nouns."
GROUPS {
NOTION tdn:AgentNouns
LABEL "Agent nouns."
DESCRIPTION "Nouns can function as the agent of a clause."
LINK TO CONCEPT agentRole
GROUPS {
NOTION tdn:v098_plusAffix
LABEL "Agent nouns formed by verb stem plus affix."
LINK TO CONCEPTS (agentRole, verbalMorphology, boundAffix)
DESCRIPTION
<p>Agent nouns are formed by a verb stem plus an affix, e.g. English <qv>walk-er</qv>.</p>
NOTE AUTHOR IS "TDS" TYPE IS "original TDN label" "AGENT NOUNS ARE VERB STEM PLUS AFFIX"
IS FIELD v098;
...

Notes: TDN is not in archived in TLA, but curated in TDS, a previous project Menzo worked on, and now archived at DANS;
17 January 2014
CLIN 24
3
also this not a TDN punchcard
www.isocat.org

17 January 2014

DOBES corpora

CLIN 24

4
www.isocat.org

ISOcat

• An open Data Category/Concept Registry where
everyone can
– find and select data categories/concepts
– create new data categories/concepts
– share data categories/concepts

• Each data category/concept has a Persistent
Identifier which can be embedded in a resource
(schema) to make the intended semantics (more)
explicit
17 January 2014

CLIN 24

5
www.isocat.org

Mapping issues

• Interesting resources for a specific research
question might
– use very different theoretical frameworks, which
might share few/none data categories/concepts
– use more coarse or finer grained data
categories/concepts

• How to overcome these differences by
mapping data categories/concepts to each
other?
17 January 2014

CLIN 24

6
www.isocat.org

Some examples

• definite article (PoS)
– EN: 1 (-)
– FR: 2 (masc, fem)
– NL: 2 (neuter, non-neuter)
– DE: 3 (masc, fem, neuter)

Dutch ‘non-neuter’ , for example, should be
related to ‘masc’ and ‘fem’
17 January 2014

CLIN 24

7
www.isocat.org

Some examples

• Indirect object (syntax)
– EN: indirect object
– NL:
• meewerkend voorwerp (1), or
• meewerkend voorwerp (2) plus belanghebbend
voorwerp
– All translated as ‘indirect object’

=> 3 definitions of ‘indirect object’, relations are
to be shown !
17 January 2014

CLIN 24

8
www.isocat.org

Some examples

• Event (semantics)
– ISO-TimeML: event and state, where ‘state’ is a
type of event
– Other theories (Kamp & Reyle etc): eventuality,
two subtypes: ‘event’ and ‘state’

Concepts ‘eventuality’, ‘event’ and ‘state’ are to
be related
17 January 2014

CLIN 24

9
www.isocat.org

ISOcat internal issues

Data categories that are almost the same,
apart from type, profile, language, …
Currently we insert a new DC. But note that the
original one and the new one should be
marked as having a same-as relation

17 January 2014

CLIN 24

10
www.isocat.org

RELcat

• A Relation Registry (under construction) to store
–
–
–
–

(almost) same-as relationships
subsumption relationships (isSuperClassOf, isSubClassOf)
mereology relationships (isPartOf, hasPart)
…

between data categories/concepts
• The focus is on informal and possibly partial
ontologies to be used for resource discovery
• Based on RDF triples
17 January 2014

CLIN 24

11
www.isocat.org

CGN case study

• Atomic building blocks of CGN tags are
defined in ISOcat (still private)
• The EBNF schema of a CGN tag is stored in
SCHEMAcat
• The subsumption relations in the value
domains are stored in RELcat
• (almost) same-as relationships with other data
categories/concepts are also stored in RELcat
17 January 2014

CLIN 24

12
www.isocat.org

CGN granularity mappings

• How to deal with (almost) same-as
relationships that involve more then one
atomic CGN data category/concept?
– Example: N(SOORT) = Common Noun

• Based on the CGN EBNF this involves the
following slots of the /CGN tag/
– /PoS/ = /N/
– /NTYPE/ = /SOORT/

• How to express this in RDF?
17 January 2014

CLIN 24

13
www.isocat.org

RELcat RDF mapping

• Data categories/concepts can function as
subjects and objects in an RDF triple
• The predicate of an RDF triple is a RELcat
relationship type
• Alternative: complex data categories as
properties

17 January 2014

CLIN 24

14
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA

sameAs

Common Noun
17 January 2014

CLIN 24

15
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart

hasPart

PoS

has more parts
NTYPE

has more
potential
values

has more
potential
values
sameAs

hasPotentialValue

N
17 January 2014

Common Noun
CLIN 24

hasPotentialValue

SOORT
16
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart
hasPart

PoS
has more
potential
values

hasPart
hasPart

isA
hasValue

hasPotentialValue

17 January 2014

NTYPE

has more
potential
values

isA
sameAs

isA
N

has more parts

hasValue

hasPotentialValue

isA
Common Noun
CLIN 24

SOORT
17
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart
hasPart

PoS
has more
potential
values

hasPart
hasPart

isA
hasValue

hasPotentialValue

17 January 2014

NTYPE

has more
potential
values

isA
sameAs

isA
N

has more parts

hasValue

hasPotentialValue

isA
Common Noun
CLIN 24

SOORT
18
www.isocat.org

Cooperation between
ISOcat and RELcat

• ISOcat: value domains of closed data
categories
– RELcat: hasPotentialValue (new relationship type)

• ISOcat: is-a relations between simple data
categories
– RELcat: subsumption relations

• SCHEMAcat: part-of relationships
– RELcat: mereology relationships
17 January 2014

CLIN 24

19
www.isocat.org

Conclusions and future work

• Simple mappings are easy
• Complex mapping get easily fairly complex
– UI support?
– DSL support?
– Alternative RDF mapping?

• User front-end for RELcat
– Integration of RELcat and ISOcat?
17 January 2014

CLIN 24

20
www.isocat.org

Other examples

• “JJR” -> “POS=adjective & degree=comparative”
• “Transitive” -> “thetavp=vp120 & synvps=[synNP] &
caseAssigner=True”
• “VVIMP” -> “POS= verb & main verb & mood=imperative”

17 January 2014

CLIN 24

21

Weitere ähnliche Inhalte

Ähnlich wie ISOcat and RELcat, two cooperating semantic registries

Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
Guus Schreiber
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Andrea Nuzzolese
 
What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?
Menzo Windhouwer
 
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
rchbeir
 

Ähnlich wie ISOcat and RELcat, two cooperating semantic registries (20)

Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic web
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
 
What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?
 
Semantic Web - Ontology 101
Semantic Web - Ontology 101Semantic Web - Ontology 101
Semantic Web - Ontology 101
 
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisExtracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx
 
Expressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.pptExpressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.ppt
 
Machine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF DataMachine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF Data
 
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
 
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
OpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allOpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for all
 
Lean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicLean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural Logic
 
Logics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingLogics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese Understanding
 
Discovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked DataDiscovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked Data
 
20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs
 

Mehr von Menzo Windhouwer

LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data Categories
Menzo Windhouwer
 

Mehr von Menzo Windhouwer (11)

CMD2RDF
CMD2RDFCMD2RDF
CMD2RDF
 
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.
 
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
 
A CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web ServicesA CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web Services
 
LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data Categories
 
ISOcat to LMF to TEI
ISOcat to LMF to TEIISOcat to LMF to TEI
ISOcat to LMF to TEI
 
On the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categoriesOn the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categories
 
The ISO-DCR
The ISO-DCRThe ISO-DCR
The ISO-DCR
 
Use of ISOcat within CMDI
Use of ISOcat within CMDIUse of ISOcat within CMDI
Use of ISOcat within CMDI
 
ISOcat: a short introduction
ISOcat: a short introductionISOcat: a short introduction
ISOcat: a short introduction
 
Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Kürzlich hochgeladen (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 

ISOcat and RELcat, two cooperating semantic registries

  • 1. www.isocat.org ISOcat and RELcat: 2 cooperating Semantic Registries Menzo Windhouwer menzo.windhouwer@dans.knaw.nl The Language Archive – DANS Ineke Schuurman ineke@ccl.kuleuven.be KU Leuven, CLARIN-NL – Utrecht University 17 January 2014 CLIN 24 1
  • 2. www.isocat.org Outline • The need for explicit semantics – ISOcat • Mapping issues – Languages, theoretical frameworks – Granularity levels – RELcat • CGN case study • Conclusions and future work 17 January 2014 CLIN 24 2
  • 3. www.isocat.org Typological Database Nijmegen TOP NOTION tds:Noun GROUPS{ NOTION tdn:GrammaticalDistinctions LABEL "Grammatical distinctions for nouns." GROUPS { NOTION tdn:AgentNouns LABEL "Agent nouns." DESCRIPTION "Nouns can function as the agent of a clause." LINK TO CONCEPT agentRole GROUPS { NOTION tdn:v098_plusAffix LABEL "Agent nouns formed by verb stem plus affix." LINK TO CONCEPTS (agentRole, verbalMorphology, boundAffix) DESCRIPTION <p>Agent nouns are formed by a verb stem plus an affix, e.g. English <qv>walk-er</qv>.</p> NOTE AUTHOR IS "TDS" TYPE IS "original TDN label" "AGENT NOUNS ARE VERB STEM PLUS AFFIX" IS FIELD v098; ... Notes: TDN is not in archived in TLA, but curated in TDS, a previous project Menzo worked on, and now archived at DANS; 17 January 2014 CLIN 24 3 also this not a TDN punchcard
  • 5. www.isocat.org ISOcat • An open Data Category/Concept Registry where everyone can – find and select data categories/concepts – create new data categories/concepts – share data categories/concepts • Each data category/concept has a Persistent Identifier which can be embedded in a resource (schema) to make the intended semantics (more) explicit 17 January 2014 CLIN 24 5
  • 6. www.isocat.org Mapping issues • Interesting resources for a specific research question might – use very different theoretical frameworks, which might share few/none data categories/concepts – use more coarse or finer grained data categories/concepts • How to overcome these differences by mapping data categories/concepts to each other? 17 January 2014 CLIN 24 6
  • 7. www.isocat.org Some examples • definite article (PoS) – EN: 1 (-) – FR: 2 (masc, fem) – NL: 2 (neuter, non-neuter) – DE: 3 (masc, fem, neuter) Dutch ‘non-neuter’ , for example, should be related to ‘masc’ and ‘fem’ 17 January 2014 CLIN 24 7
  • 8. www.isocat.org Some examples • Indirect object (syntax) – EN: indirect object – NL: • meewerkend voorwerp (1), or • meewerkend voorwerp (2) plus belanghebbend voorwerp – All translated as ‘indirect object’ => 3 definitions of ‘indirect object’, relations are to be shown ! 17 January 2014 CLIN 24 8
  • 9. www.isocat.org Some examples • Event (semantics) – ISO-TimeML: event and state, where ‘state’ is a type of event – Other theories (Kamp & Reyle etc): eventuality, two subtypes: ‘event’ and ‘state’ Concepts ‘eventuality’, ‘event’ and ‘state’ are to be related 17 January 2014 CLIN 24 9
  • 10. www.isocat.org ISOcat internal issues Data categories that are almost the same, apart from type, profile, language, … Currently we insert a new DC. But note that the original one and the new one should be marked as having a same-as relation 17 January 2014 CLIN 24 10
  • 11. www.isocat.org RELcat • A Relation Registry (under construction) to store – – – – (almost) same-as relationships subsumption relationships (isSuperClassOf, isSubClassOf) mereology relationships (isPartOf, hasPart) … between data categories/concepts • The focus is on informal and possibly partial ontologies to be used for resource discovery • Based on RDF triples 17 January 2014 CLIN 24 11
  • 12. www.isocat.org CGN case study • Atomic building blocks of CGN tags are defined in ISOcat (still private) • The EBNF schema of a CGN tag is stored in SCHEMAcat • The subsumption relations in the value domains are stored in RELcat • (almost) same-as relationships with other data categories/concepts are also stored in RELcat 17 January 2014 CLIN 24 12
  • 13. www.isocat.org CGN granularity mappings • How to deal with (almost) same-as relationships that involve more then one atomic CGN data category/concept? – Example: N(SOORT) = Common Noun • Based on the CGN EBNF this involves the following slots of the /CGN tag/ – /PoS/ = /N/ – /NTYPE/ = /SOORT/ • How to express this in RDF? 17 January 2014 CLIN 24 13
  • 14. www.isocat.org RELcat RDF mapping • Data categories/concepts can function as subjects and objects in an RDF triple • The predicate of an RDF triple is a RELcat relationship type • Alternative: complex data categories as properties 17 January 2014 CLIN 24 14
  • 15. www.isocat.org N(SOORT) = Common Noun CGN tag isA sameAs Common Noun 17 January 2014 CLIN 24 15
  • 16. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more parts NTYPE has more potential values has more potential values sameAs hasPotentialValue N 17 January 2014 Common Noun CLIN 24 hasPotentialValue SOORT 16
  • 17. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more potential values hasPart hasPart isA hasValue hasPotentialValue 17 January 2014 NTYPE has more potential values isA sameAs isA N has more parts hasValue hasPotentialValue isA Common Noun CLIN 24 SOORT 17
  • 18. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more potential values hasPart hasPart isA hasValue hasPotentialValue 17 January 2014 NTYPE has more potential values isA sameAs isA N has more parts hasValue hasPotentialValue isA Common Noun CLIN 24 SOORT 18
  • 19. www.isocat.org Cooperation between ISOcat and RELcat • ISOcat: value domains of closed data categories – RELcat: hasPotentialValue (new relationship type) • ISOcat: is-a relations between simple data categories – RELcat: subsumption relations • SCHEMAcat: part-of relationships – RELcat: mereology relationships 17 January 2014 CLIN 24 19
  • 20. www.isocat.org Conclusions and future work • Simple mappings are easy • Complex mapping get easily fairly complex – UI support? – DSL support? – Alternative RDF mapping? • User front-end for RELcat – Integration of RELcat and ISOcat? 17 January 2014 CLIN 24 20
  • 21. www.isocat.org Other examples • “JJR” -> “POS=adjective & degree=comparative” • “Transitive” -> “thetavp=vp120 & synvps=[synNP] & caseAssigner=True” • “VVIMP” -> “POS= verb & main verb & mood=imperative” 17 January 2014 CLIN 24 21