SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Some thoughts about the gaps across
languages and domains
through the experience on building the
core common vocabularies
Hideaki Takeda
National Institute of Informatics
takeda@nii.ac.jp
Glocal KO Workshop, Thursday August 13, 2015, Copenhagen
Who am I?
Hideaki Takeda, Dr., Eng.
• Professor, National Institute of Informatics
– Research Institute mainly for Computer Science
• Background: Computer Science, in particular, Artificial
Intelligence
• Current interest: Semantic Web, Ontology, Linked Open
Data (LOD), Social Media Analysis
• Social activities
– President, Linked Open Data Initiative (NPO)
– Founder, Dbpedia Japanese Chapter
– Specialist, Information-technology Promotion Agency,
Japan (IPA)
– Chair, Japan Link Center (Registration Agency of
International DOI Foundation)
– Board, ORCID
Core Vocabularies
• Background
– Everything is on infosphere, i.e., web
– Lots of information, lots of data, lots of systems
• Problems
– Misunderstanding/mis-matching/”missing
links“ across different domains
– Gap between human and machines (computers)
Core Vocabularies
• Aim
– Increase interoperability of information/data
– Bridge human and machine understanding
• Target
– Governmental documents/data
• Method
– Define a set of concepts which bridge (human-
readable) terms and (computer-processable) symbols
(URIs)
– Starting from the most common concepts
Core Vocabularies
• Activities worldwide
– USA: NIEM Core
• NIEM (National Information Exchange Model)
– Europe: ISA Core Vocabularies
– UN: United Nations Centre for Trade Facilitation
and Electronic Business (UN/CEFACT)
• Core Components Library (UN/CCL)
– Japan: IMI Core Vocabulary
ISA Core Vocabularies v 1.1
NIEM Architecture
http://niem.github.io/technical/iepd-versions/
NIEM
http://reference.niem.gov/niem/guidance/user-guide/vol1/user-guide-vol1.pdf
http://www.epa.gov/oei/symposium/2010/roy.pdf
IMI Project
• Supported by
– Ministry of Economy, Trade,
and Industry, Japan
• Technical Framework
– Data Model
– Core Vocabulary
– Design Rules
• Support Framework
– Tools
• for data developer
• for schema developer
– Database
• schema / tools / templates/ …
Person Type
Name
Gender
Gender Code
Birth Date
Address
…
Name Type
Type
Name
Family Name
Given Name
…
Address Type
Type
Notation
Zip Code
Prefecture
City
…
String
String
String
Code TypeString
String
String
String
String
String
Code Type
Type
Value
Name Type
Address Type
Codelist Type
String
Thing Type
10
IMI as a template for schema
Registration form for Confere
Name:
Address:
Gender:
Affiliation:
Affiliation
Address:
Attending date: -
M /
Person Type
Name
Gender
Gender Code
Birth Date
Address
…
Name Type
Type
Name
Family Name
Given Name
…
Address Type
Type
Notation
Zip Code
Prefecture
City
…
String
String
String
Code Type
String String
String
String
String
String
Code Type
Type
Value
Name Type
Address Type
Codelist Type
String
Thing Type
IMI Individual Form
Person Type
Name
Gender
Address
Affiliation
Name Type
Name
Address Type
Notation
Zip-code
String
String
String
String
Name
Address
Org.
Person
Date
Event Participation Type
Participant
Date
Design Schema
Remove unnecessary items
Add necessary items
Roles of IMI
• Structured concept dictionary
– Concept dictionary
• Terms as notation of concepts
– The entry is concept, not term
• Class concept and relation concept
• General-specific relation
– Structured dictionary
• Concepts form a network of concepts which in tern represents
meaning of individual concepts
• A class concept consists of relation concepts representing
attributes and general/specific relations
• A relation concept consists of class concepts connected as
domains and ranges and general/specific relations
• Template for schemata
– Add or remove items for the specific needs
Use of IMI
• Define the concept model
• “Serialize” it into specific “physical” forms
• Use suitable a physical form
IMI Concept
Model
RDF XML
Natural
Language Form
For Open Data For data exchange For spread sheets and documents
• Relax definition
• Interoperability
with other open data
schemata
• Strict definition
• Interoperability with DB
schemata
• Relax definition with simple
structure
• Readability by humans
IMI Core vocabulary v2.2
• Published on Feb.3 2015
• 48 core class terms
– person, address, facility, location, date, …
• 206 core property terms
– name of person, birth date, birth country, …
• Multi format
– rdf schema, xml schema
and documents for human
http://imi.ipa.go.jp/ns/core/2/ 14
Class definition (person class)
person 人
説明:人の情報を表現するためのデータ型 Data Type to describe a person
継承(inherit from) : ic:実体型
property Data type cardinality 説明 (ja) Description (en)
ID ID ic:ID型 0..n ID Identification of a Person
Name of person 氏名 ic:氏名型 0..n 氏名 Name of a Person
Gender 性別 xsd:string 0..1 性別の表記 Gender of a Person
Gender code 性別コード ic:コード型 0..1 性別コード Gender of a Person
Birth date 生年月日 ic:日付型 0..1 生年月日 Date of Birth of a Person
Death date 死亡年月日 ic:日付型 0..1 死亡年月日 Date of Death of a Person
Residence address 住所 ic:住所型 0..n 現住所 Present address of a Person
Domicile of origin 本籍 ic:住所型 0..1 本籍 Legal residence address of a Person
Contact information 連絡先 ic:連絡先型 0..n 連絡先 Contact information of a Person
Nationality 国籍 xsd:string 0..n 国籍の表記
A county that assigns rights, duties, and
privileges to a person because of the birth or
naturalization of the person in that country.
Nationality code 国籍コード ic:コード型 0..n
住民基本台帳で利用さ
れている国籍コード
A county that assigns rights, duties, and
privileges to a person because of the birth or
naturalization of the person in that country.
Birth country 出生国 xsd:string 0..1 生まれた国名 A location where a person was born.
Birth country code 出生国コード ic:コード型 0..1 生まれた国のコード A location where a person was born.
Birth place 出生地 ic:住所型 0..1 生まれた場所 A location where a person was born.
16
Class Structure
person 人
name ic:氏名型
Contact ic:連絡先型
: :
氏名
Family name xsd:string
Romanized Family name xsd:string
: :
contact 連絡先
Phone number ic:電話番号型
Address ic:住所型
: :
電話番号
: :
address 住所
Country xsd:string
Prefecture xsd:string
: :
 A class term has a property term as a sub element and the property term can refer a class
term. Again, the class term has a list of property terms. That constructs a layered structure
of terms as the following figure.
phone number
name
Concept of the IMI framework
International interoperability is highly
considered in preparing IMI.
Core
Vocabulary
Shelter
Location
Hospital
Station
Geographical Space
/Facilities
Transportation
Disaster
Prevention
Finance
Domain-specific
Vocabularies
Disaster
Restoration
Cost
Cross Domain
Vocabulary
IMI
Japanese
Local
government
Standard
(APPLIC)
DE fact
Standards
(DC, foaf,
etc)
NIEM
(US)
ISA
(EU)
Schema.org
18
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Matching of meaning tends to be very abstract
discussion
Concept
reference
Ontology
Real world
Concept
reference
?
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Matching of meaning tends to be very abstract
discussion
– Matching of references is easier
Concept
reference
Ontology
Real world
Concept
reference
?
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Syntactical mapping vs. semantic mapping
• Just consider what it refers in the real world, not how it
is represented in systems.
Concept
reference
Ontology
Concept
reference
?
Systems World
Cognitive World
Person
person 人
説明:人の情報を表現するためのデータ型 Data Type to describe a
person
継承(inherit from) : ic:実体型
prop
erty
Data
type
cardi
nalit
y
説明 (ja) Description (en)
ID ID ic:ID型 0..n ID Identification of a Person
Name of
person
氏名
ic:氏名
型
0..n 氏名 Name of a Person
Gender 性別
xsd:strin
g
0..1 性別の表記 Gender of a Person
ender code
性別
コード
ic:コード
型
0..1 性別コード Gender of a Person
Birth date
生年月
日
ic:日付
型
0..1 生年月日 Date of Birth of a Person
Death date
死亡年
月日
ic:日付
型
0..1 死亡年月日 Date of Death of a Person
Residence
address
住所
ic:住所
型
0..n 現住所 Present address of a Person
Domicile of
origin
本籍
ic:住所
型
0..1 本籍
Legal residence address of a
Person
Contact
nformation
連絡先
ic:連絡
先型
0..n 連絡先
Contact information of a
Person
Nationality 国籍
xsd:strin
g
0..n 国籍の表記
A county that assigns rights,
duties, and privileges to a
person because of the birth or
naturalization of the person in
that country.
住民基本台帳
A county that assigns rights,
duties, and privileges to a
?
?
Systems World
Cognitive World
Postal Code
?
?
“101-8430” ^^xsd:string “SW1A 0AA”@en
(postal code in Japan) (postal code in Europe)
Systems World
Cognitive World
Semantic Mapping
• Semantic Mapping
– Mapping on the cognitive layer
– Two ways of judging mapping
• Extensional Mapping
– Check whether ‘things’ are shared
– e.g., person
– Mostly for Class Mapping
• Intensional Mapping
– Check whether ‘values’ are shared
– e.g., postal-code
– Mostly for Property Mapping
• Syntactical Mapping
– Mapping on the systems layer
Types of matching: SKOS
• Exact Match
• Close Match
• Broad/Narrow Match
• Related Match
Close match
• Close match: nearly matched but not exactly
matched.
• Extensional mapping
– Coverage of ‘things’ are overlapped so much
• Coverage of ‘Country’ is slightly different
– ‘things’ are close
• Reference of ‘Person’ is slightly different (person vs. legal
Person)
• Intensional mapping
– Coverage of ‘values’ are overlapped so much
Broad match/narrow match
• Broad/narrow match
– One subsumes the other
• Extensional mapping
– Coverage of ‘things’ are subsumed, i.e., the subset
is exact match
• Intensional mapping
– Coverage of ‘values’ are subsumed, i.e., the subset
is exact match
More different matching
• Complicated match
– An element of a system matches a combination of
two or more elements.
– “Pathway” match
• A single property matches the combination of two or
more properties
– “Conditional” match
• An element matches the other element if some
condition is hold
IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者
LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 It is exact match if the value of ic:住所.種別 should be "登記住所".
Results
Core Vocabulary Identifier Link Mapping relation Data model Identifier
Address Link Has exact match IMI ic:住所型
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:町名
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:丁目
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地補足
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:号
AddressAddressID Link Has exact match IMI ic:住所型.ic:ID
AddressAdminUnitL1 Link Has exact match IMI ic:住所型.ic:国
AddressAdminUnitL2 Link Has narrow match IMI ic:住所型.ic:都道府県
AddressFullAddress Link Has exact match IMI ic:住所型.ic:表記
AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:ビル番号
AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:部屋番号
AddressLocatorName Link Has narrow match IMI ic:住所型.ic:ビル名
AddressPOBox Link Has related match IMI ic:住所型.ic:方書
AddressPostCode Link Has exact match IMI ic:住所型.ic:郵便番号
AddressPostName Link Has narrow match IMI ic:住所型.ic:市区町村
AddressPostName Link Has narrow match IMI ic:住所型.ic:区
AddressThoroughfare Link Has no match IMI
Agent Link Has exact match IMI ic:実体型
Results
Identifier Link Has exact match IMI ic:ID型
IdentifierIdentifier Link Has exact match IMI ic:ID型.ic:識別値
IdentifierIssueDate Link Has no match IMI
IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者
IdentifierIssuingAuthorityURI Link Has exact match IMI ic:ID型.ic:ID体系.ic:URI
IdentifierType Link Has no match IMI
JurisdictionIdentifier Link Has related match IMI ic:国籍コード
JurisdictionName Link Has related match IMI ic:国籍
LegalEntity Link Has exact match IMI ic:法人型
LegalEntityAddress Link Has broad match IMI ic:法人型.ic:住所
LegalEntityAlternativeName Link Has no match IMI
LegalEntityCompanyActivity Link Has close match IMI ic:法人型.ic:事業種目
LegalEntityCompanyStatus Link Has related match IMI ic:法人型.ic:活動状況
LegalEntityCompanyType Link Has exact match IMI ic:法人型.ic:組織種別
LegalEntityIdentifier Link Has exact match IMI ic:法人型.ic:ID
LegalEntityLegalIdentifier Link Has no match IMI
LegalEntityLegalName Link Has broad match IMI ic:法人型.ic:名称.表記
LegalEntityLocation Link Has related match IMI ic:法人型.ic:地物.説明
LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所
Location Link Has exact match IMI ic:場所型
LocationAddress Link Has exact match IMI ic:場所型.ic:住所
LocationGeographicIdentifier Link Has broad match IMI ic:場所型.ic:地理識別子
LocationGeographicName Link Has exact match IMI ic:場所型.ic:名称.ic:表記
LocationGeometry Link Has exact match IMI ic:場所型.ic:地理座標
Results
Person Link Has exact match IMI ic:人型
PersonAddress Link Has exact match IMI ic:人型.ic:住所
PersonAlternativeName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名
PersonBirthName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名
PersonCitizenship Link Has no match IMI
PersonCountryOfBirth Link Has exact match IMI ic:人型.ic:出生国
PersonCountryOfDeath Link Has no match IMI
PersonDateOfBirth Link Has exact match IMI ic:人型.ic:生年月日
PersonDateOfDeath Link Has exact match IMI ic:人型.ic:死亡年月日
PersonFamilyName Link Has exact match IMI ic:人型.ic:氏名.ic:姓
PersonFullName Link Has exact match IMI ic:人型.ic:氏名.ic:姓名
PersonGender Link Has exact match IMI ic:人型.ic:性別コード
PersonGivenName Link Has exact match IMI ic:人型.ic:氏名.ic:名
PersonIdentifier Link Has broad match IMI ic:人型.ic:ID
PersonPatronymicName Link Has no match IMI ic:人型.ic:氏名.ic:姓名
PersonPlaceOfBirth Link Has narrow match IMI ic:人型.ic:出生地
Bridging core and domain vocabularies
(working in progress)
• Aim: Core vocabulary would be extended to
domain vocabularies
– Agriculture
– Finance
– Traffic
– …
• Task:
– Can concepts be shared between core and domains?
really?
Agricultural Activity Ontology (AAO)
Agricultural activity
crop production activity
activity for propagation
activity in the vegetative growth stage
activity in the reproductive growth stage
activity for environment control
activity for soil control
activity for climate control
activity for water control
activity for biotic control
activity for chemical control
post production activity
activity for harvesting
activity for processing
activity for extending shelf-life
activity for wrapping
indirect activity
activity for preparing materials
activity for cleaning
activity for transport
activity for monitoring
activity for maintaining farm equipment
administrative activity
activity for business administration
http://cavoc.org/aao/
An example: “activity” (and “event”)
• S: (n) activity (any specific behavior) "they avoided all recreational activity"
– direct hyponym / full hyponym
– direct hypernym / inherited hypernym / sister term
• S: (n) act, deed, human action, human activity (something that people do or cause to happen)
– S: (n) event (something that happens at a given place and time)
– [WordNet]
• Each activity is a Happening which involves volition and participants. It has
temporal dimension. It is distinguished from Events by the fact that the activity
does not trigger change of state and does not have a conceptual end point.
– [PROTON Extent module (a lightweight upper-level ontology)]
• Activity: This class represents the abstract content of an event, which may be
repeated many times, once or never. For example a training course, or a play.
– [The Event Programme Vocabulary (prog)]
• E5 Event
– Subclass of: E4 Period
– Superclass of: E7 Activity, E63 Beginning of Existence, E64 End of Existence
• E7 Activity
– Subclass of: E5 Event
– Superclass of: E8 Acquisition, E9 Move, E10 Transfer of Custody, E11 Modification,
E13 Attribute Assignment, E65 Creation …
– [CIDOC Conceptual Reference Model]
Summary
• Sharing concepts is a very long way
• No ground truth
– Step-by-step understanding of the world
– Careful consensus making
• More flexible framework is needed
– Simple mapping is not so happy

Weitere ähnliche Inhalte

Andere mochten auch

Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Amrita Prasad
 
Marketing na Internet
Marketing na InternetMarketing na Internet
Marketing na Internetrenatofrigo
 
Mitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaMitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaAdmingac
 
Roteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasRoteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasAntonio Marcos Montai Messias
 
Problemas de aprendizaje
Problemas de aprendizajeProblemas de aprendizaje
Problemas de aprendizajeLISS
 
Seo proposal for tensator group
Seo proposal for tensator groupSeo proposal for tensator group
Seo proposal for tensator groupParixit Dwivedi
 
Aplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposAplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposZadecon
 
Conferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineConferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineDario Schilman
 

Andere mochten auch (14)

corripio
corripio corripio
corripio
 
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
 
Sigmund freud obras completas - lopez ballesteros
Sigmund freud   obras completas - lopez ballesterosSigmund freud   obras completas - lopez ballesteros
Sigmund freud obras completas - lopez ballesteros
 
Augmenter la satisfaction de l'utilisateur
Augmenter la satisfaction de l'utilisateurAugmenter la satisfaction de l'utilisateur
Augmenter la satisfaction de l'utilisateur
 
Manual dqp
Manual dqpManual dqp
Manual dqp
 
Estrategia nal. obesidad 1
Estrategia nal. obesidad 1Estrategia nal. obesidad 1
Estrategia nal. obesidad 1
 
Marketing na Internet
Marketing na InternetMarketing na Internet
Marketing na Internet
 
Mitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaMitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de pareja
 
Roteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasRoteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de compras
 
Problemas de aprendizaje
Problemas de aprendizajeProblemas de aprendizaje
Problemas de aprendizaje
 
Seo proposal for tensator group
Seo proposal for tensator groupSeo proposal for tensator group
Seo proposal for tensator group
 
Aplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposAplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y Tiempos
 
Conferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineConferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda online
 
Mai2010 einladung doktorandenkolloquium
Mai2010 einladung doktorandenkolloquiumMai2010 einladung doktorandenkolloquium
Mai2010 einladung doktorandenkolloquium
 

Ähnlich wie Some thoughts about the gaps across languages and domains through the experience on building the core common vocabularies

141021 ipa global vocab
141021 ipa global vocab141021 ipa global vocab
141021 ipa global vocabKenji Hiramoto
 
Semtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSemtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSaltlux Inc.
 
SemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSaltlux Inc.
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.dri_ireland
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)Marina Santini
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...tmra
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application ProfilesDiane Hillmann
 
PPT slides
PPT slidesPPT slides
PPT slidesbutest
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Leon Derczynski
 
Text mining introduction-1
Text mining   introduction-1Text mining   introduction-1
Text mining introduction-1Sumit Sony
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsSeth Grimes
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsAndre Freitas
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert Systemguest4513a7
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert SystemMediabistro
 
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesLeveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesNandana Mihindukulasooriya
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applicationsVasileios Lampos
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Bhaskar Mitra
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceLeon Derczynski
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Bianca Pereira
 

Ähnlich wie Some thoughts about the gaps across languages and domains through the experience on building the core common vocabularies (20)

141021 ipa global vocab
141021 ipa global vocab141021 ipa global vocab
141021 ipa global vocab
 
Semtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSemtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony Lee
 
SemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony Lee
 
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application Profiles
 
PPT slides
PPT slidesPPT slides
PPT slides
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
 
Text mining introduction-1
Text mining   introduction-1Text mining   introduction-1
Text mining introduction-1
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content Analytics
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert System
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert System
 
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesLeveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applications
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)
 

Mehr von National Institute of Informatics (NII)

趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)National Institute of Informatics (NII)
 
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜National Institute of Informatics (NII)
 
セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築National Institute of Informatics (NII)
 
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ National Institute of Informatics (NII)
 
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataPresenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataNational Institute of Informatics (NII)
 
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜National Institute of Informatics (NII)
 
研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向についてNational Institute of Informatics (NII)
 

Mehr von National Institute of Informatics (NII) (20)

趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
 
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
 
"分人"型社会とAI
"分人"型社会とAI"分人"型社会とAI
"分人"型社会とAI
 
セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築
 
研究オープンデータにおける大学と研究者の役割
研究オープンデータにおける大学と研究者の役割研究オープンデータにおける大学と研究者の役割
研究オープンデータにおける大学と研究者の役割
 
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
 
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataPresenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
 
Crop vocabulary (CVO): Core vocabulary of crop names
Crop vocabulary (CVO): Core vocabulary of crop namesCrop vocabulary (CVO): Core vocabulary of crop names
Crop vocabulary (CVO): Core vocabulary of crop names
 
ORCIDとオープンサイエンス
ORCIDとオープンサイエンスORCIDとオープンサイエンス
ORCIDとオープンサイエンス
 
How to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity OntologyHow to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity Ontology
 
LODとオープンデータ (DBpediaとIMIの周辺を中心に)
LODとオープンデータ(DBpediaとIMIの周辺を中心に)LODとオープンデータ(DBpediaとIMIの周辺を中心に)
LODとオープンデータ (DBpediaとIMIの周辺を中心に)
 
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
 
Working with Global Infrastructure at a National Level
Working with Global Infrastructure at a National LevelWorking with Global Infrastructure at a National Level
Working with Global Infrastructure at a National Level
 
Activities of JaLC as a national service
Activities of JaLC as a national serviceActivities of JaLC as a national service
Activities of JaLC as a national service
 
Development and Application of Agriculture Ontologies
Development and Application of Agriculture Ontologies Development and Application of Agriculture Ontologies
Development and Application of Agriculture Ontologies
 
Design Process of Agriculture Ontologies
Design Process of Agriculture OntologiesDesign Process of Agriculture Ontologies
Design Process of Agriculture Ontologies
 
AIの未来 ~技術と社会の関係のダイナミクス~
AIの未来~技術と社会の関係のダイナミクス~AIの未来~技術と社会の関係のダイナミクス~
AIの未来 ~技術と社会の関係のダイナミクス~
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 
研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について
 
オープンサイエンスとオープンデータ
オープンサイエンスとオープンデータオープンサイエンスとオープンデータ
オープンサイエンスとオープンデータ
 

Kürzlich hochgeladen

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Some thoughts about the gaps across languages and domains through the experience on building the core common vocabularies

  • 1. Some thoughts about the gaps across languages and domains through the experience on building the core common vocabularies Hideaki Takeda National Institute of Informatics takeda@nii.ac.jp Glocal KO Workshop, Thursday August 13, 2015, Copenhagen
  • 2. Who am I? Hideaki Takeda, Dr., Eng. • Professor, National Institute of Informatics – Research Institute mainly for Computer Science • Background: Computer Science, in particular, Artificial Intelligence • Current interest: Semantic Web, Ontology, Linked Open Data (LOD), Social Media Analysis • Social activities – President, Linked Open Data Initiative (NPO) – Founder, Dbpedia Japanese Chapter – Specialist, Information-technology Promotion Agency, Japan (IPA) – Chair, Japan Link Center (Registration Agency of International DOI Foundation) – Board, ORCID
  • 3. Core Vocabularies • Background – Everything is on infosphere, i.e., web – Lots of information, lots of data, lots of systems • Problems – Misunderstanding/mis-matching/”missing links“ across different domains – Gap between human and machines (computers)
  • 4. Core Vocabularies • Aim – Increase interoperability of information/data – Bridge human and machine understanding • Target – Governmental documents/data • Method – Define a set of concepts which bridge (human- readable) terms and (computer-processable) symbols (URIs) – Starting from the most common concepts
  • 5. Core Vocabularies • Activities worldwide – USA: NIEM Core • NIEM (National Information Exchange Model) – Europe: ISA Core Vocabularies – UN: United Nations Centre for Trade Facilitation and Electronic Business (UN/CEFACT) • Core Components Library (UN/CCL) – Japan: IMI Core Vocabulary
  • 6.
  • 10. IMI Project • Supported by – Ministry of Economy, Trade, and Industry, Japan • Technical Framework – Data Model – Core Vocabulary – Design Rules • Support Framework – Tools • for data developer • for schema developer – Database • schema / tools / templates/ … Person Type Name Gender Gender Code Birth Date Address … Name Type Type Name Family Name Given Name … Address Type Type Notation Zip Code Prefecture City … String String String Code TypeString String String String String String Code Type Type Value Name Type Address Type Codelist Type String Thing Type 10
  • 11. IMI as a template for schema Registration form for Confere Name: Address: Gender: Affiliation: Affiliation Address: Attending date: - M / Person Type Name Gender Gender Code Birth Date Address … Name Type Type Name Family Name Given Name … Address Type Type Notation Zip Code Prefecture City … String String String Code Type String String String String String String Code Type Type Value Name Type Address Type Codelist Type String Thing Type IMI Individual Form Person Type Name Gender Address Affiliation Name Type Name Address Type Notation Zip-code String String String String Name Address Org. Person Date Event Participation Type Participant Date Design Schema Remove unnecessary items Add necessary items
  • 12. Roles of IMI • Structured concept dictionary – Concept dictionary • Terms as notation of concepts – The entry is concept, not term • Class concept and relation concept • General-specific relation – Structured dictionary • Concepts form a network of concepts which in tern represents meaning of individual concepts • A class concept consists of relation concepts representing attributes and general/specific relations • A relation concept consists of class concepts connected as domains and ranges and general/specific relations • Template for schemata – Add or remove items for the specific needs
  • 13. Use of IMI • Define the concept model • “Serialize” it into specific “physical” forms • Use suitable a physical form IMI Concept Model RDF XML Natural Language Form For Open Data For data exchange For spread sheets and documents • Relax definition • Interoperability with other open data schemata • Strict definition • Interoperability with DB schemata • Relax definition with simple structure • Readability by humans
  • 14. IMI Core vocabulary v2.2 • Published on Feb.3 2015 • 48 core class terms – person, address, facility, location, date, … • 206 core property terms – name of person, birth date, birth country, … • Multi format – rdf schema, xml schema and documents for human http://imi.ipa.go.jp/ns/core/2/ 14
  • 15.
  • 16. Class definition (person class) person 人 説明:人の情報を表現するためのデータ型 Data Type to describe a person 継承(inherit from) : ic:実体型 property Data type cardinality 説明 (ja) Description (en) ID ID ic:ID型 0..n ID Identification of a Person Name of person 氏名 ic:氏名型 0..n 氏名 Name of a Person Gender 性別 xsd:string 0..1 性別の表記 Gender of a Person Gender code 性別コード ic:コード型 0..1 性別コード Gender of a Person Birth date 生年月日 ic:日付型 0..1 生年月日 Date of Birth of a Person Death date 死亡年月日 ic:日付型 0..1 死亡年月日 Date of Death of a Person Residence address 住所 ic:住所型 0..n 現住所 Present address of a Person Domicile of origin 本籍 ic:住所型 0..1 本籍 Legal residence address of a Person Contact information 連絡先 ic:連絡先型 0..n 連絡先 Contact information of a Person Nationality 国籍 xsd:string 0..n 国籍の表記 A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. Nationality code 国籍コード ic:コード型 0..n 住民基本台帳で利用さ れている国籍コード A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. Birth country 出生国 xsd:string 0..1 生まれた国名 A location where a person was born. Birth country code 出生国コード ic:コード型 0..1 生まれた国のコード A location where a person was born. Birth place 出生地 ic:住所型 0..1 生まれた場所 A location where a person was born. 16
  • 17. Class Structure person 人 name ic:氏名型 Contact ic:連絡先型 : : 氏名 Family name xsd:string Romanized Family name xsd:string : : contact 連絡先 Phone number ic:電話番号型 Address ic:住所型 : : 電話番号 : : address 住所 Country xsd:string Prefecture xsd:string : :  A class term has a property term as a sub element and the property term can refer a class term. Again, the class term has a list of property terms. That constructs a layered structure of terms as the following figure. phone number name
  • 18. Concept of the IMI framework International interoperability is highly considered in preparing IMI. Core Vocabulary Shelter Location Hospital Station Geographical Space /Facilities Transportation Disaster Prevention Finance Domain-specific Vocabularies Disaster Restoration Cost Cross Domain Vocabulary IMI Japanese Local government Standard (APPLIC) DE fact Standards (DC, foaf, etc) NIEM (US) ISA (EU) Schema.org 18
  • 19. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Matching of meaning tends to be very abstract discussion Concept reference Ontology Real world Concept reference ?
  • 20. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Matching of meaning tends to be very abstract discussion – Matching of references is easier Concept reference Ontology Real world Concept reference ?
  • 21. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Syntactical mapping vs. semantic mapping • Just consider what it refers in the real world, not how it is represented in systems. Concept reference Ontology Concept reference ? Systems World Cognitive World
  • 22. Person person 人 説明:人の情報を表現するためのデータ型 Data Type to describe a person 継承(inherit from) : ic:実体型 prop erty Data type cardi nalit y 説明 (ja) Description (en) ID ID ic:ID型 0..n ID Identification of a Person Name of person 氏名 ic:氏名 型 0..n 氏名 Name of a Person Gender 性別 xsd:strin g 0..1 性別の表記 Gender of a Person ender code 性別 コード ic:コード 型 0..1 性別コード Gender of a Person Birth date 生年月 日 ic:日付 型 0..1 生年月日 Date of Birth of a Person Death date 死亡年 月日 ic:日付 型 0..1 死亡年月日 Date of Death of a Person Residence address 住所 ic:住所 型 0..n 現住所 Present address of a Person Domicile of origin 本籍 ic:住所 型 0..1 本籍 Legal residence address of a Person Contact nformation 連絡先 ic:連絡 先型 0..n 連絡先 Contact information of a Person Nationality 国籍 xsd:strin g 0..n 国籍の表記 A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. 住民基本台帳 A county that assigns rights, duties, and privileges to a ? ? Systems World Cognitive World
  • 23. Postal Code ? ? “101-8430” ^^xsd:string “SW1A 0AA”@en (postal code in Japan) (postal code in Europe) Systems World Cognitive World
  • 24. Semantic Mapping • Semantic Mapping – Mapping on the cognitive layer – Two ways of judging mapping • Extensional Mapping – Check whether ‘things’ are shared – e.g., person – Mostly for Class Mapping • Intensional Mapping – Check whether ‘values’ are shared – e.g., postal-code – Mostly for Property Mapping • Syntactical Mapping – Mapping on the systems layer
  • 25. Types of matching: SKOS • Exact Match • Close Match • Broad/Narrow Match • Related Match
  • 26. Close match • Close match: nearly matched but not exactly matched. • Extensional mapping – Coverage of ‘things’ are overlapped so much • Coverage of ‘Country’ is slightly different – ‘things’ are close • Reference of ‘Person’ is slightly different (person vs. legal Person) • Intensional mapping – Coverage of ‘values’ are overlapped so much
  • 27. Broad match/narrow match • Broad/narrow match – One subsumes the other • Extensional mapping – Coverage of ‘things’ are subsumed, i.e., the subset is exact match • Intensional mapping – Coverage of ‘values’ are subsumed, i.e., the subset is exact match
  • 28. More different matching • Complicated match – An element of a system matches a combination of two or more elements. – “Pathway” match • A single property matches the combination of two or more properties – “Conditional” match • An element matches the other element if some condition is hold IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者 LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 It is exact match if the value of ic:住所.種別 should be "登記住所".
  • 29. Results Core Vocabulary Identifier Link Mapping relation Data model Identifier Address Link Has exact match IMI ic:住所型 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:町名 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:丁目 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地補足 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:号 AddressAddressID Link Has exact match IMI ic:住所型.ic:ID AddressAdminUnitL1 Link Has exact match IMI ic:住所型.ic:国 AddressAdminUnitL2 Link Has narrow match IMI ic:住所型.ic:都道府県 AddressFullAddress Link Has exact match IMI ic:住所型.ic:表記 AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:ビル番号 AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:部屋番号 AddressLocatorName Link Has narrow match IMI ic:住所型.ic:ビル名 AddressPOBox Link Has related match IMI ic:住所型.ic:方書 AddressPostCode Link Has exact match IMI ic:住所型.ic:郵便番号 AddressPostName Link Has narrow match IMI ic:住所型.ic:市区町村 AddressPostName Link Has narrow match IMI ic:住所型.ic:区 AddressThoroughfare Link Has no match IMI Agent Link Has exact match IMI ic:実体型
  • 30. Results Identifier Link Has exact match IMI ic:ID型 IdentifierIdentifier Link Has exact match IMI ic:ID型.ic:識別値 IdentifierIssueDate Link Has no match IMI IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者 IdentifierIssuingAuthorityURI Link Has exact match IMI ic:ID型.ic:ID体系.ic:URI IdentifierType Link Has no match IMI JurisdictionIdentifier Link Has related match IMI ic:国籍コード JurisdictionName Link Has related match IMI ic:国籍 LegalEntity Link Has exact match IMI ic:法人型 LegalEntityAddress Link Has broad match IMI ic:法人型.ic:住所 LegalEntityAlternativeName Link Has no match IMI LegalEntityCompanyActivity Link Has close match IMI ic:法人型.ic:事業種目 LegalEntityCompanyStatus Link Has related match IMI ic:法人型.ic:活動状況 LegalEntityCompanyType Link Has exact match IMI ic:法人型.ic:組織種別 LegalEntityIdentifier Link Has exact match IMI ic:法人型.ic:ID LegalEntityLegalIdentifier Link Has no match IMI LegalEntityLegalName Link Has broad match IMI ic:法人型.ic:名称.表記 LegalEntityLocation Link Has related match IMI ic:法人型.ic:地物.説明 LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 Location Link Has exact match IMI ic:場所型 LocationAddress Link Has exact match IMI ic:場所型.ic:住所 LocationGeographicIdentifier Link Has broad match IMI ic:場所型.ic:地理識別子 LocationGeographicName Link Has exact match IMI ic:場所型.ic:名称.ic:表記 LocationGeometry Link Has exact match IMI ic:場所型.ic:地理座標
  • 31. Results Person Link Has exact match IMI ic:人型 PersonAddress Link Has exact match IMI ic:人型.ic:住所 PersonAlternativeName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名 PersonBirthName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名 PersonCitizenship Link Has no match IMI PersonCountryOfBirth Link Has exact match IMI ic:人型.ic:出生国 PersonCountryOfDeath Link Has no match IMI PersonDateOfBirth Link Has exact match IMI ic:人型.ic:生年月日 PersonDateOfDeath Link Has exact match IMI ic:人型.ic:死亡年月日 PersonFamilyName Link Has exact match IMI ic:人型.ic:氏名.ic:姓 PersonFullName Link Has exact match IMI ic:人型.ic:氏名.ic:姓名 PersonGender Link Has exact match IMI ic:人型.ic:性別コード PersonGivenName Link Has exact match IMI ic:人型.ic:氏名.ic:名 PersonIdentifier Link Has broad match IMI ic:人型.ic:ID PersonPatronymicName Link Has no match IMI ic:人型.ic:氏名.ic:姓名 PersonPlaceOfBirth Link Has narrow match IMI ic:人型.ic:出生地
  • 32. Bridging core and domain vocabularies (working in progress) • Aim: Core vocabulary would be extended to domain vocabularies – Agriculture – Finance – Traffic – … • Task: – Can concepts be shared between core and domains? really?
  • 33. Agricultural Activity Ontology (AAO) Agricultural activity crop production activity activity for propagation activity in the vegetative growth stage activity in the reproductive growth stage activity for environment control activity for soil control activity for climate control activity for water control activity for biotic control activity for chemical control post production activity activity for harvesting activity for processing activity for extending shelf-life activity for wrapping indirect activity activity for preparing materials activity for cleaning activity for transport activity for monitoring activity for maintaining farm equipment administrative activity activity for business administration http://cavoc.org/aao/
  • 34. An example: “activity” (and “event”) • S: (n) activity (any specific behavior) "they avoided all recreational activity" – direct hyponym / full hyponym – direct hypernym / inherited hypernym / sister term • S: (n) act, deed, human action, human activity (something that people do or cause to happen) – S: (n) event (something that happens at a given place and time) – [WordNet] • Each activity is a Happening which involves volition and participants. It has temporal dimension. It is distinguished from Events by the fact that the activity does not trigger change of state and does not have a conceptual end point. – [PROTON Extent module (a lightweight upper-level ontology)] • Activity: This class represents the abstract content of an event, which may be repeated many times, once or never. For example a training course, or a play. – [The Event Programme Vocabulary (prog)] • E5 Event – Subclass of: E4 Period – Superclass of: E7 Activity, E63 Beginning of Existence, E64 End of Existence • E7 Activity – Subclass of: E5 Event – Superclass of: E8 Acquisition, E9 Move, E10 Transfer of Custody, E11 Modification, E13 Attribute Assignment, E65 Creation … – [CIDOC Conceptual Reference Model]
  • 35. Summary • Sharing concepts is a very long way • No ground truth – Step-by-step understanding of the world – Careful consensus making • More flexible framework is needed – Simple mapping is not so happy