SlideShare ist ein Scribd-Unternehmen logo
1 von 38
DODDLE-OWL: A Domain Ontology
Construction Tool with OWL
Takeshi Morita 1)
,Naoki Fukuta 2)
, Noriaki Izumi 3)
,
and Takahira Yamaguchi 1)
1) Keio University, Japan
2) Shizuoka University, Japan
3) National Institute of AIST, Japan
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Motivation
• Background
– The role of domain ontologies is important for the
Semantic Web
– Sharing common understanding among people and
software agents
– Finding appropriate information on the web
• Issues of large cost with building up domain
ontologies
– Many concepts in a domain
– Each concept has high specific meaning
– We need knowledge of domain experts
– Cost-benefit performance of domain ontologies is
lower than that of general ontologies (e.g. WordNet,
EDR)
Semi-Automatic Construction
Set of
Concept Pairs
Quality Refinement
Domain
Specific
Documents
(English or
Japanese)
A Domain Ontology
(OWL format)
Initial
Concept Hierarchy
Translation
EDR
(general)
EDR
(technical)
WordNet
General
Ontologies
Our Goal
User (domain expert)
Taxonomic
Relationships
Non-Taxonomic
Relationships
DODDLE-OWL:
a Domain Ontology rapiD
DeveLopment Environment
– OWL extension
Focusing the quality refinement
phase of ontology construction
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Related WorksMehrnoush Shamsfard, Ahmad Abdollahzadeh Barforoush,
The State of the Art in Ontology Learning: A Framework for Comparison
Learning System Element (s)
Learned
Prior Knowledge Input
DODDLE-OWL
(Keio University)
Taxonomic and non-taxonomic
conceptual Relations
WordNet, EDR Unstructured domain specific
texts (English and Japanese)
ASIUM
(Paris-Sud
University )
Verb subcat.
Frames + hierarchies
Linguistic K. Unstructured
(corpora) ( French )
HASTI
(Amir Kabir University
of Technology )
Words, Concepts, Taxonomic
and Non-Taxonomic
conceptual relations, axioms
Almost empty
(small kernel)
Unstructured NL texts
(Persian )
SVETLAN’
(CNRS laboratory)
Noun Classes Structured + Unstructured
input to SEGAPSITH
(French )
SYNDIKATE
(University of Albert-
Ludwigs)
Words , Concepts, Taxonomic
and Non-Taxonomic
conceptual relations
Generic and
domain lexicons
and ontologies
Unstructured   NL texts
(German )
TEXT-TO-ONTO
(University of
Karlsruhe)
Concepts, Taxonomic and Non-
Taxonomic conceptual
relations
Lexical DB +
domain lexicon
NL texts , Web docs,
Semi-structured (XML, DTD)
and structured
(German , HTML , XML,
DTD )
WEB→KB Instances of classes and The ontology for An ontology + Training
Support to construct
Taxonomic and Non-Taxonomic
Relationships.
Learning System Degree of
Automation
DODDLE-OWL
(Keio University)
User Interaction,
Hand-made
modification
ASIUM
(Paris-Sud University )
Cooperative
HASTI
(Amir Kabir University
of Technology )
Both automatic and
cooperative modes
SVETLAN’
(CNRS laboratory)
Automatic
SYNDIKATE
(University of Albert-
Ludwigs)
Automatic
TEXT-TO-ONTO
(University of
Karlsruhe)
Semi-automatic
interactive,
balanced
cooperative
WEB→KB
(Distributed Systems
Technology Centre)
Automatic
Related Works (cont.)Mehrnoush Shamsfard, Ahmad Abdollahzadeh Barforoush,
The State of the Art in Ontology Learning: A Framework for Comparison
Many ontology learning systems
are focusing on automatic
ontology construction.
The user is difficult to refine the
automatic generated ontologies.
However
Our system is focusing on
high-level support
for user interaction.
The user is easy to refine semi-
automatic generated ontologies
and constructs high quality
domain ontologies.
Therefore
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Set of
Concept Pairs
A Domain Ontology
(OWL format)
Initial
Concept Hierarchy
EDR
(general)
EDR
(technical)
WordNet
General
Ontologies
System Overview
User (domain expert)
Taxonomic
Relationships
Non-Taxonomic
Relationships
Semi-Automatic Construction
Quality Refinement
Domain
Specific
Documents
(English or
Japanese)
Translation
DODDLE-OWL:
a Domain Ontology rapiD
DeveLopment Environment
– OWL extension
Focusing the quality refinement
phase of ontology construction
Documents
Construction Module
Refinement Module
Visualization
Module
Input Module
Translation
Module
Set of
Concept Pairs
A Domain Ontology
(OWL format)
Initial
Concept Hierarchy
EDR
(general)
EDR
(technical)
WordNet
General
Ontologies
System Overview
User (domain expert)
Taxonomic
Relationships
Non-Taxonomic
Relationships
Documents
Construction Module
Refinement Module
Visualization
Module
Input Module
Translation
Module
Procedure of Input Module
WordNet
EDR
(general)
EDR
(technical)
Ontology Selection
Domain specific documents
(English or Japanese)
Input Word Selection
Word POS TF IDF TF-IDF
W1 Noun ….. ….. …..
W2 Complex Word ….. ….. …..
Morphological Analysis
Complex Word Extraction
Document Selection
………………………………………..
Disambiguation
Input Word Set
Input Concepts
W1
W2
W3
EDR (general): Ci
WordNet: Cj
EDR (technical): Ck
………………………………………..
Input Word Set Input Concept Set
Select significant
words for the domain
(input words)
Identify the sense of
input words to map
those words to concepts
in the general ontologies
A Domain Ontology
(OWL format)
EDR
(general)
EDR
(technical)
WordNet
General
Ontologies
User (domain expert)
Taxonomic
Relationships
Non-Taxonomic
Relationships
Documents
Refinement Module
Visualization
Module
Translation
Module
statistic
methods
matching
& trimming
Relationship
Construction
Construction Module
Set of Concept Pairs
Input Concept Selection
Hierarchy
Construction
Association Rule WordSpace
Initial Concept Hierarchy
Input concepts
Input Module
Construction Module
statistic
methods
matching
& trimming
Relationship
Construction
Hierarchy
Construction
Association
Rule
WordSpace
Documents
Set of Concept PairsInitial Concept Hierarchy
Refinement Module
Input Module
Input concepts
EDR
(general)
EDR
(technical)
WordNet
General
Ontologies
Initial Domain Ontology
Hierarchy Construction Module
Taxonomic Relationships
in the general ontologies
Merging
Input Concepts
Trimming
Root Root
unnecessary Internal Node
Best Matched Node
Salient Internal Node
Initial Model initial concept hierarchy
get paths related
to input concepts
generate
an initial model
trimming
Extract Concept Pairs
by different methods
Relationship Construction Module
Input concepts
Matching
Documents
WordSpace
Association
Rule
Set of concept pairs
method based on
context similarity
Popular method
in the field of data mining
• Words and phrases in documents can be expressed
by vector representation containing co-occurrence
  statistics
WordSpace Method
WordSpace  ( Marti A. Hearst, Hinrich Schutze )
… wi … wj …C1 … wk …
… wi … wj …C2 … wk …
Context Similarity between concepts C1 and C2
• Inner products among the vectors work as
the similarity between the words and phrases.
High similarity
Significant related concept pair for the domain
Association Rule
• Find associations between items in a set of
transaction
• In our research
– Each item is an input concept appearing in the
document
– One transaction is one sentence in the document
• Parameters
– Support = contain X and Y / All transaction
– Confidence = contain X and Y / contain X
X and Y: input concepts
statistic
methods
matching
& trimming
Relationship
Construction
Construction Module
Set of Concept Pairs
Input Concept Selection
Hierarchy
Construction
Refinement Module
Association Rule WordSpace
Concept
Specification
Template
Documents
Visualization
Module
A Domain Ontology
(OWL format)
Input Module
Initial Concept Hierarchy
Matched Result
Analysis
Hierarchy
Refinement
Trimmed Result
Analysis
Translation
Module
Input conceptsEDR
(general)
EDR
(technical)
WordNet
General
Ontologies
The value of
co-concurrency
Relationship
Refinement
Concept
Specification
Hierarchy Refinement
Module
Relationship Refinement
Module
Refinement Module
Concept
Hierarchy
Concept Drift
Management
Better performance
by changing parameters
with interaction of a userVisualization Module:
MR3
RDF&RDFS Visual Editing
Translation
Module
Set of Concept PairsInitial Concept Hierarchy
support to refine
the initial concept
hierarchy graphically
Hierarchy Refinement Module -
Concept Drift
A Domain Ontologygeneral ontologies
Reusable Part
No reusable part
because of
concept drift
an initial concept
hierarchy
constructed
The position of particular
concepts changes depending
on the domain.
adjust the initial concept hierarchy
to the specific domain
Concept Drift
Hierarchy Refinement Module
strategy 1 Matched Result Analysis
Point out differences
of abstraction level
among sibling nodes
according to the
Trimmed Result
MOVE
MOVE
MOVE
STAY
A
B C
D
Trimming
Area
Initial Model
0
0 3
B C D
Trimmed Model
A
B C
A
D
strategy 2 Trimmed Result Analysis
Divide the initial concept
hierarchy into reusable area
and not reusable area
according to the position of
Best Matched Nodes
suggest a user to move not
reusable area
Trimming
Reconstructed
by User
Best Matched Node
Internal Node
Reconstructed
by User
Concept Drift Management
Matched Result Analysis
Trimmed Result Analysis
Visualization Module
Visualization Module
Parts of Modification
are highlighted based on
Matched result analysis
and Trimmed Result analysis
Relationship Refinement Module
Non-Taxonomic Relationship
Learning
Non-Taxonomic Relationships
Identify correct pairs from
generated candidates
Setting parameters for
WordSpace and
Association Rule
Construct non-taxonomic
relationships by
considering the relation
with each concept pair
statistic
methods
matching
& trimming
Relationship
Construction
Construction Module
Set of Concept Pairs
Input Concept Selection
Hierarchy
Construction
Refinement Module
Association Rule WordSpace
Concept
Specification
Template
Documents
Visualization
Module
A Domain Ontology
(OWL format)
Input Module
Initial Concept Hierarchy
Matched Result
Analysis
Hierarchy
Refinement
Trimmed Result
Analysis
Translation
Module
Input conceptsEDR
(general)
EDR
(technical)
WordNet
General
Ontologies
The value of
co-concurrency
Relationship
Refinement
Translation Module
owl:Class
owl:Class owl:Class
owl:Class
rdfs:subClassOf
owl:ObjectProperty
rdfs:domain rdfs:range
Taxonomic Relationships Non-Taxonomic Relationships
Export constructed domain ontology in OWL format
relationship
rdfs:subClassOf rdfs:subClassOf
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Implementation Architecture
Java Virtual Machine
Jena2MR3
Visualization
Module
Translation
Module
Construction and
Refinement Module
Input Module
Java WordNet Library (JWNL)
Gensen Sen SS-Tagger
JWNL: http://jwordnet.sourceforge.net/
Gensen: A Complex Word Extraction Tool
Sen: A Japanese morpheme analyzer, http://ultimania.org/sen/
SS-Tagger: English Tagger
MR3
: an RDF & RDFS graphical editor, http://mmm.semanticweb.org/mr3/
Jena Semantic Web Tool Framework: HP Labs, http://jena.sourceforge.net/
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Case Studies
• Purpose
– To check DODDLE-OWL can support the user in
constructing taxonomic and non-taxonomic relationships
• Target Field  
– Particular field of business  
– xCBL (XML Common Business Library)
• http://www.xcbl.org/
• Domain Specific Document
– xCBL Document Description
– about 150 sentences and 2500 words
• Input Concepts
– 57 business concepts from the document
• User
– Not an expert but has business knowledge
Results and Evaluation for
Taxonomic Relationships Construction
Initial
Concept Hierarcy
Business Ontology
Get paths related
to input concepts
from WordNet
Trimming Modification
57
Concepts
Precision Recall
ST1: Matched Result Analysis 5/25(=0.2)
 
5/7(=0.71)
Evaluation of two strategies by the user
82
Concepts
152
Concepts
83
Concepts
The number of concepts in each model
Input Concepts Initial Model
Constructed with Hierarchy Construction Module
Constructed with Hierarchy
Refinement Module
Results and Evaluation for
Non-Taxonomic Relationships Construction
WS AR The union of WS & AR
# Extracted
concept pairs
40 39 66
# Accepted
concept pairs
30 20 39
# Rejected
concept pairs
10 19 27
Precision 0.75 (30/40) 0.51 (20/39) 0.59 (39/66)
Association Rule ( 20/39 )WordSpace ( 30/40 )
8 17
1119 9
2
The union of WS & AR ( 39/66 )
Frequency of
Extracted 4-gram
Context Scope
( before :
after )
threshold
of similarity
2 10:10 0.6
Minimum
Support
Minimum
Confidence
0.7 % 55 %
For WordSpace parametersFor Association Rule parameters
Results and Evaluation for
Non-Taxonomic Relationships Acquisition
WS AR The union of WS & AR
# Extracted
concept pairs
40 39 66
# Accepted
concept pairs
30 20 39
# Rejected
concept pairs
10 19 27
Precision 0.75 (30/40) 0.51 (20/39) 0.59 (39/66)
Association Rule
( 20/39 )
8 17
1119 9
WordSpace ( 30/40 )
2
The union of WS & AR ( 39/66 ) The precision of WS method is good,
but the WS method has its bias
so we cannot get certain types of
concept pairs from it.
we combine two different methods for
getting wider range of concept pairs.
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Demonstration
• Construct taxonomic and non-taxonomic
relationships for xCBL
Contents
• Motivation
• Related Works
• DODDLE-OWL Overview
• Implementation Architecture
• Case Studies
• Demonstration
• Conclusions
Conclusions
• Summary
– DODDLE-OWL: a Domain Ontology rapiD DeveLopment
Environment – OWL extension
• Focusing the quality refinement phase of ontology construction
– Case studies
• construct a domain ontology for xCBL
• Support the user in constructing and refining the domain ontology
• Future Work
– Reuse existing (domain) ontologies in any forms
– Apply DODDLE-OWL to large scale domain ontology
construction
• Rocket operation ontology
• About 40,000 concepts
Thank you for your attention.
DODDLE-OWL has been released .
Please visit this web site, if you like it.
about 100 user now
http://mmm.semanticweb.org/doddle/

Weitere ähnliche Inhalte

Was ist angesagt?

Semantics at Scale: A Distributional Approach
Semantics at Scale: A Distributional ApproachSemantics at Scale: A Distributional Approach
Semantics at Scale: A Distributional ApproachAndre Freitas
 
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...Andre Freitas
 
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...Andre Freitas
 
slides
slidesslides
slidesbutest
 
Improving Document Clustering by Eliminating Unnatural Language
Improving Document Clustering by Eliminating Unnatural LanguageImproving Document Clustering by Eliminating Unnatural Language
Improving Document Clustering by Eliminating Unnatural LanguageJinho Choi
 
Representing and Reasoning with Modular Ontologies (2007)
Representing and Reasoning with Modular Ontologies (2007)Representing and Reasoning with Modular Ontologies (2007)
Representing and Reasoning with Modular Ontologies (2007)Jie Bao
 
Ontologies and Software Modeling: Potentials, Experience and Challenges
Ontologies and Software Modeling: Potentials, Experience and Challenges Ontologies and Software Modeling: Potentials, Experience and Challenges
Ontologies and Software Modeling: Potentials, Experience and Challenges Dragan Gasevic
 
A little more semantics goes a lot further!  Getting more out of Linked Data ...
A little more semantics goes a lot further!  Getting more out of Linked Data ...A little more semantics goes a lot further!  Getting more out of Linked Data ...
A little more semantics goes a lot further!  Getting more out of Linked Data ...Michel Dumontier
 
Word Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology ClassesWord Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology ClassesAndre Freitas
 
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeSchema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeAndre Freitas
 
Developing Ontologies for Collaborative Engineering in Mechatronics
Developing Ontologies for Collaborative Engineering in MechatronicsDeveloping Ontologies for Collaborative Engineering in Mechatronics
Developing Ontologies for Collaborative Engineering in MechatronicsVioleta Damjanovic-Behrendt
 
Parallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corporaParallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corporaHaithem Afli
 
Ontology Building and its Application using Hozo
Ontology Building and its Application using HozoOntology Building and its Application using Hozo
Ontology Building and its Application using HozoKouji Kozaki
 
A Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsA Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsJie Bao
 
Topic Extraction on Domain Ontology
Topic Extraction on Domain OntologyTopic Extraction on Domain Ontology
Topic Extraction on Domain OntologyKeerti Bhogaraju
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementAndre Freitas
 
Towards a mnemonic classification of software languages
Towards a mnemonic classification of software languagesTowards a mnemonic classification of software languages
Towards a mnemonic classification of software languagesMikhail Barash
 
ontology based- data_integration.
ontology based- data_integration.ontology based- data_integration.
ontology based- data_integration.AliAlJadaa
 

Was ist angesagt? (20)

Semantics at Scale: A Distributional Approach
Semantics at Scale: A Distributional ApproachSemantics at Scale: A Distributional Approach
Semantics at Scale: A Distributional Approach
 
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...
How hard is this Query? Measuring the Semantic Complexity of Schema-agnostic ...
 
Ontology-based Classification and Faceted Search Interface for APIs
Ontology-based Classification and Faceted Search Interface for APIsOntology-based Classification and Faceted Search Interface for APIs
Ontology-based Classification and Faceted Search Interface for APIs
 
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
 
slides
slidesslides
slides
 
Improving Document Clustering by Eliminating Unnatural Language
Improving Document Clustering by Eliminating Unnatural LanguageImproving Document Clustering by Eliminating Unnatural Language
Improving Document Clustering by Eliminating Unnatural Language
 
Representing and Reasoning with Modular Ontologies (2007)
Representing and Reasoning with Modular Ontologies (2007)Representing and Reasoning with Modular Ontologies (2007)
Representing and Reasoning with Modular Ontologies (2007)
 
Ontologies and Software Modeling: Potentials, Experience and Challenges
Ontologies and Software Modeling: Potentials, Experience and Challenges Ontologies and Software Modeling: Potentials, Experience and Challenges
Ontologies and Software Modeling: Potentials, Experience and Challenges
 
A little more semantics goes a lot further!  Getting more out of Linked Data ...
A little more semantics goes a lot further!  Getting more out of Linked Data ...A little more semantics goes a lot further!  Getting more out of Linked Data ...
A little more semantics goes a lot further!  Getting more out of Linked Data ...
 
Computer fundamentals
Computer fundamentalsComputer fundamentals
Computer fundamentals
 
Word Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology ClassesWord Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology Classes
 
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeSchema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
 
Developing Ontologies for Collaborative Engineering in Mechatronics
Developing Ontologies for Collaborative Engineering in MechatronicsDeveloping Ontologies for Collaborative Engineering in Mechatronics
Developing Ontologies for Collaborative Engineering in Mechatronics
 
Parallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corporaParallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corpora
 
Ontology Building and its Application using Hozo
Ontology Building and its Application using HozoOntology Building and its Application using Hozo
Ontology Building and its Application using Hozo
 
A Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsA Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description Logics
 
Topic Extraction on Domain Ontology
Topic Extraction on Domain OntologyTopic Extraction on Domain Ontology
Topic Extraction on Domain Ontology
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and Refinement
 
Towards a mnemonic classification of software languages
Towards a mnemonic classification of software languagesTowards a mnemonic classification of software languages
Towards a mnemonic classification of software languages
 
ontology based- data_integration.
ontology based- data_integration.ontology based- data_integration.
ontology based- data_integration.
 

Andere mochten auch

Seronto Domain Ontology Framework
Seronto Domain Ontology FrameworkSeronto Domain Ontology Framework
Seronto Domain Ontology FrameworkNicolas Bertrand
 
Ivan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3CIvan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3Csssw2012
 
FiCloud2016 lov4iot second life ontology
FiCloud2016 lov4iot second life ontologyFiCloud2016 lov4iot second life ontology
FiCloud2016 lov4iot second life ontologyAmélie Gyrard
 
Ontologies and Vocabularies
Ontologies and VocabulariesOntologies and Vocabularies
Ontologies and Vocabulariesseanb
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processingATHMAN HAJ-HAMOU
 
Ontology Powerpoint
Ontology PowerpointOntology Powerpoint
Ontology PowerpointARH_Miller
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebTomek Pluskiewicz
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataAn introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataFabien Gandon
 

Andere mochten auch (9)

Seronto Domain Ontology Framework
Seronto Domain Ontology FrameworkSeronto Domain Ontology Framework
Seronto Domain Ontology Framework
 
Ivan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3CIvan Herman - Semantic Web Activities @ W3C
Ivan Herman - Semantic Web Activities @ W3C
 
FiCloud2016 lov4iot second life ontology
FiCloud2016 lov4iot second life ontologyFiCloud2016 lov4iot second life ontology
FiCloud2016 lov4iot second life ontology
 
Ontologies and Vocabularies
Ontologies and VocabulariesOntologies and Vocabularies
Ontologies and Vocabularies
 
Using ontology for natural language processing
Using ontology for natural language processingUsing ontology for natural language processing
Using ontology for natural language processing
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processing
 
Ontology Powerpoint
Ontology PowerpointOntology Powerpoint
Ontology Powerpoint
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataAn introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked Data
 

Ähnlich wie DODDLE-OWL: A Domain Ontology Construction Tool with OWL

Ontology-based Cooperation of Information Systems
Ontology-based Cooperation of Information SystemsOntology-based Cooperation of Information Systems
Ontology-based Cooperation of Information SystemsRaji Ghawi
 
Adri Jovin - Semantic Web
Adri Jovin - Semantic WebAdri Jovin - Semantic Web
Adri Jovin - Semantic WebAdri Jovin
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information ArchitectureScott Abel
 
Searching Repositories of Web Application Models
Searching Repositories of Web Application ModelsSearching Repositories of Web Application Models
Searching Repositories of Web Application ModelsMarco Brambilla
 
Natural Language Understanding of Systems Engineering Artifacts
Natural Language Understanding of Systems Engineering ArtifactsNatural Language Understanding of Systems Engineering Artifacts
Natural Language Understanding of Systems Engineering ArtifactsÁkos Horváth
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information RetrievalBhaskar Mitra
 
Semantic Web for Enterprise Architecture
Semantic Web for Enterprise ArchitectureSemantic Web for Enterprise Architecture
Semantic Web for Enterprise ArchitectureJames Lapalme
 
ModelWriter Presentation International 01-07-2015
ModelWriter Presentation International 01-07-2015ModelWriter Presentation International 01-07-2015
ModelWriter Presentation International 01-07-2015Ferhat Erata
 
Structured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackStructured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackMike Bergman
 
From Semantics to Self-supervised Learning for Speech and Beyond (Opening Ke...
From Semantics to Self-supervised Learning  for Speech and Beyond (Opening Ke...From Semantics to Self-supervised Learning  for Speech and Beyond (Opening Ke...
From Semantics to Self-supervised Learning for Speech and Beyond (Opening Ke...linshanleearchive
 
Introduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallIntroduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallJohn Mulhall
 
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Facultad de Informática UCM
 
Semantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: IntroductionSemantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: IntroductionKent State University
 
Pattern-based Ontology Engineering
Pattern-based Ontology EngineeringPattern-based Ontology Engineering
Pattern-based Ontology Engineeringkjanowicz
 
Toward The Semantic Deep Web
Toward The Semantic Deep WebToward The Semantic Deep Web
Toward The Semantic Deep WebSamiul Hoque
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSKishan Patel
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldAmit Sheth
 
Diagrammatic knowledge modeling for managers – ontology-based approach
Diagrammatic knowledge modeling for managers  – ontology-based approachDiagrammatic knowledge modeling for managers  – ontology-based approach
Diagrammatic knowledge modeling for managers – ontology-based approachDmitry Kudryavtsev
 

Ähnlich wie DODDLE-OWL: A Domain Ontology Construction Tool with OWL (20)

Ontology-based Cooperation of Information Systems
Ontology-based Cooperation of Information SystemsOntology-based Cooperation of Information Systems
Ontology-based Cooperation of Information Systems
 
Adri Jovin - Semantic Web
Adri Jovin - Semantic WebAdri Jovin - Semantic Web
Adri Jovin - Semantic Web
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
 
Searching Repositories of Web Application Models
Searching Repositories of Web Application ModelsSearching Repositories of Web Application Models
Searching Repositories of Web Application Models
 
Natural Language Understanding of Systems Engineering Artifacts
Natural Language Understanding of Systems Engineering ArtifactsNatural Language Understanding of Systems Engineering Artifacts
Natural Language Understanding of Systems Engineering Artifacts
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
Semantic Web for Enterprise Architecture
Semantic Web for Enterprise ArchitectureSemantic Web for Enterprise Architecture
Semantic Web for Enterprise Architecture
 
ModelWriter Presentation International 01-07-2015
ModelWriter Presentation International 01-07-2015ModelWriter Presentation International 01-07-2015
ModelWriter Presentation International 01-07-2015
 
Structured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackStructured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product Stack
 
From Semantics to Self-supervised Learning for Speech and Beyond (Opening Ke...
From Semantics to Self-supervised Learning  for Speech and Beyond (Opening Ke...From Semantics to Self-supervised Learning  for Speech and Beyond (Opening Ke...
From Semantics to Self-supervised Learning for Speech and Beyond (Opening Ke...
 
Introduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallIntroduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John Mulhall
 
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
 
Semantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: IntroductionSemantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: Introduction
 
Modest Formalization of Software Design Patterns
Modest Formalization of Software Design PatternsModest Formalization of Software Design Patterns
Modest Formalization of Software Design Patterns
 
Pattern-based Ontology Engineering
Pattern-based Ontology EngineeringPattern-based Ontology Engineering
Pattern-based Ontology Engineering
 
Toward The Semantic Deep Web
Toward The Semantic Deep WebToward The Semantic Deep Web
Toward The Semantic Deep Web
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESS
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-World
 
Illustrated Code (ASE 2021)
Illustrated Code (ASE 2021)Illustrated Code (ASE 2021)
Illustrated Code (ASE 2021)
 
Diagrammatic knowledge modeling for managers – ontology-based approach
Diagrammatic knowledge modeling for managers  – ontology-based approachDiagrammatic knowledge modeling for managers  – ontology-based approach
Diagrammatic knowledge modeling for managers – ontology-based approach
 

DODDLE-OWL: A Domain Ontology Construction Tool with OWL

  • 1. DODDLE-OWL: A Domain Ontology Construction Tool with OWL Takeshi Morita 1) ,Naoki Fukuta 2) , Noriaki Izumi 3) , and Takahira Yamaguchi 1) 1) Keio University, Japan 2) Shizuoka University, Japan 3) National Institute of AIST, Japan
  • 2. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 3. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 4. Motivation • Background – The role of domain ontologies is important for the Semantic Web – Sharing common understanding among people and software agents – Finding appropriate information on the web • Issues of large cost with building up domain ontologies – Many concepts in a domain – Each concept has high specific meaning – We need knowledge of domain experts – Cost-benefit performance of domain ontologies is lower than that of general ontologies (e.g. WordNet, EDR)
  • 5. Semi-Automatic Construction Set of Concept Pairs Quality Refinement Domain Specific Documents (English or Japanese) A Domain Ontology (OWL format) Initial Concept Hierarchy Translation EDR (general) EDR (technical) WordNet General Ontologies Our Goal User (domain expert) Taxonomic Relationships Non-Taxonomic Relationships DODDLE-OWL: a Domain Ontology rapiD DeveLopment Environment – OWL extension Focusing the quality refinement phase of ontology construction
  • 6. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 7. Related WorksMehrnoush Shamsfard, Ahmad Abdollahzadeh Barforoush, The State of the Art in Ontology Learning: A Framework for Comparison Learning System Element (s) Learned Prior Knowledge Input DODDLE-OWL (Keio University) Taxonomic and non-taxonomic conceptual Relations WordNet, EDR Unstructured domain specific texts (English and Japanese) ASIUM (Paris-Sud University ) Verb subcat. Frames + hierarchies Linguistic K. Unstructured (corpora) ( French ) HASTI (Amir Kabir University of Technology ) Words, Concepts, Taxonomic and Non-Taxonomic conceptual relations, axioms Almost empty (small kernel) Unstructured NL texts (Persian ) SVETLAN’ (CNRS laboratory) Noun Classes Structured + Unstructured input to SEGAPSITH (French ) SYNDIKATE (University of Albert- Ludwigs) Words , Concepts, Taxonomic and Non-Taxonomic conceptual relations Generic and domain lexicons and ontologies Unstructured   NL texts (German ) TEXT-TO-ONTO (University of Karlsruhe) Concepts, Taxonomic and Non- Taxonomic conceptual relations Lexical DB + domain lexicon NL texts , Web docs, Semi-structured (XML, DTD) and structured (German , HTML , XML, DTD ) WEB→KB Instances of classes and The ontology for An ontology + Training Support to construct Taxonomic and Non-Taxonomic Relationships.
  • 8. Learning System Degree of Automation DODDLE-OWL (Keio University) User Interaction, Hand-made modification ASIUM (Paris-Sud University ) Cooperative HASTI (Amir Kabir University of Technology ) Both automatic and cooperative modes SVETLAN’ (CNRS laboratory) Automatic SYNDIKATE (University of Albert- Ludwigs) Automatic TEXT-TO-ONTO (University of Karlsruhe) Semi-automatic interactive, balanced cooperative WEB→KB (Distributed Systems Technology Centre) Automatic Related Works (cont.)Mehrnoush Shamsfard, Ahmad Abdollahzadeh Barforoush, The State of the Art in Ontology Learning: A Framework for Comparison Many ontology learning systems are focusing on automatic ontology construction. The user is difficult to refine the automatic generated ontologies. However Our system is focusing on high-level support for user interaction. The user is easy to refine semi- automatic generated ontologies and constructs high quality domain ontologies. Therefore
  • 9. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 10. Set of Concept Pairs A Domain Ontology (OWL format) Initial Concept Hierarchy EDR (general) EDR (technical) WordNet General Ontologies System Overview User (domain expert) Taxonomic Relationships Non-Taxonomic Relationships Semi-Automatic Construction Quality Refinement Domain Specific Documents (English or Japanese) Translation DODDLE-OWL: a Domain Ontology rapiD DeveLopment Environment – OWL extension Focusing the quality refinement phase of ontology construction Documents Construction Module Refinement Module Visualization Module Input Module Translation Module
  • 11. Set of Concept Pairs A Domain Ontology (OWL format) Initial Concept Hierarchy EDR (general) EDR (technical) WordNet General Ontologies System Overview User (domain expert) Taxonomic Relationships Non-Taxonomic Relationships Documents Construction Module Refinement Module Visualization Module Input Module Translation Module
  • 12. Procedure of Input Module WordNet EDR (general) EDR (technical) Ontology Selection Domain specific documents (English or Japanese) Input Word Selection Word POS TF IDF TF-IDF W1 Noun ….. ….. ….. W2 Complex Word ….. ….. ….. Morphological Analysis Complex Word Extraction Document Selection ……………………………………….. Disambiguation Input Word Set Input Concepts W1 W2 W3 EDR (general): Ci WordNet: Cj EDR (technical): Ck ……………………………………….. Input Word Set Input Concept Set Select significant words for the domain (input words) Identify the sense of input words to map those words to concepts in the general ontologies
  • 13. A Domain Ontology (OWL format) EDR (general) EDR (technical) WordNet General Ontologies User (domain expert) Taxonomic Relationships Non-Taxonomic Relationships Documents Refinement Module Visualization Module Translation Module statistic methods matching & trimming Relationship Construction Construction Module Set of Concept Pairs Input Concept Selection Hierarchy Construction Association Rule WordSpace Initial Concept Hierarchy Input concepts Input Module
  • 14. Construction Module statistic methods matching & trimming Relationship Construction Hierarchy Construction Association Rule WordSpace Documents Set of Concept PairsInitial Concept Hierarchy Refinement Module Input Module Input concepts EDR (general) EDR (technical) WordNet General Ontologies Initial Domain Ontology
  • 15. Hierarchy Construction Module Taxonomic Relationships in the general ontologies Merging Input Concepts Trimming Root Root unnecessary Internal Node Best Matched Node Salient Internal Node Initial Model initial concept hierarchy get paths related to input concepts generate an initial model trimming
  • 16. Extract Concept Pairs by different methods Relationship Construction Module Input concepts Matching Documents WordSpace Association Rule Set of concept pairs method based on context similarity Popular method in the field of data mining
  • 17. • Words and phrases in documents can be expressed by vector representation containing co-occurrence   statistics WordSpace Method WordSpace  ( Marti A. Hearst, Hinrich Schutze ) … wi … wj …C1 … wk … … wi … wj …C2 … wk … Context Similarity between concepts C1 and C2 • Inner products among the vectors work as the similarity between the words and phrases. High similarity Significant related concept pair for the domain
  • 18. Association Rule • Find associations between items in a set of transaction • In our research – Each item is an input concept appearing in the document – One transaction is one sentence in the document • Parameters – Support = contain X and Y / All transaction – Confidence = contain X and Y / contain X X and Y: input concepts
  • 19. statistic methods matching & trimming Relationship Construction Construction Module Set of Concept Pairs Input Concept Selection Hierarchy Construction Refinement Module Association Rule WordSpace Concept Specification Template Documents Visualization Module A Domain Ontology (OWL format) Input Module Initial Concept Hierarchy Matched Result Analysis Hierarchy Refinement Trimmed Result Analysis Translation Module Input conceptsEDR (general) EDR (technical) WordNet General Ontologies The value of co-concurrency Relationship Refinement
  • 20. Concept Specification Hierarchy Refinement Module Relationship Refinement Module Refinement Module Concept Hierarchy Concept Drift Management Better performance by changing parameters with interaction of a userVisualization Module: MR3 RDF&RDFS Visual Editing Translation Module Set of Concept PairsInitial Concept Hierarchy support to refine the initial concept hierarchy graphically
  • 21. Hierarchy Refinement Module - Concept Drift A Domain Ontologygeneral ontologies Reusable Part No reusable part because of concept drift an initial concept hierarchy constructed The position of particular concepts changes depending on the domain. adjust the initial concept hierarchy to the specific domain Concept Drift
  • 22. Hierarchy Refinement Module strategy 1 Matched Result Analysis Point out differences of abstraction level among sibling nodes according to the Trimmed Result MOVE MOVE MOVE STAY A B C D Trimming Area Initial Model 0 0 3 B C D Trimmed Model A B C A D strategy 2 Trimmed Result Analysis Divide the initial concept hierarchy into reusable area and not reusable area according to the position of Best Matched Nodes suggest a user to move not reusable area Trimming Reconstructed by User Best Matched Node Internal Node Reconstructed by User
  • 23. Concept Drift Management Matched Result Analysis Trimmed Result Analysis Visualization Module Visualization Module Parts of Modification are highlighted based on Matched result analysis and Trimmed Result analysis
  • 24. Relationship Refinement Module Non-Taxonomic Relationship Learning Non-Taxonomic Relationships Identify correct pairs from generated candidates Setting parameters for WordSpace and Association Rule Construct non-taxonomic relationships by considering the relation with each concept pair
  • 25. statistic methods matching & trimming Relationship Construction Construction Module Set of Concept Pairs Input Concept Selection Hierarchy Construction Refinement Module Association Rule WordSpace Concept Specification Template Documents Visualization Module A Domain Ontology (OWL format) Input Module Initial Concept Hierarchy Matched Result Analysis Hierarchy Refinement Trimmed Result Analysis Translation Module Input conceptsEDR (general) EDR (technical) WordNet General Ontologies The value of co-concurrency Relationship Refinement
  • 26. Translation Module owl:Class owl:Class owl:Class owl:Class rdfs:subClassOf owl:ObjectProperty rdfs:domain rdfs:range Taxonomic Relationships Non-Taxonomic Relationships Export constructed domain ontology in OWL format relationship rdfs:subClassOf rdfs:subClassOf
  • 27. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 28. Implementation Architecture Java Virtual Machine Jena2MR3 Visualization Module Translation Module Construction and Refinement Module Input Module Java WordNet Library (JWNL) Gensen Sen SS-Tagger JWNL: http://jwordnet.sourceforge.net/ Gensen: A Complex Word Extraction Tool Sen: A Japanese morpheme analyzer, http://ultimania.org/sen/ SS-Tagger: English Tagger MR3 : an RDF & RDFS graphical editor, http://mmm.semanticweb.org/mr3/ Jena Semantic Web Tool Framework: HP Labs, http://jena.sourceforge.net/
  • 29. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 30. Case Studies • Purpose – To check DODDLE-OWL can support the user in constructing taxonomic and non-taxonomic relationships • Target Field   – Particular field of business   – xCBL (XML Common Business Library) • http://www.xcbl.org/ • Domain Specific Document – xCBL Document Description – about 150 sentences and 2500 words • Input Concepts – 57 business concepts from the document • User – Not an expert but has business knowledge
  • 31. Results and Evaluation for Taxonomic Relationships Construction Initial Concept Hierarcy Business Ontology Get paths related to input concepts from WordNet Trimming Modification 57 Concepts Precision Recall ST1: Matched Result Analysis 5/25(=0.2)   5/7(=0.71) Evaluation of two strategies by the user 82 Concepts 152 Concepts 83 Concepts The number of concepts in each model Input Concepts Initial Model Constructed with Hierarchy Construction Module Constructed with Hierarchy Refinement Module
  • 32. Results and Evaluation for Non-Taxonomic Relationships Construction WS AR The union of WS & AR # Extracted concept pairs 40 39 66 # Accepted concept pairs 30 20 39 # Rejected concept pairs 10 19 27 Precision 0.75 (30/40) 0.51 (20/39) 0.59 (39/66) Association Rule ( 20/39 )WordSpace ( 30/40 ) 8 17 1119 9 2 The union of WS & AR ( 39/66 ) Frequency of Extracted 4-gram Context Scope ( before : after ) threshold of similarity 2 10:10 0.6 Minimum Support Minimum Confidence 0.7 % 55 % For WordSpace parametersFor Association Rule parameters
  • 33. Results and Evaluation for Non-Taxonomic Relationships Acquisition WS AR The union of WS & AR # Extracted concept pairs 40 39 66 # Accepted concept pairs 30 20 39 # Rejected concept pairs 10 19 27 Precision 0.75 (30/40) 0.51 (20/39) 0.59 (39/66) Association Rule ( 20/39 ) 8 17 1119 9 WordSpace ( 30/40 ) 2 The union of WS & AR ( 39/66 ) The precision of WS method is good, but the WS method has its bias so we cannot get certain types of concept pairs from it. we combine two different methods for getting wider range of concept pairs.
  • 34. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 35. Demonstration • Construct taxonomic and non-taxonomic relationships for xCBL
  • 36. Contents • Motivation • Related Works • DODDLE-OWL Overview • Implementation Architecture • Case Studies • Demonstration • Conclusions
  • 37. Conclusions • Summary – DODDLE-OWL: a Domain Ontology rapiD DeveLopment Environment – OWL extension • Focusing the quality refinement phase of ontology construction – Case studies • construct a domain ontology for xCBL • Support the user in constructing and refining the domain ontology • Future Work – Reuse existing (domain) ontologies in any forms – Apply DODDLE-OWL to large scale domain ontology construction • Rocket operation ontology • About 40,000 concepts
  • 38. Thank you for your attention. DODDLE-OWL has been released . Please visit this web site, if you like it. about 100 user now http://mmm.semanticweb.org/doddle/

Hinweis der Redaktion

  1. 最新版に置き換える