SlideShare ist ein Scribd-Unternehmen logo
1 von 10
Downloaden Sie, um offline zu lesen
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
DOI : 10.5121/ijist.2014.4301 1
IDENTIFYING THE SEMANTIC RELATIONS ON
UNSTRUCTURED DATA
Chien D C Ta1
and Tuoi Phan Thi1
1Faculty of Computer Science and Engineering,
HoChiMinh City University of Technology
Abstract
Ontologisms have been applied to many applications in recent years, especially on Sematic Web, Informa-
tion Retrieval, Information Extraction, and Question and Answer. The purpose of domain-specific ontology
is to get rid of conceptual and terminological confusion. It accomplishes this by specifying a set of generic
concepts that characterizes the domain as well as their definitions and interrelationships. This paper will
describe some algorithms for identifying semantic relations and constructing an Information Technology
Ontology, while extracting the concepts and objects from different sources. The Ontology is constructed
based on three main resources: ACM, Wikipedia and unstructured files from ACM Digital Library. Our
algorithms are combined of Natural Language Processing and Machine Learning. We use Natural Lan-
guage Processing tools, such as OpenNLP, Stanford Lexical Dependency Parser in order to explore sen-
tences. We then extract these sentences based on English pattern in order to build training set. We use a
random sample among 245 categories of ACM to evaluate our results. Results generated show that our
system yields superior performance.
Keywords
Domain ontology, Knowledge based system, Semantic Relation, Information Extraction.
1. Introduction
The methods of human computer interaction on Internet as with Google Search, Information Ex-
traction, Question and Answer have become important tools in modern society. End users can use
these systems with various purposes, such as querying information, learning, E-commerce, etc.
However, the precision of these systems is always a big issue that the research must solve. In or-
der to achieve high precision, the systems should consider semantic of sentences. Previous re-
search has been done in the realm of semantic relation detection but it still remains challenging to
this day. V. Malase et al [1] use lexical- syntactic patterns to detect semantic relations between
the main terms of definition in order to help terminologist build structured terminology following
these relations. With the dramatic increase of data on the Internet, identifying semantic relations
plays an important role in semantic-oriented applications.
Our goal is to automatically identify some of the semantic relations that might be found in do-
main-specific corpora. Here, we will describe methods of identifying semantic relations. For this
purpose, we will combine Natural Language Processing and Matching Learning. We will also
define some English patterns and semantic roles to identify semantic relations in 2000 text files
from ACM Digital Library. We then propose a method to identify synonyms, hyponyms, and
hypernyms of instances in domain-specific ontology. Finally, we will use three measures: Preci-
sion, Recall and F-Measure in order to evaluate these methods. The evaluation results shown af-
terward will prove the effectiveness of the proposed mythology.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
2
2. Related Work
Information extraction is an important research topic in Natural language Processing (NLP) [2]. It
tries to find semantic relations, relevant information from the large amount of text documents and
on the World Wide Web. Y. Jie et al [3] focused on semantic rules to build Extraction system
from LIDAR (Light Detection and Ranging). F. Gomez et al [4] built a semantic interpreter to
assign meaning to the grammatical relations of the sentences when they constructed a knowledge
base about a given topic. K. Kongkachandra et al [5] proposed semantic based key-phrase recov-
ery for domain-independent key-phrase extraction. In this method, he added a key-phrase recov-
ery function as a post process of the conventional key-phrase extractors in order to reconsider the
failed key phrases by semantic matching based on sentence meaning. Z.Goudong et al [6] pro-
posed novel tree kernel-based method with rich syntactic and semantic information for the extrac-
tion of semantic relations between named entities. A.B. Abacha et al [7] built a platform MeTAE
(Medical Texts Annotation and Exploration). This system allows the extracting and annotating of
Medical entities and relationships from Medical text. He relied on linguistic patterns to detect the
semantic relations in medical text files. A.D.S Jayatilaka et al [8] constructed ontology from Web
pages. He introduced web usage patterns as a novel source of semantics in ontology learning. The
proposed methodology combines web content mining with web usage mining in the know-
ledge extraction process. H. Li et al [9] extract semantic relations between Chinese named enti-
ties based on semantic features and the Vector Space Model (VSM).
Those research attempts were meant to identify semantic relations from web documents or text
files in order to develop ontology. They either used natural language processing techniques, the
statistical, or the machine learning approach in the ontology learning process. Our research com-
bines Natural Language Processing and Matching Learning to identify semantic relations on un-
structured data.
3. Algorithms for Identifying Semantic Relations
3.1 Information Technology Ontology (ITO)
Domain Ontology includes concepts, attributes and events. Ontology is a tuple O = (C, I, R, T, V,
≤, ⊥, ∈, =) [10], where:
C is the set of concepts, I is the set of individuals including attributes and events.
R is the set of relations; T is the set of data types.
V is the set of values (C, I, R, T, V being pair wise disjoint)
≤ is a relation on(C×C) ∪ (R×R) ∪ (T×T) called specialization.
⊥ is a relation on(C×C) ∪ (R×R) ∪ (T×T) called exclusion. ∈ is a relation over (I×C) ∪ (V× T)
called instantiation, = is a relation over I×P× (I∪V) called assignment.
We propose an ontology structure, as shown in Figure.1
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
3
Figure1. Structure of Information Technology Ontology
There are four layers in this Ontology
Topic layer includes 245 categories in Information Technology area from ACM Category [11].
Ingredient layer includes many instances in Information Technology area from difference re-
sources such as Wikipedia and unstructured text files.
Synset layer is a set of synonyms, hyponyms, and hypernyms of instances from ingredient layer.
We will describe it in detail in next session.
The last layer is known as sentence layer. We will introduce it in detail in next session.
3.2 Algorithm for Identifying Synonyms, Hyponyms and Hypernyms Relations
The Synset layer in this ontology includes synonyms, hyponyms and hypernyms of instances be-
longing to an ingredient layer. In order to find a set of synonyms, hyponyms and hypernyms of
instances from ingredient layer, we use WordNet. Similarly to Wikipedia, WordNet is an ontolo-
gy that includes many fields and languages. However, we will only focus on English language.
Our proposed algorithm is as follows.
Procedure Find_out_Synset ()
While (instance of Ingredient layer is not null)
Begin
Synonym_List =QueryIntoWordNet (instance)
If (Synset_List is not null)
Link (instance to Synonym, hyponym, hypernym)
End if
End
End While
End Pro
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
4
Table 1 represents results after applying the algorithm for some instances.
Table 1. Set of Synonym, Hyponym and Hypernym corresponding with instances
Instances from
Ingredient layer
Synonyms Hyponyms Hypernyms
NLP Natural Language
Processing
Informatics, infor-
mation processing
Data structure Hierarchical struc-
ture
Organization, sys-
tem
Computer Network Internet, intranet,
WAN
Electronic network
RAM Random Access
Memory
Core memory Volatile storage
From Table 1, we can identify some semantic relations between an instance of Ingredient layer
with its synonyms, hyponyms and hypernyms, such as
- NLP is a Natural Language Processing
- NLP such as Informatics, information processing
- Hierarchical structure includes Data structure
- Data structure such as organization, system
- RAM is random access memory
- Core memory includes RAM
3.3 Algorithm for Identifying Semantic Relations based on Syntax Patterns and
Linguistic
To enrich the sentence layer of ontology, we use English syntax patterns to extract sentences re-
lated to instances of ingredient layer. This process is implemented during the extraction of in-
stances. For this purpose, we use OpenNLP tool to recognize sentences. Then, we use a Stanford
Lexical Dependency Parser (SLDP) to refine them. SLDP can output typed dependency tree to
present a grammatical relationship between keywords in the sentence. After that, we apply lin-
guistic markers to the sentences to recognize their semantic meaning. Our proposed model is
shown in Figure. 2.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
5
Figure2. Identifying Semantic Relations based on Pattern and Linguistic
Our training corpus (2000 papers) for testing is focused on the field of Information Technology.
Electronic documents were automatically collected from ACM digital library. We designed ele-
ven English patterns to select sentences identified by OpenNLP. The English patterns are shown
in table 2.
Table 2. English Patterns used to extract sentences
No Pattern Example
1 S + V Computer is broken
2 S + V + Object He was mowing the lawn
3 S + V + Adjective The girl was tall
4 S + V + Indirect Object The woman went to the house
5 S + V + Direct Object The man hit the ball
6 S + V + Complement Some students in the class are engi-
neers
7 S + V + Prepositional Phrase The cat waited for its owner yester-
day
8 S +V+ Indirect Object+ Direct Object Granny left Gary all of her money
9 S + V + Direct Object + Adjective The Jury found the defendant guilty
10 S + V + Direct Object + NP The Jury found the defendant guilty
11 S + V + Direct Object + Complement The class picked Susieclass repre-
sentative.
After extracting sentences based on the English patterns, we refine them by SLDP. With SLDP,
we eliminate the unnecessary words in a sentence. Below are some examples of such sentences.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
6
Table 3. Examples of eliminating unnecessary words
Before applying SLDP After applying SLDP
COBOL is not popular programming
language in recent years.
COBOL is not popular programming lan-
guage.
Oracle database is one of the Relation-
al Database Management System.
Oracle is Relational Database Manage-
ment System.
In my opinion Java Language is diffi-
cult to program
Java Language is difficult to program
C Sharp in Microsoft Dot Net is also
an Object Oriented programming lan-
guage
C Sharp is Object Oriented programming
language
In our ontology, most of the concepts have semantic relations between each other. In order to
identify those semantic relations, we rely on linguistic roles (linguistic marker), which are defined
as follows:
IS-A: this generic-specific relation reflects hierarchical inheritance in network of concepts. All
entities are categorized as instances of a particular class. Class can become instances of a particu-
lar class. Thus, any concepts can be linked to its immediate super ordinate concept. For example,
Random Access Memory (concept) is a core memory (concept) in computer.
PART-OF: this relation also reflects the hierarchical structure of the domain. This relation direct-
ly refers to the parts of each concept in sentence. For example, Random Access Memory (ROM)
(concept) is part of memory (concept)
MADE-OF: this relation links to concepts, which made of material concepts. For example, Inte-
grated Circuit (IC) Chip can be made of a semiconductor material.
DELIMITED-BY: this relation marks the boundaries, dividing one concept from another. This is
a domain-specific relation, mainly for the concepts, which are belonged to different topic in the
field of Information technology. This relation is usually represented by a number of verbs, such as
include, delimit, limit, circumscribe, restrict, etc. For example, the processing of a computer is
restricted by CPU, RAM.
TAKES-PLACE-IN: this relation describes the context of processes, which are related to spatial
and temporal dimensions. A number of verbs represent this relation, such as happen, occur, take
place in, etc. For example, In order to tackle the conflict of process, time scheduling takes place
in the Operating System.
ATTRIBUTE-OF: this relation is only useful for concepts designated by specialized adjectives,
such as strong, powerful, etc., or nouns that define the properties of other concepts. For example,
these router devices are powerful and useful in a network.
RESULT-OF: this relation is relevant to either processes or entities that are derived from other
processes. For example, as a result of the inconsistency, this file is considered corrupted.
AFFECTS: this relation, along with RESULT-OF, is a crucial semantic relation in the knowledge
base since both can relate all kinds of concepts in ontology.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
7
CAUSES: this relation is directly relevant to the processes or concepts that are derived from other
processes. On the contrary of RESULT-OF, this relation usually represents negative meaning.
These linguistic roles help us identify semantic relations between keywords in a sentence. There-
fore, we can recognize sentence meaning and exactly categorize them. The semantic relations are
represented, as shown in Figure. 3.
Figure 3. Semantic relations in Information Technology Ontology
4. Experimental Results
Over 2000 text documents from the ACM Digital Library were used for testing purposes. These
documents with different topics belonged to the field of Information technology. In order to easily
process the documents, we grouped them by category before extracting instances and concepts. In
our paper, we chose randomsample corpora among 245 categories from ACM.
The system’s performance was calculatedby using three measures: Precision, Recall and F-
measure. They are calculated by each category in domain-specific ontology as below:
( ) =
( )
( ) + ( )
=
( )
( ) + ( )
F − Mesure = 2
Precision ∗ Recall
Precision + Recall
Where Ci represents a category in ITO and correct, bad, missing represented the number of cor-
rect, wrong, missing, respectively.
Experiment results are shown in table 4, 5, 6, 7, and 8.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
8
Table 4. Experiments results on instances of ingredient layer
Category Qualityof
instances
Precision
(%)
Recall
(%)
F-Measure
(%)
Application (AP_inst) 3672 79.26 76.51 77.86
Artificial Intelligent
(AI_inst)
5714 82.94 78.92 80.88
Logic Design
(LD_inst)
4644 82.18 80.06 81.11
Operating System
(OS_inst)
6785 84.47 81.37 82.89
Process Management
(PM_inst)
3056 76.53 72.51 74.47
Software (Soft_inst) 4249 81.64 79.62 80.62
Table 5. Experiments results on set of synonyms
Category Quality of
synonym
Precision
(%)
Recall
(%)
F-Measure
(%)
Application
(AP_syno)
524 79.26 76.51 77.86
Artificial Intelligent
(AI_syno)
689 94.41 88.15 91.17
Logic Design
(LD_syno)
472 92.24 84.27 88.08
Operating System
(OS_syno)
861 96.18 91.58 93.82
Process Management
(PM_syno)
517 93.25 86.16 89.56
Software (SW_syno) 583 94.26 89.04 91.57
Table 6.Experiments results on set of Hyponyms
Category Qualityof
Hyponym
Precision
(%)
Recall
(%)
F-Measure
(%)
Application
(AP_hypo)
714 89.38 76.51 82.45
ArtificialIntelligent
(AI_hypo)
837 96.14 88.29 92.04
Logic Design
(LD_hypo)
718 87.54 84.26 85.86
Operating System
(OS_hypo)
972 96.82 91.42 94.04
Process Management
(PM_hypo)
728 88.31 85.15 86.70
Software (SW_hypo) 646 85.64 81.04 83.28
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
9
Table 7. Experiments results on set of Hypernyms
Category Qualityof
Hypernym
Precision
(%)
Recall
(%)
F-Measure
(%)
Application
(AP_hype)
916 79.26 76.51 77.86
Artificial Intelligent
(AI_hype)
1321 92.41 91.17 91.79
Logic Design
(LD_hype)
954 84.62 79.37 81.91
Operating System
(OS_hype)
1413 95.04 96.81 95.92
Process Management
(PM_hype)
834 82.31 84.55 83.41
Software (SW_hype) 893 85.48 80.19 82.75
Table 8. Experiments results on set of Sentences
Category Qualityof
Sentences
Precision
(%)
Recall
(%)
F-Measure
(%)
Application
(AP_sent)
683 83.26 80.75 81.98
Artificial Intelligent
(AI_sent)
972 87.93 85.64 86.77
Logic Design
(LD_sent)
589 81.52 79.18 80.33
Operating System
(OS_sent)
1026 92.41 88.32 90.32
Process Management
(PM_sent)
647 82.83 79.17 80.96
Software (SW_sent) 762 86.74 82.14 84.38
Figure 4. Experiment result evaluation of instances, synonyms, hyponyms and hypernymsbased on Preci-
sion, Recall and F-Measures.
International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014
10
5. Conclusions
Our experiment tried to identify semantic relations between nouns or noun phrases in unstruc-
tured files based on linguistic roles in order to build domain ontology on Information Technology.
After usingan OpenNLP tool to recognize words and sentences, we applied English patterns de-
fined by us to extract them. We then refined them using the SDLC tool. Therefore, our experi-
mental results have a high precision and high recall. In order to identify semantic relations, we
applied linguistic roles. The semantic roles directly link to instances in the layers of domain on-
tology. Also, we propose an algorithm to identify semantic relations based on synonyms, hypo-
nyms and hypernyms from instances of the ingredient layer. Overall scores are computed based
on three measures: Precision, Recall ad F-Measure. Efforts must also be invested in order to re-
duce the overall processing time of the system.
In future works, we will focus on building an Information Extraction system based on this ontol-
ogy to solve a number of problems, such as ontology learning and Question and Answer.
REFERENCES
[1] V. MalaSe et al , "Detecting semantic relations between terms in definitions," in The 3rd Interna-
tional Workshop on Computational Terminology - CompuTerm 2004 , 2004.
[2] G. Zhou et al, "Tree Kernel-Based Semantic Relation Extraction with Rich Syntactic and Semantic
Information," Information Sciences, vol. 180, no. 8, pp. 1313 - 1325, 2010.
[3] Y. Jie et al, "Building Extraction from LIDAR based Semantic Analysis," Geo-Spatial Information
Science, vol. 9, no. 4, Sep. 2006.
[4] F. Gomez et al, "Semantic interpretation and knowledge extraction," Knowledge-Based Systems, vol.
20, no. 1, pp. 51 - 60, July 2006.
[5] G.Kongkachandra et al, "Abductive Reasoning for Keyword Recovering in Semantic-based Key-word
Extraction," in The Fifth International Conference on Information Technology: New Generations -
IEEE, 2008, pp. 714 - 719.
[6] Z. Goudong et al, "Tree kernel-based semantic relation extraction with rich syntactic and semantic
information," Information Sciences, vol. 180, no. 8, pp. 1313 - 1325, Dec. 2009.
[7] A.B Abacha et al, "Automatic Extraction of Semantic Relations between Medical Entities- a rule
based approach," Journal of Biomedical Semantics, 2011.
[8] A.D.S Jayatilaka, "Knowledge Extraction for Semantic Web Using Web Mining ," in The Interna-
tional Conference on Advances in ICT for Emerging Regions (ICTer 2011) - IEEE, 2011, pp. 89 - 94.
[9] H. Li et al, "A Relation Extraction Method of Chinese Named Entities based on Location and Se-
mantic Features," Applied Intelligence, vol. 18, no. 1, pp. 1- 14, May 2012.
[10] J. Euzenat et al, Ontology Matching.: Springer, 2007.
[11] ACM. [Online].http://www.acm.org/about/class/ccs98-html.
AUTHORS
Tuoi Phan Thi, Professor. : She is a Professor at the Faculty of Computer Science and
Engineering, HCMC University of Technology, Vietnam. She obtained her Ph.D. in Com-
puter Science from Charles University, Czech Republic, in 1985. Her research interests are
compiler, information retrieval, natural language processing. She has been the Chief Investi-
gator of national key projects and published many papers in international journals and confe-
rence proceedings in those areas.
Chien D C Ta, Mr .: He is Ph.D. Student at the Faculty of Computer Science and Engineer-
ing, HCMC University of Technology, Vietnam. His research interests include Natural Lan-
guage Processing and its applications, Information Extraction.

Weitere ähnliche Inhalte

Was ist angesagt?

EXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBEXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBijcsit
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
 
The recognition system of sentential
The recognition system of sententialThe recognition system of sentential
The recognition system of sententialijaia
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology MappingPradeep B Pillai
 
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-ServiceMarius Corici
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingsamhati27
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingbutest
 
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Waqas Tariq
 
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...Analysis of lexico syntactic patterns for antonym pair extraction from a turk...
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...csandit
 
The role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingThe role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingConstantin Orasan
 
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCES
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCESFINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCES
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCESijnlc
 
Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...ijnlc
 
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...Carrie Wang
 
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENTA DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENTcscpconf
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processingATHMAN HAJ-HAMOU
 
Survey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionarySurvey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionaryEditor IJMTER
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integrationjuanesteva
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic webWorawith Sangkatip
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextUniversity of Bari (Italy)
 

Was ist angesagt? (20)

EXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBEXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEB
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
The recognition system of sentential
The recognition system of sententialThe recognition system of sentential
The recognition system of sentential
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology Mapping
 
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service
58903240-SentiMatrix-Multilingual-Sentiment-Analysis-Service
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Learning ontologies
Learning ontologiesLearning ontologies
Learning ontologies
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
 
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...Analysis of lexico syntactic patterns for antonym pair extraction from a turk...
Analysis of lexico syntactic patterns for antonym pair extraction from a turk...
 
The role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingThe role of linguistic information for shallow language processing
The role of linguistic information for shallow language processing
 
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCES
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCESFINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCES
FINDING OUT NOISY PATTERNS FOR RELATION EXTRACTION OF BANGLA SENTENCES
 
Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...
 
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
 
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENTA DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT
A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processing
 
Survey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionarySurvey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse Dictionary
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integration
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic web
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 

Ähnlich wie Identifying Semantic Relations Using NLP and ML

A03730108
A03730108A03730108
A03730108theijes
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING cscpconf
 
Context Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsContext Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsIJCSIS Research Publications
 
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...iosrjce
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalgowthamnaidu0986
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextFulvio Rotella
 
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSPATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSkevig
 
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSPATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSijnlc
 
Conceptual Similarity Measurement Algorithm For Domain Specific Ontology
Conceptual Similarity Measurement Algorithm For Domain Specific OntologyConceptual Similarity Measurement Algorithm For Domain Specific Ontology
Conceptual Similarity Measurement Algorithm For Domain Specific OntologyZac Darcy
 
Conceptual similarity measurement algorithm for domain specific ontology[
Conceptual similarity measurement algorithm for domain specific ontology[Conceptual similarity measurement algorithm for domain specific ontology[
Conceptual similarity measurement algorithm for domain specific ontology[Zac Darcy
 
Automatic multiple choice question generation system for
Automatic multiple choice question generation system forAutomatic multiple choice question generation system for
Automatic multiple choice question generation system forAlexander Decker
 
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...cscpconf
 
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...IOSR Journals
 
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...CSCJournals
 
AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYijnlc
 
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS AIRCC Publishing Corporation
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusAIRCC Publishing Corporation
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpusijcsit
 

Ähnlich wie Identifying Semantic Relations Using NLP and ML (20)

A03730108
A03730108A03730108
A03730108
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
 
Context Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word PairsContext Sensitive Relatedness Measure of Word Pairs
Context Sensitive Relatedness Measure of Word Pairs
 
F017243241
F017243241F017243241
F017243241
 
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professional
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSPATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
 
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHSPATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
PATENT DOCUMENT SUMMARIZATION USING CONCEPTUAL GRAPHS
 
Conceptual Similarity Measurement Algorithm For Domain Specific Ontology
Conceptual Similarity Measurement Algorithm For Domain Specific OntologyConceptual Similarity Measurement Algorithm For Domain Specific Ontology
Conceptual Similarity Measurement Algorithm For Domain Specific Ontology
 
Conceptual similarity measurement algorithm for domain specific ontology[
Conceptual similarity measurement algorithm for domain specific ontology[Conceptual similarity measurement algorithm for domain specific ontology[
Conceptual similarity measurement algorithm for domain specific ontology[
 
Automatic multiple choice question generation system for
Automatic multiple choice question generation system forAutomatic multiple choice question generation system for
Automatic multiple choice question generation system for
 
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURK...
 
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...
Computational Intelligence Methods for Clustering of Sense Tagged Nepali Docu...
 
L017158389
L017158389L017158389
L017158389
 
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
 
AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITY
 
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 

Mehr von ijistjournal

A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINA MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINijistjournal
 
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDYDECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDYijistjournal
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...ijistjournal
 
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...ijistjournal
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...ijistjournal
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...ijistjournal
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSijistjournal
 
Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...ijistjournal
 
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMijistjournal
 
Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...ijistjournal
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...ijistjournal
 
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial DomainColour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domainijistjournal
 
Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...ijistjournal
 
International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)ijistjournal
 
Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...ijistjournal
 
Design and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression AlgorithmDesign and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression Algorithmijistjournal
 
Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...ijistjournal
 
Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...ijistjournal
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKijistjournal
 
Call for Papers - 13th International Conference on Cloud Computing: Services ...
Call for Papers - 13th International Conference on Cloud Computing: Services ...Call for Papers - 13th International Conference on Cloud Computing: Services ...
Call for Papers - 13th International Conference on Cloud Computing: Services ...ijistjournal
 

Mehr von ijistjournal (20)

A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINA MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
 
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDYDECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...
 
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...
 
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
 
Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...
 
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial DomainColour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
 
Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...
 
International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)
 
Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...
 
Design and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression AlgorithmDesign and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression Algorithm
 
Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...
 
Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
 
Call for Papers - 13th International Conference on Cloud Computing: Services ...
Call for Papers - 13th International Conference on Cloud Computing: Services ...Call for Papers - 13th International Conference on Cloud Computing: Services ...
Call for Papers - 13th International Conference on Cloud Computing: Services ...
 

Kürzlich hochgeladen

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Kürzlich hochgeladen (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Identifying Semantic Relations Using NLP and ML

  • 1. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 DOI : 10.5121/ijist.2014.4301 1 IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA Chien D C Ta1 and Tuoi Phan Thi1 1Faculty of Computer Science and Engineering, HoChiMinh City University of Technology Abstract Ontologisms have been applied to many applications in recent years, especially on Sematic Web, Informa- tion Retrieval, Information Extraction, and Question and Answer. The purpose of domain-specific ontology is to get rid of conceptual and terminological confusion. It accomplishes this by specifying a set of generic concepts that characterizes the domain as well as their definitions and interrelationships. This paper will describe some algorithms for identifying semantic relations and constructing an Information Technology Ontology, while extracting the concepts and objects from different sources. The Ontology is constructed based on three main resources: ACM, Wikipedia and unstructured files from ACM Digital Library. Our algorithms are combined of Natural Language Processing and Machine Learning. We use Natural Lan- guage Processing tools, such as OpenNLP, Stanford Lexical Dependency Parser in order to explore sen- tences. We then extract these sentences based on English pattern in order to build training set. We use a random sample among 245 categories of ACM to evaluate our results. Results generated show that our system yields superior performance. Keywords Domain ontology, Knowledge based system, Semantic Relation, Information Extraction. 1. Introduction The methods of human computer interaction on Internet as with Google Search, Information Ex- traction, Question and Answer have become important tools in modern society. End users can use these systems with various purposes, such as querying information, learning, E-commerce, etc. However, the precision of these systems is always a big issue that the research must solve. In or- der to achieve high precision, the systems should consider semantic of sentences. Previous re- search has been done in the realm of semantic relation detection but it still remains challenging to this day. V. Malase et al [1] use lexical- syntactic patterns to detect semantic relations between the main terms of definition in order to help terminologist build structured terminology following these relations. With the dramatic increase of data on the Internet, identifying semantic relations plays an important role in semantic-oriented applications. Our goal is to automatically identify some of the semantic relations that might be found in do- main-specific corpora. Here, we will describe methods of identifying semantic relations. For this purpose, we will combine Natural Language Processing and Matching Learning. We will also define some English patterns and semantic roles to identify semantic relations in 2000 text files from ACM Digital Library. We then propose a method to identify synonyms, hyponyms, and hypernyms of instances in domain-specific ontology. Finally, we will use three measures: Preci- sion, Recall and F-Measure in order to evaluate these methods. The evaluation results shown af- terward will prove the effectiveness of the proposed mythology.
  • 2. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 2 2. Related Work Information extraction is an important research topic in Natural language Processing (NLP) [2]. It tries to find semantic relations, relevant information from the large amount of text documents and on the World Wide Web. Y. Jie et al [3] focused on semantic rules to build Extraction system from LIDAR (Light Detection and Ranging). F. Gomez et al [4] built a semantic interpreter to assign meaning to the grammatical relations of the sentences when they constructed a knowledge base about a given topic. K. Kongkachandra et al [5] proposed semantic based key-phrase recov- ery for domain-independent key-phrase extraction. In this method, he added a key-phrase recov- ery function as a post process of the conventional key-phrase extractors in order to reconsider the failed key phrases by semantic matching based on sentence meaning. Z.Goudong et al [6] pro- posed novel tree kernel-based method with rich syntactic and semantic information for the extrac- tion of semantic relations between named entities. A.B. Abacha et al [7] built a platform MeTAE (Medical Texts Annotation and Exploration). This system allows the extracting and annotating of Medical entities and relationships from Medical text. He relied on linguistic patterns to detect the semantic relations in medical text files. A.D.S Jayatilaka et al [8] constructed ontology from Web pages. He introduced web usage patterns as a novel source of semantics in ontology learning. The proposed methodology combines web content mining with web usage mining in the know- ledge extraction process. H. Li et al [9] extract semantic relations between Chinese named enti- ties based on semantic features and the Vector Space Model (VSM). Those research attempts were meant to identify semantic relations from web documents or text files in order to develop ontology. They either used natural language processing techniques, the statistical, or the machine learning approach in the ontology learning process. Our research com- bines Natural Language Processing and Matching Learning to identify semantic relations on un- structured data. 3. Algorithms for Identifying Semantic Relations 3.1 Information Technology Ontology (ITO) Domain Ontology includes concepts, attributes and events. Ontology is a tuple O = (C, I, R, T, V, ≤, ⊥, ∈, =) [10], where: C is the set of concepts, I is the set of individuals including attributes and events. R is the set of relations; T is the set of data types. V is the set of values (C, I, R, T, V being pair wise disjoint) ≤ is a relation on(C×C) ∪ (R×R) ∪ (T×T) called specialization. ⊥ is a relation on(C×C) ∪ (R×R) ∪ (T×T) called exclusion. ∈ is a relation over (I×C) ∪ (V× T) called instantiation, = is a relation over I×P× (I∪V) called assignment. We propose an ontology structure, as shown in Figure.1
  • 3. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 3 Figure1. Structure of Information Technology Ontology There are four layers in this Ontology Topic layer includes 245 categories in Information Technology area from ACM Category [11]. Ingredient layer includes many instances in Information Technology area from difference re- sources such as Wikipedia and unstructured text files. Synset layer is a set of synonyms, hyponyms, and hypernyms of instances from ingredient layer. We will describe it in detail in next session. The last layer is known as sentence layer. We will introduce it in detail in next session. 3.2 Algorithm for Identifying Synonyms, Hyponyms and Hypernyms Relations The Synset layer in this ontology includes synonyms, hyponyms and hypernyms of instances be- longing to an ingredient layer. In order to find a set of synonyms, hyponyms and hypernyms of instances from ingredient layer, we use WordNet. Similarly to Wikipedia, WordNet is an ontolo- gy that includes many fields and languages. However, we will only focus on English language. Our proposed algorithm is as follows. Procedure Find_out_Synset () While (instance of Ingredient layer is not null) Begin Synonym_List =QueryIntoWordNet (instance) If (Synset_List is not null) Link (instance to Synonym, hyponym, hypernym) End if End End While End Pro
  • 4. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 4 Table 1 represents results after applying the algorithm for some instances. Table 1. Set of Synonym, Hyponym and Hypernym corresponding with instances Instances from Ingredient layer Synonyms Hyponyms Hypernyms NLP Natural Language Processing Informatics, infor- mation processing Data structure Hierarchical struc- ture Organization, sys- tem Computer Network Internet, intranet, WAN Electronic network RAM Random Access Memory Core memory Volatile storage From Table 1, we can identify some semantic relations between an instance of Ingredient layer with its synonyms, hyponyms and hypernyms, such as - NLP is a Natural Language Processing - NLP such as Informatics, information processing - Hierarchical structure includes Data structure - Data structure such as organization, system - RAM is random access memory - Core memory includes RAM 3.3 Algorithm for Identifying Semantic Relations based on Syntax Patterns and Linguistic To enrich the sentence layer of ontology, we use English syntax patterns to extract sentences re- lated to instances of ingredient layer. This process is implemented during the extraction of in- stances. For this purpose, we use OpenNLP tool to recognize sentences. Then, we use a Stanford Lexical Dependency Parser (SLDP) to refine them. SLDP can output typed dependency tree to present a grammatical relationship between keywords in the sentence. After that, we apply lin- guistic markers to the sentences to recognize their semantic meaning. Our proposed model is shown in Figure. 2.
  • 5. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 5 Figure2. Identifying Semantic Relations based on Pattern and Linguistic Our training corpus (2000 papers) for testing is focused on the field of Information Technology. Electronic documents were automatically collected from ACM digital library. We designed ele- ven English patterns to select sentences identified by OpenNLP. The English patterns are shown in table 2. Table 2. English Patterns used to extract sentences No Pattern Example 1 S + V Computer is broken 2 S + V + Object He was mowing the lawn 3 S + V + Adjective The girl was tall 4 S + V + Indirect Object The woman went to the house 5 S + V + Direct Object The man hit the ball 6 S + V + Complement Some students in the class are engi- neers 7 S + V + Prepositional Phrase The cat waited for its owner yester- day 8 S +V+ Indirect Object+ Direct Object Granny left Gary all of her money 9 S + V + Direct Object + Adjective The Jury found the defendant guilty 10 S + V + Direct Object + NP The Jury found the defendant guilty 11 S + V + Direct Object + Complement The class picked Susieclass repre- sentative. After extracting sentences based on the English patterns, we refine them by SLDP. With SLDP, we eliminate the unnecessary words in a sentence. Below are some examples of such sentences.
  • 6. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 6 Table 3. Examples of eliminating unnecessary words Before applying SLDP After applying SLDP COBOL is not popular programming language in recent years. COBOL is not popular programming lan- guage. Oracle database is one of the Relation- al Database Management System. Oracle is Relational Database Manage- ment System. In my opinion Java Language is diffi- cult to program Java Language is difficult to program C Sharp in Microsoft Dot Net is also an Object Oriented programming lan- guage C Sharp is Object Oriented programming language In our ontology, most of the concepts have semantic relations between each other. In order to identify those semantic relations, we rely on linguistic roles (linguistic marker), which are defined as follows: IS-A: this generic-specific relation reflects hierarchical inheritance in network of concepts. All entities are categorized as instances of a particular class. Class can become instances of a particu- lar class. Thus, any concepts can be linked to its immediate super ordinate concept. For example, Random Access Memory (concept) is a core memory (concept) in computer. PART-OF: this relation also reflects the hierarchical structure of the domain. This relation direct- ly refers to the parts of each concept in sentence. For example, Random Access Memory (ROM) (concept) is part of memory (concept) MADE-OF: this relation links to concepts, which made of material concepts. For example, Inte- grated Circuit (IC) Chip can be made of a semiconductor material. DELIMITED-BY: this relation marks the boundaries, dividing one concept from another. This is a domain-specific relation, mainly for the concepts, which are belonged to different topic in the field of Information technology. This relation is usually represented by a number of verbs, such as include, delimit, limit, circumscribe, restrict, etc. For example, the processing of a computer is restricted by CPU, RAM. TAKES-PLACE-IN: this relation describes the context of processes, which are related to spatial and temporal dimensions. A number of verbs represent this relation, such as happen, occur, take place in, etc. For example, In order to tackle the conflict of process, time scheduling takes place in the Operating System. ATTRIBUTE-OF: this relation is only useful for concepts designated by specialized adjectives, such as strong, powerful, etc., or nouns that define the properties of other concepts. For example, these router devices are powerful and useful in a network. RESULT-OF: this relation is relevant to either processes or entities that are derived from other processes. For example, as a result of the inconsistency, this file is considered corrupted. AFFECTS: this relation, along with RESULT-OF, is a crucial semantic relation in the knowledge base since both can relate all kinds of concepts in ontology.
  • 7. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 7 CAUSES: this relation is directly relevant to the processes or concepts that are derived from other processes. On the contrary of RESULT-OF, this relation usually represents negative meaning. These linguistic roles help us identify semantic relations between keywords in a sentence. There- fore, we can recognize sentence meaning and exactly categorize them. The semantic relations are represented, as shown in Figure. 3. Figure 3. Semantic relations in Information Technology Ontology 4. Experimental Results Over 2000 text documents from the ACM Digital Library were used for testing purposes. These documents with different topics belonged to the field of Information technology. In order to easily process the documents, we grouped them by category before extracting instances and concepts. In our paper, we chose randomsample corpora among 245 categories from ACM. The system’s performance was calculatedby using three measures: Precision, Recall and F- measure. They are calculated by each category in domain-specific ontology as below: ( ) = ( ) ( ) + ( ) = ( ) ( ) + ( ) F − Mesure = 2 Precision ∗ Recall Precision + Recall Where Ci represents a category in ITO and correct, bad, missing represented the number of cor- rect, wrong, missing, respectively. Experiment results are shown in table 4, 5, 6, 7, and 8.
  • 8. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 8 Table 4. Experiments results on instances of ingredient layer Category Qualityof instances Precision (%) Recall (%) F-Measure (%) Application (AP_inst) 3672 79.26 76.51 77.86 Artificial Intelligent (AI_inst) 5714 82.94 78.92 80.88 Logic Design (LD_inst) 4644 82.18 80.06 81.11 Operating System (OS_inst) 6785 84.47 81.37 82.89 Process Management (PM_inst) 3056 76.53 72.51 74.47 Software (Soft_inst) 4249 81.64 79.62 80.62 Table 5. Experiments results on set of synonyms Category Quality of synonym Precision (%) Recall (%) F-Measure (%) Application (AP_syno) 524 79.26 76.51 77.86 Artificial Intelligent (AI_syno) 689 94.41 88.15 91.17 Logic Design (LD_syno) 472 92.24 84.27 88.08 Operating System (OS_syno) 861 96.18 91.58 93.82 Process Management (PM_syno) 517 93.25 86.16 89.56 Software (SW_syno) 583 94.26 89.04 91.57 Table 6.Experiments results on set of Hyponyms Category Qualityof Hyponym Precision (%) Recall (%) F-Measure (%) Application (AP_hypo) 714 89.38 76.51 82.45 ArtificialIntelligent (AI_hypo) 837 96.14 88.29 92.04 Logic Design (LD_hypo) 718 87.54 84.26 85.86 Operating System (OS_hypo) 972 96.82 91.42 94.04 Process Management (PM_hypo) 728 88.31 85.15 86.70 Software (SW_hypo) 646 85.64 81.04 83.28
  • 9. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 9 Table 7. Experiments results on set of Hypernyms Category Qualityof Hypernym Precision (%) Recall (%) F-Measure (%) Application (AP_hype) 916 79.26 76.51 77.86 Artificial Intelligent (AI_hype) 1321 92.41 91.17 91.79 Logic Design (LD_hype) 954 84.62 79.37 81.91 Operating System (OS_hype) 1413 95.04 96.81 95.92 Process Management (PM_hype) 834 82.31 84.55 83.41 Software (SW_hype) 893 85.48 80.19 82.75 Table 8. Experiments results on set of Sentences Category Qualityof Sentences Precision (%) Recall (%) F-Measure (%) Application (AP_sent) 683 83.26 80.75 81.98 Artificial Intelligent (AI_sent) 972 87.93 85.64 86.77 Logic Design (LD_sent) 589 81.52 79.18 80.33 Operating System (OS_sent) 1026 92.41 88.32 90.32 Process Management (PM_sent) 647 82.83 79.17 80.96 Software (SW_sent) 762 86.74 82.14 84.38 Figure 4. Experiment result evaluation of instances, synonyms, hyponyms and hypernymsbased on Preci- sion, Recall and F-Measures.
  • 10. International Journal of Information Sciences and Techniques (IJIST) Vol.4, No.3, May 2014 10 5. Conclusions Our experiment tried to identify semantic relations between nouns or noun phrases in unstruc- tured files based on linguistic roles in order to build domain ontology on Information Technology. After usingan OpenNLP tool to recognize words and sentences, we applied English patterns de- fined by us to extract them. We then refined them using the SDLC tool. Therefore, our experi- mental results have a high precision and high recall. In order to identify semantic relations, we applied linguistic roles. The semantic roles directly link to instances in the layers of domain on- tology. Also, we propose an algorithm to identify semantic relations based on synonyms, hypo- nyms and hypernyms from instances of the ingredient layer. Overall scores are computed based on three measures: Precision, Recall ad F-Measure. Efforts must also be invested in order to re- duce the overall processing time of the system. In future works, we will focus on building an Information Extraction system based on this ontol- ogy to solve a number of problems, such as ontology learning and Question and Answer. REFERENCES [1] V. MalaSe et al , "Detecting semantic relations between terms in definitions," in The 3rd Interna- tional Workshop on Computational Terminology - CompuTerm 2004 , 2004. [2] G. Zhou et al, "Tree Kernel-Based Semantic Relation Extraction with Rich Syntactic and Semantic Information," Information Sciences, vol. 180, no. 8, pp. 1313 - 1325, 2010. [3] Y. Jie et al, "Building Extraction from LIDAR based Semantic Analysis," Geo-Spatial Information Science, vol. 9, no. 4, Sep. 2006. [4] F. Gomez et al, "Semantic interpretation and knowledge extraction," Knowledge-Based Systems, vol. 20, no. 1, pp. 51 - 60, July 2006. [5] G.Kongkachandra et al, "Abductive Reasoning for Keyword Recovering in Semantic-based Key-word Extraction," in The Fifth International Conference on Information Technology: New Generations - IEEE, 2008, pp. 714 - 719. [6] Z. Goudong et al, "Tree kernel-based semantic relation extraction with rich syntactic and semantic information," Information Sciences, vol. 180, no. 8, pp. 1313 - 1325, Dec. 2009. [7] A.B Abacha et al, "Automatic Extraction of Semantic Relations between Medical Entities- a rule based approach," Journal of Biomedical Semantics, 2011. [8] A.D.S Jayatilaka, "Knowledge Extraction for Semantic Web Using Web Mining ," in The Interna- tional Conference on Advances in ICT for Emerging Regions (ICTer 2011) - IEEE, 2011, pp. 89 - 94. [9] H. Li et al, "A Relation Extraction Method of Chinese Named Entities based on Location and Se- mantic Features," Applied Intelligence, vol. 18, no. 1, pp. 1- 14, May 2012. [10] J. Euzenat et al, Ontology Matching.: Springer, 2007. [11] ACM. [Online].http://www.acm.org/about/class/ccs98-html. AUTHORS Tuoi Phan Thi, Professor. : She is a Professor at the Faculty of Computer Science and Engineering, HCMC University of Technology, Vietnam. She obtained her Ph.D. in Com- puter Science from Charles University, Czech Republic, in 1985. Her research interests are compiler, information retrieval, natural language processing. She has been the Chief Investi- gator of national key projects and published many papers in international journals and confe- rence proceedings in those areas. Chien D C Ta, Mr .: He is Ph.D. Student at the Faculty of Computer Science and Engineer- ing, HCMC University of Technology, Vietnam. His research interests include Natural Lan- guage Processing and its applications, Information Extraction.