SlideShare ist ein Scribd-Unternehmen logo
1 von 10
Downloaden Sie, um offline zu lesen
Analysis of texts to logical
            contradictions
       Ontologs LLC, Moscow, 2012
Motivation
50 years ago: 60th
   The exponential growth of information flows in an era of scientific and
    technological revolution
   People no longer cope with the processing
   Solution: machine processes data

Challenges of our time
   Exponential growth of
    knowledge available to man
   It’s physically impossible to just
    read the number of publications
    on the appropriate category
   Solution: machine must process
    knowledge
Systematization of knowledge
Knowledge extraction
 Sources - both structured and unstructured data
 Unstructured data - texts in natural language
 Intelligent processing of texts - an extraction of knowledge from them
 The purpose of knowledge extraction - its automatic systematization

Methods of knowledge systematization
 Text classification
 Text summarization
 Copyright analysis of texts
 Sentiment analysis of texts


Problems
 Symbol and lexical language levels
 Dependence on the language of the text
 Insufficient accuracy
 The lack of processing context
Levels of language representation
Symbolic level                                      Lexical level
   Symbol classes: alphabetic characters,             Language dictionaries.
    spaces, punctuation, etc.                          Knowledge of inflection.
   Use: The copyright analysis, determination of      Use: The copyright analysis, text search, etc.
    authorship, etc.                                “Brown fox jumps…”
“Symbol classes: alphabetic characters,
                                                    brown (adj.) fox (noun) jump (verb)
spaces, punctuation, etc.”
“symbolclasses” “alphabeticcharacters”
“spaces” “punctuation” “etc”


Syntax level                                        Semantic level
   Grammars.                                          Elements of ontology.
   Congruence.                                        Congruence of ontology’s objects.
   Use: a selection of "correct" phrases,             Use: a selection of correct semantic
    terminological analysis                             structures
“Brown fox jumps…”                                  “Brown fox jumps…”
brown (quality) fox (subject) jump                  brown (color) fox (kind of mammal) jump
(predicate)                                         (action, move)
Analysis on semantic level

Semantic structure of texts
   Ontology – the formal knowledge description for human and machines
   The system automatically extracts knowledge from a text document and creates an ontology
   Ontology of text - language-independent machine representation of the semantic content of
    the text

Tools for ontology manipulation
   Languages: RDF (Resource Description
    Framework) and OWL (Web Ontology
    Language)
   Libraries: Jena - open implementation
    of ​RDF and OWL with the possibility of
    inference
   Ontology of a text - logical theory, written
    in the language OWL (RDF)
Logical analysis of texts
Semantics and language constructs
 Domain-based skeleton ontology - a conceptual framework
 Conceptual scheme - a set of classes and relationships between classes
 Elements of the conceptual schema associated with language expressions

Knowledge extratcion
 Input: a text and an ontology with language
  expressions
 The text is analyzed and objects of an ontology
  are allocated - instances of classes and relations
  of the conceptual scheme
Logical analysis
 A set of logical rules that define the conditions
  for the correctness of ontology elements
 Logic formulas to express the issues of
  correctness
 Verification procedure for the contradictions in
  an ontology
Example – ontology of events
Ontology of events
 Events - incidents, sports, meeting government officials, etc.
 List of participants of the event contains information about
  persons
 Person class objects - instance “Vladimir Putin”

Language expressions for object         class Person
“Vladimir Putin”                         First Name: string
 Vladimir Putin                         Second Name: string

 V. Putin                               Birth Date: date
                                         Position: string
 Vladimir Vladimirovich Putin
                                         Location List: list of (place, date)
 President Putin                        Spouse: string
 Pesident of Russian Federation         Children: list of sting
  [time context: 7 of May, 2000 -7 of    …
  May, 2008, 7 of May 2012 - present]
Objects and facts extraction
White Papers
   President of Russia Vladimir Putin and German Chancellor
    held talks in Berlin 1 of June, 2012. Following the talks, Mr
    Putin Ms Merkel made press statements and answered
    journalists’ questions.
                                              Putin : Person

The ontology of the text                       First Name: Vladimir
                                               Second Name: Putin
   Event class object – {“Working visit to    Birth Date: 7 of October, 1952
    Germany”, 01.06.2012, Participants:        Position: President
    (Vladimir Putin, Angela Merkel), …}
                                               Location List: …
   Person class object – {“Putin”,            Spouse: …
    “Vladimir”, 07.10.1952,…}                  Children: …
                                               …
Identification of contradictions
Identification of contradictions on objects extraction phase
   Suppose an article from the example indicated date: 1 of June 2011
   Language expression ”President of Russian Vladimir Putin” is incorrect
    because temporal context (1 of June 2011) contradicts the content of the
    corresponding object class Person - Russian President [temporary context:
    7 of May 2000 - 7 of May 2008, 7 of May 2012 - present]
Identification of contradictions on logical analysis phase
   Suppose you have an article with the text “1 of June 2012 Vladimir Putin
    isited to inspect the bridge to the island in the Russian city of Vladivostok”
   Event class object is extracted. The object contradicts the fact in Event
    ontology: “Working visit to Germany” because time of events is the same,
    but places are different
Questions and contacts
Contacts:
   Ontologs LLC, 119634, Russia, Moscow, Borovskoie shosse,
    44/3
   www.онтологика.рф, www.ontologs.ru, www.ontologs.com
   Email: info.ontologs.ru

Weitere ähnliche Inhalte

Ähnlich wie Analysis of Texts for Logical Contradictions

Lri Owl And Ontologies 04 04
Lri Owl And Ontologies 04 04Lri Owl And Ontologies 04 04
Lri Owl And Ontologies 04 04Rinke Hoekstra
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Toby Burrows
 
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...Timo Honkela: Introductory lecture of the seminar course on Computational Pra...
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...Timo Honkela
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - OntologiesSerge Linckels
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHgerogepatton
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHijaia
 
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...Cynthia King
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...tmra
 
(48) (human cognitive processing) alexander ziem frames of understanding in t...
(48) (human cognitive processing) alexander ziem frames of understanding in t...(48) (human cognitive processing) alexander ziem frames of understanding in t...
(48) (human cognitive processing) alexander ziem frames of understanding in t...Nelli17
 
Language and Knowledge: Against Modularity as a Viable Theory of Language an...
Language and Knowledge: Against Modularity  as a Viable Theory of Language an...Language and Knowledge: Against Modularity  as a Viable Theory of Language an...
Language and Knowledge: Against Modularity as a Viable Theory of Language an...Dominik Lukes
 
Textual analysis and the importance of texting theory
Textual analysis and the importance of texting theoryTextual analysis and the importance of texting theory
Textual analysis and the importance of texting theoryShubham168021
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptOlusolaTop
 
Literary theory & criticism pt. 1: Formalism
Literary theory & criticism pt. 1: FormalismLiterary theory & criticism pt. 1: Formalism
Literary theory & criticism pt. 1: FormalismMansa Daby
 
Looking beyond the script
Looking beyond the scriptLooking beyond the script
Looking beyond the scriptETCenter
 

Ähnlich wie Analysis of Texts for Logical Contradictions (19)

Lri Owl And Ontologies 04 04
Lri Owl And Ontologies 04 04Lri Owl And Ontologies 04 04
Lri Owl And Ontologies 04 04
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...
 
Vivian
VivianVivian
Vivian
 
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...Timo Honkela: Introductory lecture of the seminar course on Computational Pra...
Timo Honkela: Introductory lecture of the seminar course on Computational Pra...
 
Lec 1
Lec 1Lec 1
Lec 1
 
Lidia Pivovarova
Lidia PivovarovaLidia Pivovarova
Lidia Pivovarova
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - Ontologies
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
 
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...
An Investigation Of The Political Discourse Of Obama S Selected Speeches ----...
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...
 
(48) (human cognitive processing) alexander ziem frames of understanding in t...
(48) (human cognitive processing) alexander ziem frames of understanding in t...(48) (human cognitive processing) alexander ziem frames of understanding in t...
(48) (human cognitive processing) alexander ziem frames of understanding in t...
 
Language and Knowledge: Against Modularity as a Viable Theory of Language an...
Language and Knowledge: Against Modularity  as a Viable Theory of Language an...Language and Knowledge: Against Modularity  as a Viable Theory of Language an...
Language and Knowledge: Against Modularity as a Viable Theory of Language an...
 
L1
L1L1
L1
 
Textual analysis and the importance of texting theory
Textual analysis and the importance of texting theoryTextual analysis and the importance of texting theory
Textual analysis and the importance of texting theory
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
Nlp
NlpNlp
Nlp
 
Literary theory & criticism pt. 1: Formalism
Literary theory & criticism pt. 1: FormalismLiterary theory & criticism pt. 1: Formalism
Literary theory & criticism pt. 1: Formalism
 
Looking beyond the script
Looking beyond the scriptLooking beyond the script
Looking beyond the script
 

Kürzlich hochgeladen

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Kürzlich hochgeladen (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

Analysis of Texts for Logical Contradictions

  • 1. Analysis of texts to logical contradictions Ontologs LLC, Moscow, 2012
  • 2. Motivation 50 years ago: 60th  The exponential growth of information flows in an era of scientific and technological revolution  People no longer cope with the processing  Solution: machine processes data Challenges of our time  Exponential growth of knowledge available to man  It’s physically impossible to just read the number of publications on the appropriate category  Solution: machine must process knowledge
  • 3. Systematization of knowledge Knowledge extraction  Sources - both structured and unstructured data  Unstructured data - texts in natural language  Intelligent processing of texts - an extraction of knowledge from them  The purpose of knowledge extraction - its automatic systematization Methods of knowledge systematization  Text classification  Text summarization  Copyright analysis of texts  Sentiment analysis of texts Problems  Symbol and lexical language levels  Dependence on the language of the text  Insufficient accuracy  The lack of processing context
  • 4. Levels of language representation Symbolic level Lexical level  Symbol classes: alphabetic characters,  Language dictionaries. spaces, punctuation, etc.  Knowledge of inflection.  Use: The copyright analysis, determination of  Use: The copyright analysis, text search, etc. authorship, etc. “Brown fox jumps…” “Symbol classes: alphabetic characters, brown (adj.) fox (noun) jump (verb) spaces, punctuation, etc.” “symbolclasses” “alphabeticcharacters” “spaces” “punctuation” “etc” Syntax level Semantic level  Grammars.  Elements of ontology.  Congruence.  Congruence of ontology’s objects.  Use: a selection of "correct" phrases,  Use: a selection of correct semantic terminological analysis structures “Brown fox jumps…” “Brown fox jumps…” brown (quality) fox (subject) jump brown (color) fox (kind of mammal) jump (predicate) (action, move)
  • 5. Analysis on semantic level Semantic structure of texts  Ontology – the formal knowledge description for human and machines  The system automatically extracts knowledge from a text document and creates an ontology  Ontology of text - language-independent machine representation of the semantic content of the text Tools for ontology manipulation  Languages: RDF (Resource Description Framework) and OWL (Web Ontology Language)  Libraries: Jena - open implementation of ​RDF and OWL with the possibility of inference  Ontology of a text - logical theory, written in the language OWL (RDF)
  • 6. Logical analysis of texts Semantics and language constructs  Domain-based skeleton ontology - a conceptual framework  Conceptual scheme - a set of classes and relationships between classes  Elements of the conceptual schema associated with language expressions Knowledge extratcion  Input: a text and an ontology with language expressions  The text is analyzed and objects of an ontology are allocated - instances of classes and relations of the conceptual scheme Logical analysis  A set of logical rules that define the conditions for the correctness of ontology elements  Logic formulas to express the issues of correctness  Verification procedure for the contradictions in an ontology
  • 7. Example – ontology of events Ontology of events  Events - incidents, sports, meeting government officials, etc.  List of participants of the event contains information about persons  Person class objects - instance “Vladimir Putin” Language expressions for object class Person “Vladimir Putin” First Name: string  Vladimir Putin Second Name: string  V. Putin Birth Date: date Position: string  Vladimir Vladimirovich Putin Location List: list of (place, date)  President Putin Spouse: string  Pesident of Russian Federation Children: list of sting [time context: 7 of May, 2000 -7 of … May, 2008, 7 of May 2012 - present]
  • 8. Objects and facts extraction White Papers  President of Russia Vladimir Putin and German Chancellor held talks in Berlin 1 of June, 2012. Following the talks, Mr Putin Ms Merkel made press statements and answered journalists’ questions. Putin : Person The ontology of the text First Name: Vladimir Second Name: Putin  Event class object – {“Working visit to Birth Date: 7 of October, 1952 Germany”, 01.06.2012, Participants: Position: President (Vladimir Putin, Angela Merkel), …} Location List: …  Person class object – {“Putin”, Spouse: … “Vladimir”, 07.10.1952,…} Children: … …
  • 9. Identification of contradictions Identification of contradictions on objects extraction phase  Suppose an article from the example indicated date: 1 of June 2011  Language expression ”President of Russian Vladimir Putin” is incorrect because temporal context (1 of June 2011) contradicts the content of the corresponding object class Person - Russian President [temporary context: 7 of May 2000 - 7 of May 2008, 7 of May 2012 - present] Identification of contradictions on logical analysis phase  Suppose you have an article with the text “1 of June 2012 Vladimir Putin isited to inspect the bridge to the island in the Russian city of Vladivostok”  Event class object is extracted. The object contradicts the fact in Event ontology: “Working visit to Germany” because time of events is the same, but places are different
  • 10. Questions and contacts Contacts:  Ontologs LLC, 119634, Russia, Moscow, Borovskoie shosse, 44/3  www.онтологика.рф, www.ontologs.ru, www.ontologs.com  Email: info.ontologs.ru