Text-based knowledge acquisition tools

Abdoulaye Guissé, Adeline Nazarenko, François Lévy, Nouha Omrane, Sylvie Szulman (Paris13)
Why start with texts?
• Domain knowledge cannot be extracted from texts fully automatically
• Texts are nevertheless useful
  – Texts are available data (unlike experts)
  – Texts partly reflect the domain conceptualisation (TBox)
  – Texts may contain pieces of factual knowledge (ABox)
  – Policy documents express business rules
  – It is often important to trace knowledge back to its textual sources
• Natural Language Processing in ONTORULE
  – Acquiring knowledge from written policies
  – Enriching NLP tools with SBVR-based functionalities (metamodel and SE)
  – Integrating policy documents into the management system
Text-based knowledge acquisition tools

• Terminae
   Interactive acquisition of domain ontological
   knowledge (conceptual vocabulary including
   concepts, concept definitions, roles and some
   instances)
• Semex
   Combination of information extraction techniques and
   manual modelling for the acquisition of rules
   expressed in terms of the conceptual vocabulary
Overall acquisition approach
Text-based ontology acquisition: Terminae
Using Terminae for ontology acquisition
Building a lexicalized ontology from texts

• Goals
  – Building a domain ontology
  – Documentation
     • Traceability to source documents
  – Semantic annotation of source documents
     • Querying the text
Terminae
Extraction step

– Extract the list of candidate terms from the acquisition corpus using term
  extractor tools

Normalisation step

– Filter and select the relevant meanings of ambiguous terms (clustering terms)
  (e.g. member: airline participant / customer)
– Create and structure termino-concepts (relevant and disambiguated
  terms of the domain)

Formalisation step

– Create concepts and instances linked to each termino-concept
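
The three steps above can be sketched in a few lines of Python. This is only an illustrative toy, not Terminae's actual implementation: Terminae is interactive and relies on real term extractors, whereas here extraction is a plain n-gram count, and the analyst's choices are passed in as a dictionary. All names (`extract_candidate_terms`, `normalise`, `formalise`, `TerminoConcept`, `Concept`, the stopword list and the sample corpus) are assumptions made for the example.

```python
import re
from collections import Counter
from dataclasses import dataclass, field

# Assumption: a tiny stopword list, only large enough for the toy corpus below.
STOPWORDS = {"a", "an", "the", "of", "is", "may", "must", "be"}


def extract_candidate_terms(corpus, max_len=2):
    """Extraction step (toy): count word n-grams that neither start nor end with a stopword."""
    counts = Counter()
    for sentence in re.split(r"[.!?]", corpus.lower()):
        words = re.findall(r"[a-z]+", sentence)
        for n in range(1, max_len + 1):
            for i in range(len(words) - n + 1):
                gram = words[i:i + n]
                if gram[0] not in STOPWORDS and gram[-1] not in STOPWORDS:
                    counts[" ".join(gram)] += 1
    return counts


@dataclass
class TerminoConcept:
    term: str   # the term as found in the corpus
    sense: str  # the meaning selected by the analyst


def normalise(candidates, analyst_choices):
    """Normalisation step: keep extracted terms only, one termino-concept per selected sense."""
    return [
        TerminoConcept(term=term, sense=sense)
        for term, senses in analyst_choices.items()
        if term in candidates
        for sense in senses
    ]


@dataclass
class Concept:
    name: str
    termino_concept: TerminoConcept  # link kept for traceability to the text
    instances: list = field(default_factory=list)


def formalise(termino_concepts):
    """Formalisation step: create one concept per termino-concept."""
    return [
        Concept(name="".join(w.capitalize() for w in tc.sense.split()), termino_concept=tc)
        for tc in termino_concepts
    ]


if __name__ == "__main__":
    corpus = "A member may earn miles. The airline is a member of the alliance."
    candidates = extract_candidate_terms(corpus)
    # The ambiguous term "member" is split into the two senses named on the slide.
    choices = {"member": ["airline participant", "customer"]}
    for concept in formalise(normalise(candidates, choices)):
        print(concept.name, "<-", concept.termino_concept.term)
```

Keeping a reference from each concept back to its termino-concept, and from there to the term, is what makes the traceability to source documents possible in this sketch.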
Rule editing and navigation: Semex
Integrating policy documents in BRMSs

• Goals
  – Rule Acquisition
  – Documentation
     • Traceability to source documents
     • Verbalisation and presentation of internal knowledge
  – Maintenance
     • Consistency checking
     • Policy evolution




                                                              A project funded
                                                              by
Underlying Index structure




Editing rules: an example




Semex
Rule acquisition

– Rule fragment selection
– Rule transformation
   • Revision
        – Normalisation of the vocabulary
        – Syntax simplification
        – Verbalisation of implicit statements
   • Decomposition

Rule exploration

– Navigation interface
– SPARQL interface for advanced queries
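
Two of the transformation operations listed above, normalisation of the vocabulary and decomposition, can be illustrated with a minimal Python sketch. In Semex these operations are carried out interactively by the modeller; the code below only mimics their effect on one sentence pattern. The function names, the variant-to-term lexicon and the second obligation ("must be recorded") are hypothetical and not taken from the slides.

```python
import re


def normalise_vocabulary(fragment, preferred_terms):
    """Replace each variant by its preferred term (whole words, case-insensitive)."""
    for variant, term in preferred_terms.items():
        fragment = re.sub(rf"\b{re.escape(variant)}\b", term, fragment, flags=re.IGNORECASE)
    return fragment


def decompose(fragment):
    """Split 'X must A and must B.' into one candidate rule per obligation."""
    subject, separator, rest = fragment.rstrip(".").partition(" must ")
    if not separator:
        # No obligation found: leave the fragment untouched.
        return [fragment]
    obligations = re.split(r"\s+and\s+must\s+", rest)
    return [f"{subject} must {obligation}." for obligation in obligations]


if __name__ == "__main__":
    # Hypothetical rule fragment selected from a policy document.
    fragment = "The temp of the test must be greater than 15 C and must be recorded."
    # Hypothetical lexicon mapping textual variants to the conceptual vocabulary.
    lexicon = {"temp": "temperature", "test": "micro_slip_test"}
    for rule in decompose(normalise_vocabulary(fragment, lexicon)):
        print(rule)
```

Each resulting candidate rule uses only terms of the conceptual vocabulary and carries a single obligation, which makes it easier to edit and to link back to its source fragment.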
Structure of the candidate rules




Interlinked SBVR-SE statement
The temperature of the micro_slip_test must be greater than 15 C.
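
To make the idea of a structured candidate rule concrete, the statement above can be turned into a small data structure and checked against a measured value. This is a hedged sketch, not the Semex data model and not an SBVR Structured English parser: the regular expression covers only this one sentence pattern, and the names `Constraint`, `parse_statement` and `is_satisfied_by` are assumptions introduced for the example.

```python
import operator
import re
from dataclasses import dataclass

# Assumption: only the pattern "The <attribute> of the <entity> must be
# greater/less than <value> <unit>." is recognised.
PATTERN = re.compile(
    r"The (?P<attribute>\w+) of the (?P<entity>\w+) must be "
    r"(?P<comparison>greater than|less than) "
    r"(?P<value>\d+(?:\.\d+)?) (?P<unit>\w+)\."
)
COMPARISONS = {"greater than": operator.gt, "less than": operator.lt}


@dataclass(frozen=True)
class Constraint:
    entity: str
    attribute: str
    comparison: str
    value: float
    unit: str
    source: str  # original statement, kept for traceability

    def is_satisfied_by(self, measured):
        """Return True if the measured value respects the constraint."""
        return COMPARISONS[self.comparison](measured, self.value)


def parse_statement(text):
    """Parse one statement of the supported pattern into a Constraint."""
    match = PATTERN.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unsupported statement: {text!r}")
    return Constraint(
        entity=match["entity"],
        attribute=match["attribute"],
        comparison=match["comparison"],
        value=float(match["value"]),
        unit=match["unit"],
        source=text,
    )


if __name__ == "__main__":
    rule = parse_statement(
        "The temperature of the micro_slip_test must be greater than 15 C."
    )
    print(rule.entity, rule.attribute, rule.comparison, rule.value, rule.unit)
    print(rule.is_satisfied_by(20.0))
```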
Demonstration




