To enable knowledge access across languages, ontologies that are often represented only in English, need to be translated into different languages. The main challenge in translating ontologies is to find the right term with respect to the domain modeled by ontology itself. Machine translation services may help in this task; however, a crucial requirement is to have translations validated by experts before the ontologies are deployed. Real-world applications must implement a support system addressing this task for relieve experts work in validating all translations. In this paper, we present ESSOT, an Expert Supporting System for Ontology Translation. The peculiarity of this system is to exploit semantic information of the concept's context for improving the quality of label translations. The system has been tested both within the Organic.Lingua project by translating the modeled ontology in three languages and on other multilingual ontologies in order to evaluate the effectiveness of the system in other contexts. The results have been compared with the translations provided by the Microsoft Translator API and the improvements demonstrated the viability of the proposed approach.
1. Translating Ontologies in
Real-World Settings with ESSOT
Mihael Arcan1, Mauro Dragoni2, and Paul Buitelaar1
1Insight Centre for Data Analytics, NUI Galway
2Fondazione Bruno Kessler (FBK), Shape and Evolving Living Knowledge Unit (SHELL)
International Semantic Web Conference 2016
Kobe, Japan, October 19th 2016
1
2. Presentationâs main points
ï± Our Goal:
to demonstrate why a domain-aware machine translation system can
help domain experts in translating ontologies
2
ESSOT: a collaborative knowledge management
architecture for ontology translation
Background &
Motivation
Why we did thisâŠ
Implementation
The machine translation
system and the domain-
expert facilities
Validation
Real-world scenarios in
which ESSOT has been
validated
3. Motivation 1 â Breaking language barriers
3
ï± Ontologies created by people do not know other languages than their
mother tongue
ï± Collaborative work and alignment in a multi-lingual scenario.
ï§ end-user applications
4. Motivation 2 â Knowledge Dissemination
4
?
ï± Artefacts are created in specific languages due to policy constraints
5. Translation Task: Documents vs Ontologies
DOCUMENT
ï§ Context 1 (GefĂ€Ă): The blood
vessels are the part of the
circulatory system that transports
blood throughout the human
body. There are three major
types of blood vessels
ï§ Context 2 (Schiff): Ships are
typically large ocean-going
vessels; whereas boats are
smaller, and typically travel most
often on inland or coastal waters.
5
vessels
parentOf
labelx
parentOf
labely
parentOf
ONTOLOGY
Example: ââvesselsâ ï GefÀà or Schiff
13. Evaluation Procedure
13
ï± Qualitative evaluation:
We collected subjective judgments from the language experts involved in the evaluation of
the tool on general usability of the components and to provide feedback for future
improvements.
ï§ Research Questions 1: Does the proposed system provide an effective support, in terms of the
quality of suggested translations, to the management of multilingual ontologies?
ï§ Research Questions 2: Do the MoKi functionalities provide an effective support to the collaborative
management of a multilingual ontology?
ï± Quantitative Evaluation:
We collected objective measures concerning the effectiveness of the translations suggested
by the embedded machine translation service.
ï§ 3 metrics: BLEU, METEOR, TER
ï§ 3 language pairs: ENïï DE, ENïï IT, ENïï ES
ï§ 6 ontologies:
âą Organic.Lingua, Presto
âą TheSoz, Geoskill, DOAP, STW
14. Qualitative Evaluation Results
ï± Language Experts view
ï§ Pros: Easy to use for managing translations (9)
ï§ Cons: Usable interface for showing concept translations (3)
ï± Approval and Discussion
ï§ Pros: Pending approvals give a clear situation about concept status (4)
ï§ Cons: Discussion masks are not very useful and the approval process might be
improved (8)
ï± Quick Translation feature
ï§ Pros: Best facility for translating concepts (8)
ï§ Cons: Improvable interface design (3)
14
15. Quantitative Evaluation Results
15
English ïï German English ïï Italian
English ïï Spanish
âą General robustness of the
approach
âą ENïï DE: wrong translations
needed many editing operations
âą Computational time to improve
o avg. 4.1 seconds per request
16. Conclusions
16
ï± ESSOT integrates the OTTO domain-adaptable semantic translation
service and the MoKi collaborative knowledge management tool in an
effective pipeline for multilingual ontology management.
ï± ESSOT provides helpful suggestions for performing ontology translation
(RQ1) and the provided interfaces are usable and useful for supporting
the language experts in the translation activity (RQ2)
ï± The translation quantitative evaluation shows significant improvements
over the Microsoft Translation system
ï± Future work:
ï§ to extend this strategy to other problems (e.g. ontology enrichment);
ï§ improve the approval management task;
ï§ improve the service efficiency.
SEE YOU LATER AT THE DEMO SESSION !!!!!
17. 17
Itâs time for questionsâŠ
Mauro Dragoni
Fondazione Bruno Kessler
https://shell.fbk.eu/index.php/Mauro_Dragoni
dragoni@fbk.eu