Slides from presentation at Terminology Summer School 2016:
1) What is a Multilingual Knowledge System (MKS)
2) Three business cases: Systematic Terminology Work, Cross-border Interoperability, Cross-lingual Semantic Search
3) Summary: Key Role of MKS in Business
2. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Agenda
What is an MKS?
Three Business
Cases
Summary
3. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
LISE Project Research from 2011-2012:
Resources Increase, Quality Decreases
3
Oh, our classification had been built in English,
- no, I donât know whether there is one in
French
Searching on our intranet is a pain â Iâve just
researched for documents containing LCD screen â
nothing found! ... Should have known that they are
all tagged with monitor.
âManual revision of the keyword
lists are not possible any moreâ
(AUP, AT)
Duplicates, inconsistencies, gaps and
content coverage problems after
merging two resources (Imaging
company, UK)
âDifficulty to ensure
consistency, in terms of
quality, coverage,
completenessâ (EU
Representative)
4. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Divide et Impera!
4
Today:
Unstructured
haystack of
concepts
Better:
Ordered scheme, a
taxonomy
Introduce control
Turn information
into knowledge
5. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Taxonomy, thesaurus, ontology tools:
Good, but not for language needs
5
Though, very helpful
⢠Classifications,
nomenclatures
⢠Tagging in CMS
⢠Semantic Search
⢠Standards: SKOS, OWL
⢠Conferences: semantics,
KMWorld, SemTechBiz,
Wissensmanagementtage
Not for Language
⢠For trained experts only (!)
⢠Lexically organised
⢠Weak in managing
synonyms
⢠Weak in multilingualism
⢠Not for describing
terminology data
6. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Termbases, Glossaries
⢠Control language
⢠Focus on translation
⢠Lack knowledge modelling
⢠Only searching, no exploring
Taxonomies, Thesauri
⢠For knowledge structuring
⢠Lack multilingualism
⢠Lack language control
Before MKS: Two Parallel Approaches to
Inventorise and Leverage Knowledge
6
Huge, unaddressed potential for cross-lingual
data analysis, enterprise search, e-discovery and
to facilitate interoperability
would boost with data
would add structure
⌠âŚ
mirror base SpiegelfuĂ
wing mirror AuĂenspiegel
⌠âŚ
mirror
wing
mirror
left wing
mirror
âŚ
7. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael7
Before MKS: No Comprehensive Approach
ď˝ Termbases store concepts rather one by one
ď¨ fail to master large data, unsafe use
ď˝ Taxonomies achieve structure but ignore the
pragmatics of terminologies
ď¨ fail to manage multilingual data
ď˝ No structured multilingual resources
ď¨ no crosslingual data analysis nor e-discovery,
missed leverage
8. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Unify through a Multidimensional
Repository for Knowledge and Language
8
08
45
76
35
17
1: Taxonomy: ď
output
devices
visual
output
devices
screen
audio
output
devices
head-
phones
9. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Unify through a Multidimensional
Repository for Knowledge and Language
9
3: Multilingualism: ďŚ
Synonymy
⢠screen ⢠monitor
⢠Êcran
⢠Bildschirm⢠Monitor ⢠Display
08
45
76
rejected
accepted
35
17
2: Synonymy: ď˘1: Taxonomy: ď 4: Control: 9
10. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Unify through a Multidimensional
Repository for Knowledge and Language
10
Synonymy
⢠screen ⢠monitor
⢠Êcran
⢠Bildschirm⢠Monitor ⢠Display
⢠visual output device
⢠Optisches Ausgabegerät
08
45
76
rejected
accepted
35
17
3: Multilingualism: ďŚ2: Synonymy: ď˘1: Taxonomy: ď 5: Meaning: ~4: Control: 9
11. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Solution: Multilingual Knowledge System
11
Inventory of Linked
Multilingual
Concepts
Fusion of Taxonomy
/ Ontology Graph
with Multilingual
Terminology
12. Coreon combines flexible tools
for creating, exploring, and
maintaining taxonomies with
terminology management.
Coreon is the most advanced
SaaS solution for multilingual
knowledge systems.
13. One System, One View, All Languages:
Concepts, Relations, Terms
13
Immediate broader /
narrower neighborhood
Concept meta
data
Terms and
synonyms
Extensive term
descriptors
Location in map
Alphabetic,
multilingual list
14. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Leverage Existing Assets for Your MKS
14
Multilingual Knowledge System
Multilingual Terminology List Knowledge Map
Structure
semantically
Cover Languages,
Variants, Registers
15. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Agenda
What is an MKS?
Three Business
Cases
Summary
16. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
The Business Case for:
Terminology and Data Maintainers
What is an MKS?
⢠Fusion of terminology with taxonomy /
thesaurus models
⢠Captures language and knowledge
Three Business Cases
Summary
Avoid a loss of
your investment
through
systematic
terminology work
17. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Ad-hoc Collecting Terms is Dangerous
17
Do you feel incomfortable by adding and
adding new data without a pause?
Where are the doublettes?
Did we translate some records twice?
Can we unify two larger terminology
resources?
Visualise and make above
navigatable.
Avoid noise and redundancies.
Achieve clarity and consistency.
18. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
(Re-)Gain Control through Structure
... ...
⌠âŚ
⌠âŚ
⌠âŚ
... ...
Yesterday:
List of terms
Goal:
Visualised Multilingual Concept Map
Schule school
Oberschule secondary school
Realschule intermediate school
Gymnasium grammar school
Grundschule elementary school
Schule
school
Oberschule
secondary
school
Gymnasium
grammar
school
Realschule
intermediate
school
Grundschule
elementary
school
19. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
For the Maintainer: Less Effort
through Term Creation in Context
19
Crafting terms
for more
specific
concepts
Crafting terms
in other
languages
Comparing
definitions,
attributes etc.
screen
LCD
screen
CRT
�
screen
LCD
screen
CRT
screen
CRT screen
CRT monitor
screen
monitor
screen
monitor
CRT screen
CRT monitor
screen
Bild-
schirm
screen
CRT
screen
Bild-
schirm
CRT Bild-
schirm
CRT
screen
CRT
Monitor
?
=
20. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Less Redundancies Means Quality
20
ď˝ Lexical approach:
Doublettes remain hidden
ď˝ Concept map:
Doublettes made transparent
21. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Develop Terminology in Context
22
Dangerous:
Tunnel view onto one isolated concept
Control through Context:
All related concepts in sight
22. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael 23
Benefits: Large Terminology Collections
under Control, thus Valuable
Trust
Investment
Protected
Quality
Control
Efficiency
Systematic
Approach
23. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
The Business Case for:
Cross-border Interoperability
What is an MKS?
⢠Fusion of terminology with taxonomy /
thesaurus models
⢠Captures language and knowledge
Three Business Cases
Summary
Harmonizing
Organisations
and their Systems
24. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Semantic Interoperability
ď˝ âWe are connecting systemsâ
ď˝ Vocabularies only for meta
data?
ď˝ This is rather syntactic
interoperability
25. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael 26
Problem: Cross-Border Interoperability
ď˝ Not a translation but a semantic
problem:
ď˝ Is Austriaâs Matura the same as
Germanyâs Abitur? And its
synonyms?
ď˝ What if a qualification like
RealschulabschluĂ doesnât exist in
other countries?
ď˝ âAgree on that we disagreeâ
26. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Creating a Unified Resource
â across Languages
27
UK SE DE
Multiple terminologies
are merged into a unified
resource.
Linguistic similarity search:
maps units that share
the same meaning.
27. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Concept Mapping Example:
Trademark Classifications
Remote controls for diapositive projectors
Remote controls for projectors
Controls for projectors
Remote controls for slide projectors
Uses advanced linguistic search activating strict metadata
rules to achieve meaning matching.
28. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
A Little Protocol: Processing such Large
Language Resources
Start
⢠110,000 English terms
⢠Other languages ranging from 8,000 to 60,000 terms
Clean
⢠Reduced English to 93,000 terms
Language
coverage
⢠93,000 terms in all EU languages too expensive without automation
⢠Therefore: Mining data for translations
⢠DB grew from 400,000 to 2.1m terms
Conceptual
clustering
⢠Non essential entries removed â DB shrunk to 1.5m terms
⢠Semi-automatic creation of 50,000 meaning clusters
Taxonomy
⢠Built for the top layers and concepts linked
29. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Success Story: Cross-border
Interoperability in Trademarks
30
⢠Seamless and
borderless intellectual
property registration,
opposition, and legal
processing, in all bodies
of the EU and the
member states
Organizational
Benefits
⢠My trademark, as it is
described in my native
language, can be
applied for globally
without changes and is
subject to the same
legislation all over the
EU
Commercial
Benefits
30. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
The Business Case for:
Digital Single Market
What is an MKS?
⢠Fusion of terminology with taxonomy /
thesaurus models
⢠Captures language and knowledge
Three Business Cases
Summary
Cross-lingual
semantic search
is a pre-requisite
for the European
DSM
31. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
The Vision of a Digital Single Market
âConsumers need to be able
to buy the best products
at the best prices,
wherever they are in Europe.â
Vice-President Ansip, Dec 2014
Accelerating growth through a connected Europe:
Speech at GSMA Mobile 360 conference in Brussels
http://europa.eu/rapid/press-release_SPEECH-14-2420_en.htm
32
32. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
âŚand the Reality
http://europa.eu/rapid/attachment/IP-15-
4653/en/Digital_Single_Market_Factsheet_20150325.pd
f
33
33. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Broken already by a Simple Search
<search string>
âRasenmäherâ
34
34. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Start in any Language
35
Start in any
language
Heisswasserkocher
bouilloires
hervidores
35. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Decompose the Query String
36
Morphological
analysis
Heiss|wasser|kocher
bouilloire|s
hervidor|es
36. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Iterate via âNormal Languageâ Synonyms
37
Morphological
analysis
Expand paradigm
through standard
synonyms
heiss
heiĂ
Wasser
H20
Boiler
Kocher
Erhitzer
bouilloire
hervidor
37. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Now Locate a Concept in the MKS
38
Morphological
analysis
Expand paradigm through
standard synonyms
Locate
concept
in MKS
heiss
heiĂ
Wasser
H20
Boiler
Kocher
Erhitzer
bouilloire
hervidor
HeiĂwasserkocher
bouilloire
hervidor
38. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Realize your Domain Specific Synonyms
39
Morphological
analysis
Expand paradigm through
standard synonyms
Locate concept
in MKS
Realize
weighted
domain
synonyms
HeiĂweisserkocher
HWK
bouilloire
thÊière
hervidor
Synonyms are
weighted, i.e. carry
attributes to help
disambiguating and
improve ranking
39. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Consider then the Related Concepts
40
Morphological
analysis
Expand paradigm through
standard synonyms
Locate concept
in MKS
Realize weighted
domain synonyms
Semantic
expansion
through
concept map
Samowar
âCaykolik Samowarâ
40. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search:
Now Match the Item in your Warehouse DB
41
Caykolik
Samowar
Successful
identification of item
in target database
Caykolik 1,5 l Samowar
41. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Language Independent Semantic Search - Summary
From Heisswasserkocher to Caykolik 1,5l Samowar
42
Start in any
language
Heisswasserkocher
bouilloires
hervidores
Morphological
analysis
Expand paradigm through
standard synonyms
Locate concept
in MKS
Realize weighted
domain synonyms
Semantic expansion
through concept map
Caykolik 1,5 l Samowar
Successful
identification of item
in target database
42. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
MKS â Indispensable for Processing
Multilingual Data
Insights & Sense & Sentiment
Language Detection
NLP/Tokenization
ML
TextAnalytics
Search
MT
Provenance
Multilingual
Knowledge
System
43. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael 44
Benefits: Enable the Digital Single Market
ď˝ Find items
ď˝ with any word
ď˝ in any spelling
ď˝ in any language
ď˝ Find even related relevant items
through concept map
(âDid you mean âŚ?â)
ď˝ Have more first-time purchasers
ď˝ Increase international business
ď˝ Increase e-sales
ď˝ Happy audience
44. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Agenda
What is an MKS?
Three Business
Cases
Summary
45. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Enterprise
Search
Social Media
Analysis
Document
Classification
Inter-
operability
Staff Training
Globalisation
Multilingual Knowledge System Plays
Key Role in many Business Processes
46. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Coreon GmbH
MKS is the Companyâs Core Focus
⢠Spin-off from ESTeam
(established 1995)
⢠Initiated through market and
technology research in
context of EU FP7 project LISE
Since 2012
⢠Strong entrepreneurship
community
⢠Europeâs
âLanguage Valleyâ
Headquarter in Berlin
48
Jochen Hummel, CEO
⢠founder of TRADOS
⢠ESTeam CEO, Chair of
LTInnovate
Michael Wetzel, MD
⢠TRADOS/SDL MultiTerm (1997-
2011)
⢠Contributor to TMX, TBX, OLIF
Gudrun Magnusdottir,
CSO
⢠ESTeam CSO
Lambros Kranias, CTO
⢠ESTeam CTO
⢠20+ years NLP
Key Staff
47. 14-07-2016 MKS â Multilingual Knowledge Systems @wetzelmichael
Multilingual Knowledge Systems
A fusion of terminology with taxonomy / thesaurus,
to capture language with knowledge in a holistic way
Most systematic and visual approach to control
large terminology collections
Huge value in various business processes, industries
and segments
49