SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Georg Rehm
georg.rehm@dfki.de
DFKI GmbH, Language Technology Lab – Berlin, Germany
META-NET, General Secretary
Human Language Technologies
in a Multilingual Europe
Outline
• Multilingual Europe
• Analysis I: Technology Support for Europe’s Languages
• Analysis II: Status and Current Developments
• Example: LT for the Digital Single Market
• Missions and Opportunities
• Towards the Human Language Project
2EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
• Multilingualism is at the very heart of the European idea.
• 24 EU languages – all languages have the same status.
• Dozens of regional and minority languages as well as
languages of immigrants and trade partners.
• Economic challenges:
– If the DSM is not multilingual, there will be 20+ isolated markets!
– Language barriers are market barriers!
• Social and public challenges:
– Empower all citizens to use their mother tongues.
– Provide multilingual digital public services.
– Enable cross-border, cross-lingual, cross-cultural communication.
Towards a European public sphere and e-participation.
– Restore trust in media (fake news debate, filter bubble issue etc.)
Analysis I: Technology Support
for Europe’s Languages
4EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
q
60 research centres in 34 countries (founded in 2010)
Chair of Executive Board: Jan Hajic (CUNI)
Dep.: J. van Genabith (DFKI), A. Vasiljevs (Tilde)
General Secretary: Georg Rehm (DFKI)
q
Multilingual Europe
Technology Alliance.
826 members in
67 countries
(published in 2013) (31 volumes; published in 2012)
T4ME (META-NET) CESAR METANET4UMETA-NORDMultilingual Europe Technology AllianceNET
q Basque
q Bulgarian*
q Catalan
q Croatian*
q Czech*
q Danish*
q Dutch*
q English*
q Estonian*
q Finnish*
q French*
q Galician
q German*
q Greek*
q Hungarian*
q Icelandic
q Irish*
q Italian*
q Latvian*
q Lithuanian*
q Maltese*
q Norwegian
q Polish*
q Portuguese*
q Romanian*
q Serbian
q Slovak*
q Slovene*
q Spanish*
q Swedish*
q Welsh
* Official EU languagehttp://www.meta-net.eu/whitepapers
MT
English
good
French, Spanish
moderate fragmentary
Catalan, Dutch, German,
Hungarian, Italian, Polish,
Romanian
weak or no support through LT
Basque, Bulgarian, Croatian,
Czech, Danish, Estonian, Finnish,
Galician, Greek, Icelandic, Irish,
Latvian, Lithuanian, Maltese,
Norwegian, Portuguese, Serbian,
Slovak, Slovene, Swedish, Welsh
excellent
Czech, Dutch,
Finnish, French,
German, Italian,
Portuguese,
Spanish
moderate fragmentary
Basque, Bulgarian, Catalan,
Danish, Estonian, Galician,
Greek, Hungarian, Irish,
Norwegian, Polish, Serbian,
Slovak, Slovene, Swedish
weak or no support through LT
Croatian, Icelandic, Latvian,
Lithuanian, Maltese, Romanian,
Welsh
excellent
English
good
Speech
English
good
Dutch, French,
German, Italian,
Spanish
moderate fragmentary
Basque, Bulgarian, Catalan,
Czech, Danish, Finnish,
Galician, Greek, Hungarian,
Norwegian, Polish,
Portuguese, Romanian,
Slovak, Slovene, Swedish
weak or no support through LT
Croatian, Estonian, Icelandic, Irish,
Latvian, Lithuanian, Maltese,
Serbian, Welsh
excellent
English
good
Czech, Dutch,
French, German,
Hungarian, Italian,
Polish, Spanish,
Swedish
moderate fragmentary
Basque, Bulgarian, Catalan,
Croatian, Danish, Estonian,
Finnish, Galician, Greek,
Norwegian, Portuguese,
Romanian, Serbian, Slovak,
Slovene
Icelandic, Irish, Latvian,
Lithuanian, Maltese, Welsh
weak or no support through LTexcellent
ResourcesTextAnalytics
Fragmentary
Weak/none
Moderate
Good
Excellent
Welsh
Maltese
Lithuanian
Latvian
Icelandic
Irish
Croatian
Serbian
Estonian
Slovene
Slovak
Romanian
Norwegian
Greek
Galician
Danish
Bulgarian
Basque
Swedish
Portuguese
Finnish
Catalan
Polish
Hungarian
Czech
Italian
German
Dutch
Spanish
French
English
Levelofsupport
Languages with names in red
have little or no MT support
Source: META-NET White Paper Series: Europe's Languages in the Digital Age. Springer, Heidelberg,
New York, Dordrecht, London, September 2012. Georg Rehm and Hans Uszkoreit (series editors)
Important: even current state of the art
technologies are far from being perfect!
Important: 20+ European languages are
severely under-supported and face the
danger of digital extinction.
Excellent
Good
Moderate
Fragmentary
Weak/no
support
LanguageTechnologySupport
MillionsofNativeSpeakers(Worldwide)
Yiddish
Welsh
VlaxRomani
Turkish
Scots
Romany
Occitan
Maltese
Macedonian
Luxembourgish
Lithuanian
Limburgish
Latvian
Icelandic
Friulian
Frisian
Breton
Bosnian
Asturian
Albanian
Irish
Croatian
Serbian
Hebrew
Estonian
Slovene
Slovak
Romanian
Norwegian
Greek
Galician
Danish
Bulgarian
Basque
Swedish
Portuguese
Finnish
Catalan
Polish
Hungarian
Czech
Italian
German
Dutch
Spanish
French
English
0
50
100
150
200
250
300
350
400
Source: Georg Rehm, Hans Uszkoreit, Ido Dagan, Vartkes Goetcherian, Mehmet Ugur Dogan, Coskun Mermer, Tamás Váradi, Sabine Kirchmeier-Andersen,
Gerhard Stickel, Meirion Prys Jones, Stefan Oeter, and Sigve Gramstad. An Update and Extension of the META-NET Study “Europe's Languages in the
Digital Age”. In Proceedings of the Workshop on Collaboration and Computing for Under-Resourced Languages in the Linked Open Data Era (CCURL 2014),
Reykjavik, Iceland, May 2014.
Analysis II: Status and
Current Developments
10EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
• Multilingual Europe: our languages enjoy equal status yet digital extinction
of the majority of European languages is a very severe danger.
• Language Technology Research and Innovation in Europe: World class
research, excellent results (examples: Moses, recent NMT results of QT21),
strong SME base, thousands of LSPs; fragmentation; need for coordination.
• Big need for high-quality, high-coverage, precise, robust, deployable
Language Technologies: translation, conversational interfaces, text and
media analytics, personal assistants, multilingual DSM etc.
• Artificial Intelligence: Important breakthroughs and massive investments in
R&D and applications (mostly in US and Asia) – huge opportunity for Europe!
• The European Language Challenge cannot be abandoned or outsourced.
Ø Europe must not make its digital infrastructure dependent
on non-European solutions. This is why the EU is building
GALILEO as an alternative to GPS, GLONASS, Bei Dou.
• Big need for Language Technologies made in Europe for Europe!
Status and Current Developments
11EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
!
Example:
Language Technology for the
Digital Single Market
12EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
q Top priority in the European Union.
q Expected to add 400b€ to European GDP
and hundreds of thousands of new jobs.
q Unfortunately, the language topic is not
included in the EC’s Digital Single Market
strategy (published in May 2015).
MDSM: Needed Applications
q Crosslingual SME presales communication and aftersales services
q Multilingual websites, product catalogues, product descriptions
q Crosslingual business intelligence (e.g., based on UGC)
q Crosslingual communication for SMEs, public institutions, citizens
q Multilingual (big) data, language and knowledge value chains
q Multilingual knowledge bases and knowledge graphs (and services)
q Multilingual conversational interfaces for connected devices (IoT)
q Crosslingual social media analytics for EU-wide societal issues
q Multilingual text and report generation (knowledge/data to text)
q All services must be domain-adaptable (avoid one size fits all)
q etc.
15EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Multilingual Value Programme
q Multilingual Value Programe
§ Suggested three-year programme
§ Requires modest investment
q “Enabling the Multilingual Digital Single
Market through technologies for
translating, analysing, processing and
curating natural language content”
q Three components address the main
needs of the Multilingual DSM (MDSM)
and how to put them into practice:
1. Multilingual Application Areas
2. Multilingual Services
3. Research
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
16EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Version 1.0 to be published in 2017
Missions and Opportunities
17EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Missions and Opportunities
• Languages & European Society: Enable all European citizens
to communicate and operate in their mother tongues (online & offline).
• Languages & Media: Address – technologically – the massively
increasing social, political and commercial relevance of content and
communication (fake news debate, filter bubble challenge).
• Languages & Market: Realise the Multilingual DSM, including
multilingual content, crosslingual text analytics, multilingual generation.
• Languages & Digital Tech: Future-proof our languages.
• Languages & Devices: Robust, precise, high-quality spoken
language interfaces for billions of connected things – and all languages.
• Excellent opportunity for Europe, European research, European
education, European industry, European innovation, European culture!
• Goal: Move Europe into the pole position in this field!
18EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Towards the
Human Language Project
19EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Multilingual
Europe
through
Technology
Multilingual
Strategy of the
EU: more tech
support for
multilingualism
Language
Technologies
for Europe's
digital public
services
Technologies
for the
Multilingual
Digital Single
Market
Language
Technologies
for Big Data
text analytics
The Human
Language
Project – long-
term R&D&I,
post-H2020
Language
Technologies
R&D&I
(H2020, WP
2018-20)
Multilingual Europe
in January 2017
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
Open calls and
upcoming service
contracts
Dec. 2016: EC brainstorming
meeting on future LT priorities
in Horizon 2020 and FP9.
Need for a new strategy paper?
Jan. 2017: STOA workshop
and study on LT for Europe
Dec. 2017: LT Session
at BDVA Summit in
Valencia
2017: MDSM SRIA V1.0
Policy change and initiative
towards a European digital
public sphere enabled by MT/LT
DG CONNECT
DGT and
DG CONNECT
DG CONNECT
WP 2018-20 (incl. IoT,
I4.0, assistants,
robots etc.)
Shared programme
between EU and MS Suggested MLV Programme
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
CEF AT
ELRC
Multilingual
Europe
through
Technology
Multilingual
Strategy of the
EU: more tech
support for
multilingualism
Language
Technologies
for Europe's
digital public
services
Technologies
for the
Multilingual
Digital Single
Market
Language
Technologies
for Big Data
text analytics
The Human
Language
Project – long-
term R&D&I,
post-H2020
Language
Technologies
R&D&I
(H2020, WP
2018-20)
Multilingual Europe
in January 2017
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
Open calls and
upcoming service
contracts
Dec. 2016: EC brainstorming
meeting on future LT priorities
in Horizon 2020 and FP9.
Need for a new strategy paper?
Jan. 2017: STOA workshop
and study on LT for Europe
Dec. 2017: LT Session
at BDVA Summit in
Valencia
2017: MDSM SRIA V1.0
Policy change and initiative
towards a European digital
public sphere enabled by MT/LT
DG CONNECT
DGT and
DG CONNECT
DG CONNECT
WP 2018-20 (incl. IoT,
I4.0, assistants,
robots etc.)
Shared programme
between EU and MS Suggested MLV Programme
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
CEF AT
ELRC
Observations:
• Current initiatives are too small and unbalanced; they
concentrate on innovation and technology deployment.
• Danger to loose touch with research and novel,
potentially paradigm-shifting developments.
• Difficult to kick-start new, paradigm-shifting research.
• We need a coordinated, concerted and consolidated
push in basic research, applied R&D and innovation!
Human Language Project – Interdisciplinary R&D&I Programme
Basic
Research
•Results in new
methods,
approaches
Applied R&D
•Results in novel
technologies
Innovation
•Results in novel
or improved
products or
services
Research Themes – Needs and Gaps (market-driven)
• Computational Linguistics
• Artificial Intelligence
• Language Technology
• Linguistics
• Computer Science
• Cognitive Science
• other related fields
• New, groundbreaking
methods, paradigms,
approaches
• Foster technologies,
products, innovation,
economy
• Foster education
HLP: Umbrella programme
to turbo-charge and to
coordinate all European
R&D&I activities in a
systematic way including
EP, EC, Member States.
22EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Human Language Project
• Goal: Deep Natural Language Understanding.
• Breakthroughs in Artificial Intelligence plus a fresh
look at Linguistics for the Next Generation of LT!
• All official European and many additional languages
• Broad coverage, high quality, high precision
• Across modalities: text, text types, speech, image, video etc.
• Across platforms: messaging, telephony, social, mobile, IoT etc.
• Across cultures: knowledge, customs, formalities, humour,
emotion, subjectivity, biases, opinions, filter bubble etc.
23EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Human Language Project
• Collaboration and coordination between EC, EP,
Member States and all other stakeholders.
• Mix of funding sources:
– EU projects: Horizon 2020 (WP 2018-2020) + FP9 (2021+)
– National/regional funding sources
• Setup: basic research, applied research, innovation,
commercialisation – tightly intertwined
• Timeframe: 10 years
• Policy change towards “LT-enabled multilingualism”
• Public procurement: EU/EC, MS administrations
should demand certain language technologies
24EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
HLP Topics: Key Ingredients for
Future European LT Research
Artificial Intelligence
including cognition, perception, vision,
cross-modal, cross-platform, cross-culture, IoT etc.
Machine Learning
Language Technology
• Extend knowledge bases
• Semantic Web, ontologies,
linked data, interoperability
• More complex models
• Multilingual resources that
are grounded, extensible
• Subjectivity, objectivity,
further novel dimensions
• Web-scale reasoning
• Combine DNNs and
symbolic processing
• ML for knowledge
acquisition and extension
• DNNs embedded into
modular systems including
symbolic knowledge bases
• Make it possible to inspect
and also to optimise DNNs
(beyond end-to-end)
• (Computational) Linguistics
research towards deep
language understanding
• From corpora to DNNs to
annotated data to highly
improved symbolic methods
• Language portability
• Full and Deep Language
Understanding by 2030 –
Human Language Project
Knowledge Technology
25EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
Human
Language
Project
Truly
Multilingual
Europe
European
Economy
(MDSM)
Attractive
jobs for
high
potentials
Education
and young
researchers
Massive
boost for
research
Foster
innovation
and new
companies
26
Thank you!
Georg Rehm
georg.rehm@dfki.de
Human
Language
Project
Truly
Multilingual
Europe
European
Economy
(MDSM)
Attractive
jobs for
high
potentials
Education
and young
researchers
Massive
boost for
research
Foster
innovation
and new
companies

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Multilingualism for Digital Europe
Multilingualism for Digital EuropeMultilingualism for Digital Europe
Multilingualism for Digital Europe
 
META-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for EuropeMETA-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for Europe
 
Computational Morphology and the META-NET Strategic Research Agenda for Multi...
Computational Morphology and the META-NET Strategic Research Agenda for Multi...Computational Morphology and the META-NET Strategic Research Agenda for Multi...
Computational Morphology and the META-NET Strategic Research Agenda for Multi...
 
European Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and FutureEuropean Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and Future
 
Language Technology for Multilingual Europe
Language Technology for Multilingual EuropeLanguage Technology for Multilingual Europe
Language Technology for Multilingual Europe
 
The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020
 
META-NET: Language Technology for Europe
META-NET: Language Technology for EuropeMETA-NET: Language Technology for Europe
META-NET: Language Technology for Europe
 
The META-NET Language White Paper Series
The META-NET Language White Paper SeriesThe META-NET Language White Paper Series
The META-NET Language White Paper Series
 
AI for Translation Technologies and Multilingual Europe
AI for Translation Technologies and Multilingual EuropeAI for Translation Technologies and Multilingual Europe
AI for Translation Technologies and Multilingual Europe
 
Why language technology resources matter to Welsh and other less-used languages
Why language technology resources matter to Welsh and other less-used languagesWhy language technology resources matter to Welsh and other less-used languages
Why language technology resources matter to Welsh and other less-used languages
 
Towards a Human Language Project for Multilingual Europe: AI and Interpretation
Towards a Human Language Project for Multilingual Europe: AI and InterpretationTowards a Human Language Project for Multilingual Europe: AI and Interpretation
Towards a Human Language Project for Multilingual Europe: AI and Interpretation
 
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual EuropeMETA-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
 
Language Technologies for Multilingual Europe - Towards a Human Language Proj...
Language Technologies for Multilingual Europe - Towards a Human Language Proj...Language Technologies for Multilingual Europe - Towards a Human Language Proj...
Language Technologies for Multilingual Europe - Towards a Human Language Proj...
 
The META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open DataThe META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open Data
 
AI and Conference Interpretation – From Smart Assistants for the Human Interp...
AI and Conference Interpretation – From Smart Assistants for the Human Interp...AI and Conference Interpretation – From Smart Assistants for the Human Interp...
AI and Conference Interpretation – From Smart Assistants for the Human Interp...
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
 
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
 
Swot analysis of counselling methods in poland
Swot analysis of counselling methods in polandSwot analysis of counselling methods in poland
Swot analysis of counselling methods in poland
 
Pam mae technoleg iaith Cymraeg yn bwysig? Why Welsh language technology ...
Pam mae technoleg iaith Cymraeg yn bwysig? Why Welsh language technology ...Pam mae technoleg iaith Cymraeg yn bwysig? Why Welsh language technology ...
Pam mae technoleg iaith Cymraeg yn bwysig? Why Welsh language technology ...
 

Andere mochten auch

Supertech 79 gurgaon details
Supertech 79 gurgaon detailsSupertech 79 gurgaon details
Supertech 79 gurgaon details
Vaibhav Axiom
 

Andere mochten auch (9)

Capital Tower - 1 at Downtown Emaar MGF
Capital Tower - 1 at Downtown Emaar MGFCapital Tower - 1 at Downtown Emaar MGF
Capital Tower - 1 at Downtown Emaar MGF
 
Satya platina 9250404176
Satya platina 9250404176Satya platina 9250404176
Satya platina 9250404176
 
Supertech 79 gurgaon details
Supertech 79 gurgaon detailsSupertech 79 gurgaon details
Supertech 79 gurgaon details
 
Satya platina sec103 gurgaon
Satya platina sec103 gurgaonSatya platina sec103 gurgaon
Satya platina sec103 gurgaon
 
Web design training in bangalore
Web design training in bangaloreWeb design training in bangalore
Web design training in bangalore
 
Presentation
PresentationPresentation
Presentation
 
Azure ppt
Azure pptAzure ppt
Azure ppt
 
Supertech sector 68
Supertech sector 68Supertech sector 68
Supertech sector 68
 
Web design-traiing-in-bangalore
Web design-traiing-in-bangaloreWeb design-traiing-in-bangalore
Web design-traiing-in-bangalore
 

Ähnlich wie Human Language Technologies in a Multilingual Europe

Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
Aisling Donovan
 
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
DataScienceConferenc1
 
Visita estudis2010nl
Visita estudis2010nlVisita estudis2010nl
Visita estudis2010nl
guest3db223
 

Ähnlich wie Human Language Technologies in a Multilingual Europe (17)

TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
 
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, TildeIs MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
 
The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...
 
ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?
 
Introduction to CEF.AT. (Kim Harris from ELRC)
Introduction to CEF.AT. (Kim Harris from ELRC)Introduction to CEF.AT. (Kim Harris from ELRC)
Introduction to CEF.AT. (Kim Harris from ELRC)
 
Romania-Galati_by Daniela Bobocea_ Study Visit 2013-189-ES (Barcelona)
Romania-Galati_by Daniela Bobocea_ Study Visit 2013-189-ES (Barcelona)Romania-Galati_by Daniela Bobocea_ Study Visit 2013-189-ES (Barcelona)
Romania-Galati_by Daniela Bobocea_ Study Visit 2013-189-ES (Barcelona)
 
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
 
Policies for OER in regional and minority languages: are regional and minorit...
Policies for OER in regional and minority languages: are regional and minorit...Policies for OER in regional and minority languages: are regional and minorit...
Policies for OER in regional and minority languages: are regional and minorit...
 
Adnoddau Cymraeg - Welsh Tools
Adnoddau Cymraeg - Welsh ToolsAdnoddau Cymraeg - Welsh Tools
Adnoddau Cymraeg - Welsh Tools
 
REFLESS project top-level conference: Waldemar Martyniuk, Council of Europe's...
REFLESS project top-level conference: Waldemar Martyniuk, Council of Europe's...REFLESS project top-level conference: Waldemar Martyniuk, Council of Europe's...
REFLESS project top-level conference: Waldemar Martyniuk, Council of Europe's...
 
Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
Docfoc.com-1 ALL and TOOL Projects Case studies of creativity and innovation ...
 
MLi - Project presentation
MLi - Project presentationMLi - Project presentation
MLi - Project presentation
 
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
[DSC Europe 23] Slobodan Markovic - NLP for Serbian.pptx
 
Teramo 2016
Teramo 2016Teramo 2016
Teramo 2016
 
Visita estudis2010nl
Visita estudis2010nlVisita estudis2010nl
Visita estudis2010nl
 
Visita estudis2010nl
Visita estudis2010nlVisita estudis2010nl
Visita estudis2010nl
 
Heriot watt 2
Heriot watt 2Heriot watt 2
Heriot watt 2
 

Mehr von Georg Rehm

Mehr von Georg Rehm (19)

QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
 
Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...
 
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
 
Künstliche Intelligenz beim Dolmetschen und Übersetzen
Künstliche Intelligenz beim Dolmetschen und ÜbersetzenKünstliche Intelligenz beim Dolmetschen und Übersetzen
Künstliche Intelligenz beim Dolmetschen und Übersetzen
 
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
 
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) ÜberblickKI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
 
Kuratieren im Zeitalter der KI
Kuratieren im Zeitalter der KIKuratieren im Zeitalter der KI
Kuratieren im Zeitalter der KI
 
Artificial Intelligence for the Film Industry
Artificial Intelligence for the Film IndustryArtificial Intelligence for the Film Industry
Artificial Intelligence for the Film Industry
 
KI für die Kundenkommunikation
KI für die KundenkommunikationKI für die Kundenkommunikation
KI für die Kundenkommunikation
 
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
 
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen BibliothekenDigitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
 
EPUB, quo vadis? Publishing im W3C
EPUB, quo vadis? Publishing im W3CEPUB, quo vadis? Publishing im W3C
EPUB, quo vadis? Publishing im W3C
 
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
 
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
 
Curation Technologies for Multilingual Europe
Curation Technologies for Multilingual EuropeCuration Technologies for Multilingual Europe
Curation Technologies for Multilingual Europe
 
Web Annotations – A Game Changer for Language Technology?
Web Annotations – A Game Changer for Language Technology?Web Annotations – A Game Changer for Language Technology?
Web Annotations – A Game Changer for Language Technology?
 
Globale Standards im Web of Things
Globale Standards im Web of ThingsGlobale Standards im Web of Things
Globale Standards im Web of Things
 
W3C/DFKI Automotive Workshop
W3C/DFKI Automotive WorkshopW3C/DFKI Automotive Workshop
W3C/DFKI Automotive Workshop
 
Digitale Kuratierungstechnologien – Beispiele aus ausgewählten Branchen
Digitale Kuratierungstechnologien – Beispiele aus ausgewählten BranchenDigitale Kuratierungstechnologien – Beispiele aus ausgewählten Branchen
Digitale Kuratierungstechnologien – Beispiele aus ausgewählten Branchen
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Human Language Technologies in a Multilingual Europe

  • 1. Georg Rehm georg.rehm@dfki.de DFKI GmbH, Language Technology Lab – Berlin, Germany META-NET, General Secretary Human Language Technologies in a Multilingual Europe
  • 2. Outline • Multilingual Europe • Analysis I: Technology Support for Europe’s Languages • Analysis II: Status and Current Developments • Example: LT for the Digital Single Market • Missions and Opportunities • Towards the Human Language Project 2EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 3. • Multilingualism is at the very heart of the European idea. • 24 EU languages – all languages have the same status. • Dozens of regional and minority languages as well as languages of immigrants and trade partners. • Economic challenges: – If the DSM is not multilingual, there will be 20+ isolated markets! – Language barriers are market barriers! • Social and public challenges: – Empower all citizens to use their mother tongues. – Provide multilingual digital public services. – Enable cross-border, cross-lingual, cross-cultural communication. Towards a European public sphere and e-participation. – Restore trust in media (fake news debate, filter bubble issue etc.)
  • 4. Analysis I: Technology Support for Europe’s Languages 4EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 5. q 60 research centres in 34 countries (founded in 2010) Chair of Executive Board: Jan Hajic (CUNI) Dep.: J. van Genabith (DFKI), A. Vasiljevs (Tilde) General Secretary: Georg Rehm (DFKI) q Multilingual Europe Technology Alliance. 826 members in 67 countries (published in 2013) (31 volumes; published in 2012) T4ME (META-NET) CESAR METANET4UMETA-NORDMultilingual Europe Technology AllianceNET
  • 6. q Basque q Bulgarian* q Catalan q Croatian* q Czech* q Danish* q Dutch* q English* q Estonian* q Finnish* q French* q Galician q German* q Greek* q Hungarian* q Icelandic q Irish* q Italian* q Latvian* q Lithuanian* q Maltese* q Norwegian q Polish* q Portuguese* q Romanian* q Serbian q Slovak* q Slovene* q Spanish* q Swedish* q Welsh * Official EU languagehttp://www.meta-net.eu/whitepapers
  • 7. MT English good French, Spanish moderate fragmentary Catalan, Dutch, German, Hungarian, Italian, Polish, Romanian weak or no support through LT Basque, Bulgarian, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Icelandic, Irish, Latvian, Lithuanian, Maltese, Norwegian, Portuguese, Serbian, Slovak, Slovene, Swedish, Welsh excellent Czech, Dutch, Finnish, French, German, Italian, Portuguese, Spanish moderate fragmentary Basque, Bulgarian, Catalan, Danish, Estonian, Galician, Greek, Hungarian, Irish, Norwegian, Polish, Serbian, Slovak, Slovene, Swedish weak or no support through LT Croatian, Icelandic, Latvian, Lithuanian, Maltese, Romanian, Welsh excellent English good Speech English good Dutch, French, German, Italian, Spanish moderate fragmentary Basque, Bulgarian, Catalan, Czech, Danish, Finnish, Galician, Greek, Hungarian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish weak or no support through LT Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian, Welsh excellent English good Czech, Dutch, French, German, Hungarian, Italian, Polish, Spanish, Swedish moderate fragmentary Basque, Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Galician, Greek, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovene Icelandic, Irish, Latvian, Lithuanian, Maltese, Welsh weak or no support through LTexcellent ResourcesTextAnalytics
  • 8. Fragmentary Weak/none Moderate Good Excellent Welsh Maltese Lithuanian Latvian Icelandic Irish Croatian Serbian Estonian Slovene Slovak Romanian Norwegian Greek Galician Danish Bulgarian Basque Swedish Portuguese Finnish Catalan Polish Hungarian Czech Italian German Dutch Spanish French English Levelofsupport Languages with names in red have little or no MT support Source: META-NET White Paper Series: Europe's Languages in the Digital Age. Springer, Heidelberg, New York, Dordrecht, London, September 2012. Georg Rehm and Hans Uszkoreit (series editors) Important: even current state of the art technologies are far from being perfect! Important: 20+ European languages are severely under-supported and face the danger of digital extinction.
  • 9. Excellent Good Moderate Fragmentary Weak/no support LanguageTechnologySupport MillionsofNativeSpeakers(Worldwide) Yiddish Welsh VlaxRomani Turkish Scots Romany Occitan Maltese Macedonian Luxembourgish Lithuanian Limburgish Latvian Icelandic Friulian Frisian Breton Bosnian Asturian Albanian Irish Croatian Serbian Hebrew Estonian Slovene Slovak Romanian Norwegian Greek Galician Danish Bulgarian Basque Swedish Portuguese Finnish Catalan Polish Hungarian Czech Italian German Dutch Spanish French English 0 50 100 150 200 250 300 350 400 Source: Georg Rehm, Hans Uszkoreit, Ido Dagan, Vartkes Goetcherian, Mehmet Ugur Dogan, Coskun Mermer, Tamás Váradi, Sabine Kirchmeier-Andersen, Gerhard Stickel, Meirion Prys Jones, Stefan Oeter, and Sigve Gramstad. An Update and Extension of the META-NET Study “Europe's Languages in the Digital Age”. In Proceedings of the Workshop on Collaboration and Computing for Under-Resourced Languages in the Linked Open Data Era (CCURL 2014), Reykjavik, Iceland, May 2014.
  • 10. Analysis II: Status and Current Developments 10EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 11. • Multilingual Europe: our languages enjoy equal status yet digital extinction of the majority of European languages is a very severe danger. • Language Technology Research and Innovation in Europe: World class research, excellent results (examples: Moses, recent NMT results of QT21), strong SME base, thousands of LSPs; fragmentation; need for coordination. • Big need for high-quality, high-coverage, precise, robust, deployable Language Technologies: translation, conversational interfaces, text and media analytics, personal assistants, multilingual DSM etc. • Artificial Intelligence: Important breakthroughs and massive investments in R&D and applications (mostly in US and Asia) – huge opportunity for Europe! • The European Language Challenge cannot be abandoned or outsourced. Ø Europe must not make its digital infrastructure dependent on non-European solutions. This is why the EU is building GALILEO as an alternative to GPS, GLONASS, Bei Dou. • Big need for Language Technologies made in Europe for Europe! Status and Current Developments 11EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017) !
  • 12. Example: Language Technology for the Digital Single Market 12EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 13. q Top priority in the European Union. q Expected to add 400b€ to European GDP and hundreds of thousands of new jobs. q Unfortunately, the language topic is not included in the EC’s Digital Single Market strategy (published in May 2015).
  • 14.
  • 15. MDSM: Needed Applications q Crosslingual SME presales communication and aftersales services q Multilingual websites, product catalogues, product descriptions q Crosslingual business intelligence (e.g., based on UGC) q Crosslingual communication for SMEs, public institutions, citizens q Multilingual (big) data, language and knowledge value chains q Multilingual knowledge bases and knowledge graphs (and services) q Multilingual conversational interfaces for connected devices (IoT) q Crosslingual social media analytics for EU-wide societal issues q Multilingual text and report generation (knowledge/data to text) q All services must be domain-adaptable (avoid one size fits all) q etc. 15EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 16. Multilingual Value Programme q Multilingual Value Programe § Suggested three-year programme § Requires modest investment q “Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content” q Three components address the main needs of the Multilingual DSM (MDSM) and how to put them into practice: 1. Multilingual Application Areas 2. Multilingual Services 3. Research Strategic Research and Innovation Agenda Language as a Data Type and Key Challenge for Big Data Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content SRIA Editorial Team Version 0.9 – July 2016 16EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017) Version 1.0 to be published in 2017
  • 17. Missions and Opportunities 17EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 18. Missions and Opportunities • Languages & European Society: Enable all European citizens to communicate and operate in their mother tongues (online & offline). • Languages & Media: Address – technologically – the massively increasing social, political and commercial relevance of content and communication (fake news debate, filter bubble challenge). • Languages & Market: Realise the Multilingual DSM, including multilingual content, crosslingual text analytics, multilingual generation. • Languages & Digital Tech: Future-proof our languages. • Languages & Devices: Robust, precise, high-quality spoken language interfaces for billions of connected things – and all languages. • Excellent opportunity for Europe, European research, European education, European industry, European innovation, European culture! • Goal: Move Europe into the pole position in this field! 18EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 19. Towards the Human Language Project 19EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 20. Multilingual Europe through Technology Multilingual Strategy of the EU: more tech support for multilingualism Language Technologies for Europe's digital public services Technologies for the Multilingual Digital Single Market Language Technologies for Big Data text analytics The Human Language Project – long- term R&D&I, post-H2020 Language Technologies R&D&I (H2020, WP 2018-20) Multilingual Europe in January 2017 Strategic Research and Innovation Agenda Language as a Data Type and Key Challenge for Big Data Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content SRIA Editorial Team Version 0.9 – July 2016 Open calls and upcoming service contracts Dec. 2016: EC brainstorming meeting on future LT priorities in Horizon 2020 and FP9. Need for a new strategy paper? Jan. 2017: STOA workshop and study on LT for Europe Dec. 2017: LT Session at BDVA Summit in Valencia 2017: MDSM SRIA V1.0 Policy change and initiative towards a European digital public sphere enabled by MT/LT DG CONNECT DGT and DG CONNECT DG CONNECT WP 2018-20 (incl. IoT, I4.0, assistants, robots etc.) Shared programme between EU and MS Suggested MLV Programme Strategic Research and Innovation Agenda Language as a Data Type and Key Challenge for Big Data Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content SRIA Editorial Team Version 0.9 – July 2016 CEF AT ELRC
  • 21. Multilingual Europe through Technology Multilingual Strategy of the EU: more tech support for multilingualism Language Technologies for Europe's digital public services Technologies for the Multilingual Digital Single Market Language Technologies for Big Data text analytics The Human Language Project – long- term R&D&I, post-H2020 Language Technologies R&D&I (H2020, WP 2018-20) Multilingual Europe in January 2017 Strategic Research and Innovation Agenda Language as a Data Type and Key Challenge for Big Data Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content SRIA Editorial Team Version 0.9 – July 2016 Open calls and upcoming service contracts Dec. 2016: EC brainstorming meeting on future LT priorities in Horizon 2020 and FP9. Need for a new strategy paper? Jan. 2017: STOA workshop and study on LT for Europe Dec. 2017: LT Session at BDVA Summit in Valencia 2017: MDSM SRIA V1.0 Policy change and initiative towards a European digital public sphere enabled by MT/LT DG CONNECT DGT and DG CONNECT DG CONNECT WP 2018-20 (incl. IoT, I4.0, assistants, robots etc.) Shared programme between EU and MS Suggested MLV Programme Strategic Research and Innovation Agenda Language as a Data Type and Key Challenge for Big Data Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content SRIA Editorial Team Version 0.9 – July 2016 CEF AT ELRC Observations: • Current initiatives are too small and unbalanced; they concentrate on innovation and technology deployment. • Danger to loose touch with research and novel, potentially paradigm-shifting developments. • Difficult to kick-start new, paradigm-shifting research. • We need a coordinated, concerted and consolidated push in basic research, applied R&D and innovation!
  • 22. Human Language Project – Interdisciplinary R&D&I Programme Basic Research •Results in new methods, approaches Applied R&D •Results in novel technologies Innovation •Results in novel or improved products or services Research Themes – Needs and Gaps (market-driven) • Computational Linguistics • Artificial Intelligence • Language Technology • Linguistics • Computer Science • Cognitive Science • other related fields • New, groundbreaking methods, paradigms, approaches • Foster technologies, products, innovation, economy • Foster education HLP: Umbrella programme to turbo-charge and to coordinate all European R&D&I activities in a systematic way including EP, EC, Member States. 22EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 23. Human Language Project • Goal: Deep Natural Language Understanding. • Breakthroughs in Artificial Intelligence plus a fresh look at Linguistics for the Next Generation of LT! • All official European and many additional languages • Broad coverage, high quality, high precision • Across modalities: text, text types, speech, image, video etc. • Across platforms: messaging, telephony, social, mobile, IoT etc. • Across cultures: knowledge, customs, formalities, humour, emotion, subjectivity, biases, opinions, filter bubble etc. 23EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 24. Human Language Project • Collaboration and coordination between EC, EP, Member States and all other stakeholders. • Mix of funding sources: – EU projects: Horizon 2020 (WP 2018-2020) + FP9 (2021+) – National/regional funding sources • Setup: basic research, applied research, innovation, commercialisation – tightly intertwined • Timeframe: 10 years • Policy change towards “LT-enabled multilingualism” • Public procurement: EU/EC, MS administrations should demand certain language technologies 24EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 25. HLP Topics: Key Ingredients for Future European LT Research Artificial Intelligence including cognition, perception, vision, cross-modal, cross-platform, cross-culture, IoT etc. Machine Learning Language Technology • Extend knowledge bases • Semantic Web, ontologies, linked data, interoperability • More complex models • Multilingual resources that are grounded, extensible • Subjectivity, objectivity, further novel dimensions • Web-scale reasoning • Combine DNNs and symbolic processing • ML for knowledge acquisition and extension • DNNs embedded into modular systems including symbolic knowledge bases • Make it possible to inspect and also to optimise DNNs (beyond end-to-end) • (Computational) Linguistics research towards deep language understanding • From corpora to DNNs to annotated data to highly improved symbolic methods • Language portability • Full and Deep Language Understanding by 2030 – Human Language Project Knowledge Technology 25EP STOA Workshop: Language Equality in the Digital Age (10 Jan. 2017)
  • 27. Thank you! Georg Rehm georg.rehm@dfki.de Human Language Project Truly Multilingual Europe European Economy (MDSM) Attractive jobs for high potentials Education and young researchers Massive boost for research Foster innovation and new companies