CIP ICT PSP – 270999

A multilingual framework for transforming
online services to truly multilingual
Organic.Lingua Closing Workshop, 6/2/2014
Rome, Italy
THE PROBLEM
Online discovery services: Agricultural domain
Why so few?
Can I search using terms in
my language?

Why is terminology not transformed correctly?

Why so many options?
Why is terminology not transformed correctly?
Online discovery services: other domains
Can I search using terms in
my language?

Why is terminology not transformed correctly?

Why is this not synced with the language of the user
interface?
Typical questions from online service
owners
• Which MT service should I use?
• Are there any free options?
• What methodology should I follow
to turn my portal into a truly
multilingual one?
• Which open source tools should I
use?
LESSONS LEARNED: THE
ORGANIC.LINGUA EXPERIENCE
Lessons learned
• Evaluate the performance of linguistic services
– with users
– in vitro

• MT has limitations and does not work well for a specific
domain
• Inform users about the use of MT
• Provide generic components that can wrap different linguistic
services
• Effort is needed to find, organize and prepare domain-specific
language resources
• Publish language resources that can be used to improve
linguistic services
• Enable user feedback, but control it
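
As an illustration of the "in vitro" evaluation mentioned above, the following is a minimal sketch of one possible check: how often an MT output preserves the expected domain terminology. The names `EvalCase` and `terminology_accuracy`, and the sample sentences, are assumptions made for this example; they are not taken from the Organic.Lingua tools.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EvalCase:
    """One MT output plus the domain terms a correct translation should contain."""
    mt_output: str
    expected_terms: List[str]


def terminology_accuracy(cases):
    """Fraction of expected domain terms that appear in the MT outputs.

    Matching is a case-insensitive substring test, which is deliberately
    crude: it ignores inflection and synonyms.
    """
    found = 0
    total = 0
    for case in cases:
        text = case.mt_output.lower()
        for term in case.expected_terms:
            total += 1
            if term.lower() in text:
                found += 1
    return found / total if total else 0.0


# Hypothetical sample data: the second output misses "organic farming".
cases = [
    EvalCase("Organic farming improves soil fertility",
             ["organic farming", "soil fertility"]),
    EvalCase("Biological agriculture uses compost",
             ["organic farming", "compost"]),
]
print(terminology_accuracy(cases))
```

A score like this only complements evaluation with real users; it says nothing about fluency or overall adequacy.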
WHAT ORGANIC.LINGUA PROPOSES
Main contributions
• Methodology for the adaptation that can be
re-used in different domains
• Open multilingual architecture
• Re-engineering process
• Open source software tools
• Conceptual framework and linked data
approach
A common multilingual framework
(CMF)
• What CMF is
The Common Multilingual Framework (CMF) is intended to provide a set of
Application Programming Interfaces (APIs) and a common architecture for digital
collection portals that is neutral to the implementation technology of the portals
and extensible

• Principles
– Open architecture based on REST APIs
– Wrap language services
– Decouple front end from back end
– KOSs managed externally and published as linked data
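
The "wrap language services" principle can be sketched as follows. This is not the CMF API: `TranslationBackend`, `GlossaryBackend`, `EchoBackend` and `LanguageServiceWrapper` are hypothetical names, and the two backends are toy stand-ins for real MT services. The point is only that the front end talks to one interface while services behind it can be replaced.

```python
from abc import ABC, abstractmethod


class TranslationBackend(ABC):
    """Common interface every wrapped MT service implements (hypothetical)."""

    name: str

    @abstractmethod
    def translate(self, text: str, source: str, target: str) -> str:
        ...


class GlossaryBackend(TranslationBackend):
    """Toy backend: looks terms up in an in-memory domain glossary."""

    name = "glossary"

    def __init__(self, entries):
        # entries maps (source, target, lowercased term) -> translated term
        self.entries = entries

    def translate(self, text, source, target):
        key = (source, target, text.lower())
        if key not in self.entries:
            raise LookupError(f"no glossary entry for {text!r}")
        return self.entries[key]


class EchoBackend(TranslationBackend):
    """Toy stand-in for a generic MT service: tags the text instead of translating."""

    name = "echo"

    def translate(self, text, source, target):
        return f"[{source}->{target}] {text}"


class LanguageServiceWrapper:
    """Front ends call only this class; backends can be swapped behind it."""

    def __init__(self, backends):
        self.backends = list(backends)

    def translate(self, text, source, target):
        # Try each backend in order and fall through when one cannot answer.
        for backend in self.backends:
            try:
                return backend.name, backend.translate(text, source, target)
            except LookupError:
                continue
        raise RuntimeError("no backend could translate the text")


wrapper = LanguageServiceWrapper([
    GlossaryBackend({("en", "de", "compost"): "Kompost"}),
    EchoBackend(),
])
print(wrapper.translate("Compost", "en", "de"))
print(wrapper.translate("soil", "en", "de"))
```

Returning the backend name alongside the translation is one way to let the portal tell users which service produced a given result.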
Who might be interested
• Portals with digital collections, specifically
online content discovery services
• Projects that are re-engineering or developing
such online content discovery services
• Other stakeholders
– Digital collection managers
– SMEs that want to develop products based on data
– Developers
– Software engineers and project managers
– Language technology experts
Architectural principles
• Domain agnostic
• Open architecture
– any new component can be easily integrated

• Adapt external services so you can easily
upgrade them
• Keep analytics for the transactions with the
language services
– important for the evaluation
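
Keeping analytics for transactions with language services could look roughly like the sketch below. `Transaction`, `TransactionLog` and the recorded fields are assumptions for illustration and do not describe the actual Organic.Lingua Analytics Service.

```python
import time
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Transaction:
    """One call to a language service, as recorded for later evaluation."""
    service: str
    source: str
    target: str
    seconds: float
    ok: bool


class TransactionLog:
    def __init__(self):
        self.records = []

    def call(self, service, func, text, source, target):
        """Run func(text, source, target), record the outcome, return the result.

        Returns None when the wrapped service raises an exception.
        """
        start = time.perf_counter()
        try:
            result = func(text, source, target)
            ok = True
        except Exception:
            result, ok = None, False
        elapsed = time.perf_counter() - start
        self.records.append(Transaction(service, source, target, elapsed, ok))
        return result

    def success_rate(self):
        """Share of successful calls per service name."""
        totals = defaultdict(lambda: [0, 0])  # service -> [successes, calls]
        for rec in self.records:
            totals[rec.service][0] += rec.ok
            totals[rec.service][1] += 1
        return {name: ok / calls for name, (ok, calls) in totals.items()}


# Hypothetical services: one that works and one that always fails.
def working_service(text, source, target):
    return text.upper()


def failing_service(text, source, target):
    raise ValueError("service unavailable")


log = TransactionLog()
log.call("mt-a", working_service, "soil", "en", "el")
log.call("mt-a", failing_service, "soil", "en", "el")
log.call("mt-b", working_service, "seed", "en", "el")
print(log.success_rate())
```

Because every call goes through the log, the same records can feed comparisons between services when deciding whether to upgrade or replace one.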
How it looks
Open Source software
• Multilingual repository tool
• Multilingual ontology evolution and publishing
tool
• Analytics Service
• Portal infrastructure
7 ways to become truly multilingual
1. Evaluate MT services before and during their use
2. Manage KOSs externally and publish them as linked data
3. Be domain specific
4. Sync content translation and interface translation
5. Wrap language services
6. Follow the standards
7. Get feedback from users
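
Syncing content with the interface language, using concept labels from an externally managed KOS, can be sketched as a simple lookup with a fallback. The function `pick_label`, the fallback to English and the sample concept are assumptions for illustration; a real portal would read the labels from the published linked data rather than from a literal dictionary.

```python
def pick_label(labels, ui_lang, fallback="en"):
    """Return (label, lang) for a concept, preferring the interface language.

    labels maps language codes to labels, as one might extract from the
    preferred labels of a concept. Order of preference: ui_lang, then
    fallback, then the alphabetically first language available.
    """
    for lang in (ui_lang, fallback):
        if lang in labels:
            return labels[lang], lang
    if not labels:
        raise ValueError("concept has no labels")
    lang = sorted(labels)[0]
    return labels[lang], lang


# Hypothetical concept with labels in three languages.
concept = {
    "en": "organic farming",
    "de": "ökologischer Landbau",
    "fr": "agriculture biologique",
}
print(pick_label(concept, "fr"))
print(pick_label(concept, "el"))
```

Returning the language actually used lets the interface signal to the user when a label is shown in a language other than the one selected.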
