SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Antoine Isaac
Information and networking days
H2020 / Connecting Europe Facility, Jan 15-16, 2014
Europe’s platform to access cultural heritage
Currently
30M objects
Built on descriptive metadata
from a broad, heterogeneous network
Audiovisual
collections
National Aggregators
Regional Aggregators
Archives
Thematic collections
Libraries
Musées
Lausannois
Culture.frThe
European
Library
APEX
European Film
Gateway Europeana Fashion
2,300 galleries, museums, archives and libraries
Accessing items from 36 countries
top 16
Portal interface in 31 languages
Metadata in 33 languages
Serving Europe’s citizens
5M visits on Europeana.eu
7M Facebook impressions
API use…
Content (digital objects on the site of the provider)
Metadata (descriptive object information)
Public Domain
Creative Commons
Licenses
Rights reserved
Orphan work
Facilitating re-use on the legal side
CC
Facilitating re-use on the language side?
Our network needs automatic translation tools
to address information needs all over Europe
Gathering/linking existing multilingual data
Related projects applying NLP tools
E.g., The PATHS project has developed techniques to enrich English
and Spanish collections
1)Identification of key entities
2)Detection of (typed) similarities between objects, using metadata
3)“Background links” to external resources such as Wikipedia
4)Classification of object against a hierarchy of topic
Applying these techniques to other languages would require work
1)requires language-specific tools (PoS tagging, lemmatization)
2)is straightforward to apply to new languages
3)requires language-specific tools
4)depends on (3) and on translation of some topics
http://www.paths-project.eu/eng/Resources/Semantic-Enrichment-of-Cultural-Heritage-content-in-PATHS
Language challenges for Digital Libraries
 Typical queries are very short
Average < 2 terms
 Identification of query language is not easy, even manually
39% of queries may belong to several languages
 Plenty of named entities
60% of queries are for persons & places
Not only is it hard for queries: the same issues apply
to the descriptive metadata
Studies by Humboldt University on Europeana and The European Library
http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
Language processing issues at the scale of
Europe
Thank you!
Antoine Isaac
antoine.isaac@europeana.eu
@EuropeanaEU
Europeana’s vision and mission
 We believe in making cultural heritage openly
accessible in a digital way, to promote the
exchange of ideas and information
 We want to be a catalyst for change in the world of
cultural heritage

Weitere ähnliche Inhalte

Was ist angesagt?

Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Antoine Isaac
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for EuropeanaAntoine Isaac
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Antoine Isaac
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?Antoine Isaac
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingAntoine Isaac
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011Antoine Isaac
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachValentine Charles
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkCARARE
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataAntoine Isaac
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Antoine Isaac
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Antoine Isaac
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
 
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerDov Winer
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceCARARE
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingAntoine Isaac
 

Was ist angesagt? (20)

Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap Meeting
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing framework
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
 

Andere mochten auch

Implementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationImplementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationRaphael Troncy
 
Networked books and networked reading
Networked books and networked readingNetworked books and networked reading
Networked books and networked readingCamille Hartsell
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Raphael Troncy
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating eventsRaphael Troncy
 
Europeana and RDF data validation
Europeana and RDF data validationEuropeana and RDF data validation
Europeana and RDF data validationAntoine Isaac
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)bisg
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide3 Round Stones
 

Andere mochten auch (20)

Implementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationImplementing the Media Fragments URI Specification
Implementing the Media Fragments URI Specification
 
Networked books and networked reading
Networked books and networked readingNetworked books and networked reading
Networked books and networked reading
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
 
NISO Annual Report of 2012 Activities
NISO Annual Report of 2012 ActivitiesNISO Annual Report of 2012 Activities
NISO Annual Report of 2012 Activities
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating events
 
Europeana and RDF data validation
Europeana and RDF data validationEuropeana and RDF data validation
Europeana and RDF data validation
 
NISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership MeetingNISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership Meeting
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide
 
Uncork Your Licenses!
Uncork Your Licenses!Uncork Your Licenses!
Uncork Your Licenses!
 
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
 
Carpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference PresentationCarpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference Presentation
 
Uncork Your Licenses!
Uncork Your Licenses! Uncork Your Licenses!
Uncork Your Licenses!
 
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
 
Uncork Your Licenses
Uncork Your LicensesUncork Your Licenses
Uncork Your Licenses
 
Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015
 
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
 
ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015
 
The Infrastructure for Alternative Metrics
The Infrastructure for Alternative MetricsThe Infrastructure for Alternative Metrics
The Infrastructure for Alternative Metrics
 
NISO's Altmetrics Initiative
NISO's Altmetrics InitiativeNISO's Altmetrics Initiative
NISO's Altmetrics Initiative
 

Ähnlich wie Multilingual challenges in Europeana

Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info
 
Rob Davies : How we got here
Rob Davies : How we got hereRob Davies : How we got here
Rob Davies : How we got hereAccessITplus
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...Valentine Charles
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseAntoine Isaac
 
Natural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNatural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNikos Palavitsinis, PhD
 
The European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookThe European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookOlaf Janssen
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageMax Kaiser
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsLIBER Europe
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana
 
Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Tolga Çakmak
 
77. newsletter d andrea2012
77. newsletter d andrea201277. newsletter d andrea2012
77. newsletter d andrea2012Andrea D'Andrea
 
CARARE workshop: Europeana research
CARARE workshop: Europeana researchCARARE workshop: Europeana research
CARARE workshop: Europeana researchEuropeana
 
A deer for Europe
A deer for EuropeA deer for Europe
A deer for Europeashtreex
 
Europeana essentials June 2013
Europeana essentials June 2013Europeana essentials June 2013
Europeana essentials June 2013Europeana
 
Europeana Essentials - Latest
Europeana Essentials - LatestEuropeana Essentials - Latest
Europeana Essentials - LatestEuropeana
 
Why join Europeana?
Why join Europeana?Why join Europeana?
Why join Europeana?Europeana
 

Ähnlich wie Multilingual challenges in Europeana (20)

Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
 
03 isaac dm2-e14-full
03 isaac dm2-e14-full03 isaac dm2-e14-full
03 isaac dm2-e14-full
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der Werf
 
Europeana in a Research Context
Europeana in a Research ContextEuropeana in a Research Context
Europeana in a Research Context
 
Rob Davies : How we got here
Rob Davies : How we got hereRob Davies : How we got here
Rob Davies : How we got here
 
Europeana and Researchers
Europeana and ResearchersEuropeana and Researchers
Europeana and Researchers
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Natural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNatural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshop
 
The European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookThe European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and Outlook
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential Facts
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect Communities
 
Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...
 
77. newsletter d andrea2012
77. newsletter d andrea201277. newsletter d andrea2012
77. newsletter d andrea2012
 
CARARE workshop: Europeana research
CARARE workshop: Europeana researchCARARE workshop: Europeana research
CARARE workshop: Europeana research
 
A deer for Europe
A deer for EuropeA deer for Europe
A deer for Europe
 
Europeana essentials June 2013
Europeana essentials June 2013Europeana essentials June 2013
Europeana essentials June 2013
 
Europeana Essentials - Latest
Europeana Essentials - LatestEuropeana Essentials - Latest
Europeana Essentials - Latest
 
Why join Europeana?
Why join Europeana?Why join Europeana?
Why join Europeana?
 

Mehr von Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Antoine Isaac
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaAntoine Isaac
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovationAntoine Isaac
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansAntoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana missionAntoine Isaac
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaAntoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Antoine Isaac
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018Antoine Isaac
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaAntoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesAntoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSAntoine Isaac
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Antoine Isaac
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and EuropeanaAntoine Isaac
 

Mehr von Antoine Isaac (17)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 

Kürzlich hochgeladen

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Kürzlich hochgeladen (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Multilingual challenges in Europeana

  • 1. Antoine Isaac Information and networking days H2020 / Connecting Europe Facility, Jan 15-16, 2014
  • 2. Europe’s platform to access cultural heritage Currently 30M objects
  • 3. Built on descriptive metadata from a broad, heterogeneous network Audiovisual collections National Aggregators Regional Aggregators Archives Thematic collections Libraries Musées Lausannois Culture.frThe European Library APEX European Film Gateway Europeana Fashion 2,300 galleries, museums, archives and libraries
  • 4. Accessing items from 36 countries top 16 Portal interface in 31 languages Metadata in 33 languages
  • 5. Serving Europe’s citizens 5M visits on Europeana.eu 7M Facebook impressions API use…
  • 6. Content (digital objects on the site of the provider) Metadata (descriptive object information) Public Domain Creative Commons Licenses Rights reserved Orphan work Facilitating re-use on the legal side CC
  • 7. Facilitating re-use on the language side? Our network needs automatic translation tools to address information needs all over Europe
  • 9. Related projects applying NLP tools E.g., The PATHS project has developed techniques to enrich English and Spanish collections 1)Identification of key entities 2)Detection of (typed) similarities between objects, using metadata 3)“Background links” to external resources such as Wikipedia 4)Classification of object against a hierarchy of topic Applying these techniques to other languages would require work 1)requires language-specific tools (PoS tagging, lemmatization) 2)is straightforward to apply to new languages 3)requires language-specific tools 4)depends on (3) and on translation of some topics http://www.paths-project.eu/eng/Resources/Semantic-Enrichment-of-Cultural-Heritage-content-in-PATHS
  • 10. Language challenges for Digital Libraries  Typical queries are very short Average < 2 terms  Identification of query language is not easy, even manually 39% of queries may belong to several languages  Plenty of named entities 60% of queries are for persons & places Not only is it hard for queries: the same issues apply to the descriptive metadata Studies by Humboldt University on Europeana and The European Library http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
  • 11. Language processing issues at the scale of Europe
  • 13. Europeana’s vision and mission  We believe in making cultural heritage openly accessible in a digital way, to promote the exchange of ideas and information  We want to be a catalyst for change in the world of cultural heritage

Hinweis der Redaktion

  1. Les Miserables: Victor Hugo’s handwritten manuscripts: http://www.europeana.eu/portal/record/9200103/5372912AF66AB529E188218BC1F747E75EB1A18F.html BnF, public domain Matisse ‘53 in the form of a double helix’ http://www.europeana.eu/portal/record/9200104/F8D60AB9136C8A59B59DF1CFEC278A6CABA8B0C6.htmlThe Wellcome Library (CC-BY-NC-ND) ‘söprűtánc’ – Hungarian traditional dance http://www.europeana.eu/portal/record/08901/E1A7B01BE4AED87FD239672F4F3941F52262D6B2.html Hungarian Academy of Sciences Institute for Musicology, public domain ‘Neurologico reggae’ Music album http://www.europeana.eu/portal/record/08901/ADC241BCBF8470988DBA6EEAFCF13F14D88E5534.html DISMARC – EuropeanaConnect Paid Access ‘Castle of Kavala’ 3D exploration of a Greek castle http://www.europeana.eu/portal/record/2020703/05607B24D15BD516EE2B765F74CDA39C7427F7FB.html Cultural and Educational Technology Institute - Research Centre Athen CARARE CC-BY-NC-ND
  2. All partners send us descriptions of their assets, which we aggregate in a single service
  3. Germany 15.44% France 10.97% Netherlands 9.67% Sweden 9.44% Spain 9.98% UK 6.98% Norway 6.60% Italy 5.4% Ireland 4.04% Poland 4.02% Europe 3.95% Finland 2.95% Austria 2.05% Belgium 1.61% Hungary 1.26%
  4. http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
  5. Users from everywhere Data from everywhere Tools from everywhere http://europeana.eu/portal/record/2022347/B7C7D15C23C28EFD3FA25147ED3A580757CFBB04.html http://europeana.eu/portal/record/9200103/ark__12148_btv1b6921004c.html