Building Bridges: from Europeana Libraries to Europeana Newspapers
Susan Reilly, LIBER
Twitter: @skreilly
IFLA Newspapers/GENLOC, Helsinki, 13th Aug 2012
Overview

About LIBER
Introduction to Europeana Newspapers
The foundation stone: Europeana Libraries




      This project is partially funded under the ICT Policy Support Programme (ICT PSP)
      as part of the Competitiveness and Innovation Framework Programme by the
      European Community http://ec.europa.eu/ict_psp
LIBER & the European Digital Agenda

Association of European Research Libraries
    Our projects:
  Content
       Europeana Libraries
       Europeana Newspapers
  Policy
       MEDOANET
  Infrastructure
       APARSEN
       AAA Study
       ODE
Europeana Newspapers

• 17 partner institutions
• 3 years (2012-2015)
• Aggregation of more than 18 million newspapers
• Will use refinement methods for OCR, OLR (article
  segmentation), named entity recognition (NER), and page
  class recognition
• Survey existing collections in Europe
• Make content accessible


Why newspapers?

“The museum (and the
 newspaper) today seeks
 whatever represents normal life
 in its own native locality and
 with infinite pains its collections
 are arranged in a manner which
 is natural to them in their own
 habitat”
                      Lucy Maynard Salmon (1976) in The Newspaper and the Historian

Europeana Newspapers: where the content comes from…

We are looking for more libraries!

[Map of project partners: NLE, LIBER, NLF, SUB HH, NLL, CCS, USAL, NLP, BL, KB, SBB, ONB, UIBK, NLT, BnF, UB, LFT]
What we do with the content

• Select 10 million items to be OCR’d
  • Structural information by UIBK, e.g. headings, table of contents
• Select 2 million items for OCR and OLR
  • Article segmentation and page class recognition by CCS
• Libraries carry out manual correction of recognition and
  segmentation results
• Named entity recognition applied to English, Dutch and
  German material




Making the content accessible

• OCR enables full text searching
• OLR enables more targeted searching (titles and sections)
• NER enables searching by people and place, and the discovery of
  new relationships between entities
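
The bullets above can be illustrated with a minimal sketch. It is not the project's actual tooling: the sample articles, the entity labels, and all function names below are hypothetical, and it assumes OCR text and NER output are already available for each article.

```python
from collections import defaultdict

# Hypothetical sample data: two OCR'd articles, each with the entities
# that a NER step might have produced for it.
ARTICLES = {
    "a1": {
        "text": "The mayor of Helsinki opened the new library",
        "entities": {"place": ["Helsinki"]},
    },
    "a2": {
        "text": "Lucy Salmon lectured on the newspaper as a historical source",
        "entities": {"person": ["Lucy Salmon"]},
    },
}


def build_indexes(articles):
    """Build a word index (full text) and an entity index (people, places)."""
    text_index = defaultdict(set)
    entity_index = defaultdict(set)
    for article_id, article in articles.items():
        # Full-text index: every lowercased word points to its articles.
        for word in article["text"].lower().split():
            text_index[word].add(article_id)
        # Entity index: (type, name) pairs point to their articles.
        for entity_type, names in article["entities"].items():
            for name in names:
                entity_index[(entity_type, name.lower())].add(article_id)
    return text_index, entity_index


def search_text(text_index, word):
    """Return the ids of articles whose OCR text contains the word."""
    return sorted(text_index.get(word.lower(), set()))


def search_entity(entity_index, entity_type, name):
    """Return the ids of articles that mention the named entity."""
    return sorted(entity_index.get((entity_type, name.lower()), set()))


text_index, entity_index = build_indexes(ARTICLES)
```

With these indexes, `search_text(text_index, "library")` looks up a word anywhere in the OCR text, while `search_entity(entity_index, "place", "Helsinki")` only matches articles where the name was recognised as a place.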




No access without aggregation

• Europeana Libraries
  •   A single library domain aggregator
  •   Content from European research libraries
  •   Full-text search capabilities
  •   Portal for researchers

Access = Sustainability
Access = Visibility




Go to www.theeuropeanlibrary.org
Thank you for your attention!
http://www.libereurope.eu
http://www.europeana-newspapers.eu/
http://www.europeana-libraries.eu/
Hall 4/5, stand H104

Editor's notes

  1. Before we get into the drivers and barriers for data sharing I would like to ‘share’ 2 things about me with you. First of all, I am a librarian. I work as project officer for LIBER, which is the Association of European Research Libraries. We have 380 member libraries from all over Europe. Our projects focus on developing the role of the library as part of the Europeana Research Infrastructure, and they fall into 3 main categories.
  2. To this... How do we get from the image of the research we have built up to a dedicated pan-European research portal with content from practically all the research libraries in Europe, including bibliographic records, full text and special tools for researchers: all the things that we know researchers want? Well, of course I’m going to say through partnership, through enabling national, university and other research libraries to work together to build this service and provide research content in a sustainable manner. Which is what the Europeana Libraries project sets out to do…