Europeana Newspapers
Alastair Dunning
Programme Manager, The European Library
@alastairdunning, alastair.dunning AT kb.nl
LIBER Conference, June 2013, Munich

Surveying Newspaper Digitisation in European
Libraries, Then Aggregating Them !

This presentation is at http://www.slideshare.net/alastairdunning
On November 3, 1948,
the early edition of the
Chicago Tribune
proclaimed Thomas
Dewey as winner of the
US presidential
campaign

http://www.chicagotribune.com/news/politics/chi-histdewey_defeats_an20080104104816,0,547284.photo
In actual fact, the
campaign was won by
Harry Truman, the
incumbent 33rd
President of the United
States

http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
Later editions of the
Chicago Tribune
corrected this mistake
with the headline
"DEMOCRATS MAKE
SWEEP OF STATE
OFFICES"
However, I cannot find
these online!
http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
As we shall see, presenting
comprehensive digital archives,
where everything is digitised, is
difficult... yet this is what users
often demand !
"This lack of collocation and collection
presents efficiency challenges and deepens
scholars’ concerns about
comprehensiveness. The anxiety over
“missing something” was quite common
across interviews."

Ithaka S+R, Supporting the Changing
Research Practices of Historians,
http://www.sr.ithaka.org/research-publications/supporting-changingresearch-practices-historians
"When lined up against the non-digital
object upon which it is based, the digital
object can only ever appear impoverished."

Jim Mussell, Historian at
University of Birmingham
http://jimmussell.com/2013/05/23/the-proximal-pastdigital-archives-and-the-here-and-now/
Genealogists - those studying family
history
"Genealogists represent the majority of
users in many archives. And yet, the
traditional archival information system
does not meet their needs."

Wendy M. Duff, Catherine A. Johnson, Where Is the
List with All the Names? Information-Seeking Behavior
of Genealogists, American Archivist, Volume 66(1),
2003, http://archivists.metapress.com/content/L375UJ047224737N
Despite this, European
libraries have made great
strides in digitising their
newspapers
(These results are taken from the first
Europeana Newspapers
survey, 2012. 47 libraries
responded.)
http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeananewspapers-survey-report.pdf
129,041,663 pages
from
23,987 titles
11 libraries have digitised more than 3m pages
1. National Library of the Czech Republic
2. Koninklijke Bibliotheek van België
3. National Library of Spain
4. National Library of Norway
5. National and University Library of Iceland
6. BCU Lausanne
7. Hamburg State and University Library
8. Bibliothèque nationale de France
9. British Library
10. Koninklijke Bibliotheek
11. Austrian National Library
But only 12 (26%) of the libraries had digitised more than 10% of their collection
(either in terms of titles or page numbers)
National Library of Luxembourg
620,000 pages digitised
4,000,000 pages in collection
National Library of Finland
620,000 pages digitised
2,010,246 pages in collection
Hamburg State and University Library
c. 2,000,000 pages digitised
c. 12,000,000 pages in collection
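The three examples above can be turned into a digitised share with simple arithmetic. The sketch below uses only the page counts quoted on the slides; the Hamburg figures are approximate in the source, and the helper name `digitised_share` is invented for illustration.

```python
# Figures as quoted on the slides: (pages digitised, pages in collection).
# The Hamburg numbers are approximate ("c.") in the source.
collections = {
    "National Library of Luxembourg": (620_000, 4_000_000),
    "National Library of Finland": (620_000, 2_010_246),
    "Hamburg State and University Library": (2_000_000, 12_000_000),
}


def digitised_share(digitised: int, total: int) -> float:
    """Return the digitised part of a collection as a percentage."""
    return 100 * digitised / total


shares = {name: digitised_share(d, t) for name, (d, t) in collections.items()}
for name, share in shares.items():
    print(f"{name}: {share:.1f}% digitised")
```

On these figures, all three libraries sit above the 10% threshold mentioned earlier, but well short of a comprehensive archive.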
What else did the survey discover ?
Access to digitised newspapers is nearly always
free of charge. At least 40 libraries (85%) offered free access to their digitised
newspapers.

One library had pay per view, whilst another three offered
subscription services for users (i.e. paid access per day or per
month).
Only four libraries licensed their newspaper contents to
other groups (e.g. schools, universities).
Access to twentieth-century content remains
problematic.
27 out of 47 libraries (57%) have a cut-off date
beyond which they will not publish digitised newspapers on
the web. Most frequently, this is based on a 70-year sliding
scale.

23% (11 out of 47) had an agreement with a rights
organisation so that in-copyright digitised newspapers could
be published, but often restricted to individual titles
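A 70-year sliding scale can be read as a moving wall: each year, one more year of newspapers becomes publishable. The sketch below assumes the rule is applied by calendar year of publication, which is a simplification; actual rules differ between libraries and jurisdictions, and the function names are invented for illustration.

```python
from datetime import date

SLIDING_WINDOW_YEARS = 70  # the most frequent value reported in the survey


def cutoff_year(today: date, window: int = SLIDING_WINDOW_YEARS) -> int:
    """Latest publication year that may go online under the moving wall."""
    return today.year - window


def may_publish(issue_year: int, today: date) -> bool:
    """True if an issue from issue_year is on the permitted side of the cut-off."""
    return issue_year <= cutoff_year(today)


print(cutoff_year(date(2013, 6, 1)))  # 1943
```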
There is still much to be done to exploit the richness
of digitised newspaper content

64% (37 of 47) of libraries made use of OCR

But only 17 of these libraries (36%) exposed the resulting full text to the viewer

36% had undertaken zoning and segmentation, and only six libraries (13%) had included features such as faceted browsing or extracting entities such as place or name
--> Motivation for Europeana
Newspapers
Other WPs will explain the process of
improving digitised archives, but I
want to return to one earlier
quote
"... the lack of comprehensive search
tools for primary sources ..."
Locating primary sources presents a
crucial challenge for researchers.
--> TEL aggregator as part of
Europeana Newspapers project
Timetable: Early version with
limited content added to The
European Library website in
September 20
More content being added in 2013
and 2014
http://theeuropeanlibrary.org will
deliver a search interface to help
locate 18m pages digitised
at European libraries
Users will also be able to search
over titles of newspapers. Title
metadata will also be forwarded to
Europeana
Some Issues:
Copyright means that some
images cannot be shared at all,
only metadata (e.g. names and
dates of newspapers)
Some Issues:
OCR and zoning quality will affect
search results significantly. E.g.
pages with higher-quality OCR will be
returned more often in search
results
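A toy illustration of why OCR quality affects retrieval, assuming a naive search that requires every query term to appear exactly in the OCR text. The sample strings and the `search` helper are invented for this sketch; they are not the aggregator's implementation.

```python
import re

# Hypothetical OCR output for the same headline at two quality levels.
pages = {
    "clean_ocr": "DEWEY DEFEATS TRUMAN",
    "noisy_ocr": "DEVVEY DEFEAT5 TRUMAN",
}


def tokens(text: str) -> set[str]:
    """Lower-case the text and split it into alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def search(query: str) -> list[str]:
    """Return ids of pages whose OCR text contains every query term exactly."""
    terms = tokens(query)
    return sorted(pid for pid, text in pages.items() if terms <= tokens(text))


print(search("dewey"))   # ['clean_ocr']
print(search("truman"))  # ['clean_ocr', 'noisy_ocr']
```

The noisy page is only found for the word the OCR happened to get right, so it surfaces less often than the clean one.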
Some Issues:
Some pages have no OCR
whatsoever - more difficult to find
Some Issues:
Different libraries are willing to
share different amounts of
content
Some libraries are happy for full
content to be shared; for others it
is just snippets of images
Last Thoughts and What Next ?:
The European Library will sustain access
beyond project funding; but adding more
content will require membership of TEL
How can we allow for transcription?
What do non-academic users want?
How do we create full-text APIs ?
Oh, the results here
were all based on the
first edition of the
project survey.
If your library wants to
contribute to later
editions, see the links below by
July 2013
http://www.europeana-newspapers.eu/tell-us-about-your-newspaperdigitisation-project/
http://www.surveymonkey.com/s/BQ28579


  • 1. Europeana Newspapers Alastair Dunning Programme Manager, The European Library @alastairdunning, alastair.dunning AT kb.nl LIBER Conference, June 2013, Munich Surveying Newspaper Digitisation in European Libraries, Then Aggregating Them ! This presentation is at http://www.slideshare.net/alastairdunning
  • 2. On November 3, 1948, the early edition of the Chicago Tribune proclaimed Thomas Dewey as winner of the US presidential campaign http://www.chicagotribune.com/news/politics/chi-histdewey_defeats_an20080104104816,0,547284.photo
  • 3. In actual fact, the campaign was won by Harry Truman, who became the 33rd President of the United States http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
  • 4. Later editions of the Chicago Tribune corrected this mistake with headline "DEMOCRATS MAKE SWEEP OF STATE OFFICES" However, I cannot find these online ! http://en.wikipedia.org/wiki/File:Deweytruman12.jpg
  • 5. As we shall see, presenting comprehensive digital archives, where everything is digitised, is difficult... yet this is what users often demand !
  • 6. "This lack of collocation and collection presents efficiency challenges and deepens scholars’ concerns about comprehensiveness. The anxiety over “missing something” was quite common across interviews." Ithaka S+R, Supporting the Changing Research Practices of Historians, http://www.sr.ithaka.org/research-publications/supporting-changingresearch-practices-historians
  • 7. "When lined up against the non-digital object upon which it is based, the digital object can only ever appear impoverished." Jim Mussell, Historian at University of Birmingham http://jimmussell.com/2013/05/23/the-proximal-pastdigital-archives-and-the-here-and-now/
  • 8. Genealogists - those studying family history "Genealogists represent the majority of users in many archives. And yet, the traditional archival information system does not meet their needs." Wendy M. Duff, Catherine A. Johnson, Where Is the List with All the Names? Information-Seeking Behavior of Genealogists, American Archivist, Volume 66(1), 2003, http://archivists.metapress.com/content/L375UJ047224737N
  • 9. Despite this, European libraries have made great strides in digitising their newspapers (These results taken from first Europeana Newspapers survey, 2012. 47 libraries responded.) http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeananewspapers-survey-report.pdf
  • 11.
  • 12. 11 libraries have digitised more than 3m pages: 1. National Library of Czech Republic; 2. Koninklijke Bibliotheek van België; 3. National Library of Spain; 4. National Library of Norway; 5. National and University Library of Iceland; 6. BCU Lausanne; 7. Hamburg State and University Library; 8. Bibliothèque nationale de France; 9. British Library; 10. Koninklijke Bibliotheek; 11. Austrian National Library
  • 13. But only 12 (26%) of the libraries had digitised more than 10% of their collection (either in terms of titles or page numbers)
  • 14. National Library of Luxembourg 4.000.000 pages in collection 620.000 pages digitised
  • 15. National Library of Finland 620.000 pages digitised 2.010.246 pages in collection
  • 16. Hamburg State and University Library c. 2.000.000 pages digitised c. 12.000.000 pages in collection
  • 17. What else did the survey discover ?
  • 18. Access to digitised newspapers is nearly always free of charge. At least 40 (85%) offered free access to their digitised newspapers. One library had pay-per-view, whilst another three offered subscription services for users (i.e. paid access per day or per month). Only four libraries licensed their newspaper content to other groups (e.g. schools, universities).
  • 19. Access to twentieth-century content remains problematic. 27 out of 47 libraries (57%) have a cut-off date beyond which they will not publish digitised newspapers on the web. Most frequently, this is based on a 70-year sliding scale. 23% (11 out of 47) had an agreement with a rights organisation so that in-copyright digitised newspapers could be published, but this was often restricted to individual titles.
  • 20. There is still much to be done to exploit the richness of digitised newspaper content. 64% (37 from 47) of libraries made use of OCR. But only 17 of these libraries (36%) exposed the resulting full text to the viewer. 36% had undertaken zoning and segmentation, and only six libraries (13%) had included features such as faceted browsing or extracting entities such as place or name.
  • 21. --> Motivation for Europeana Newspapers. Other WPs will explain the process of improving digitised archives, but I want to return to one earlier quote
  • 22. "... the lack of comprehensive search tools for primary sources ..." Locating primary sources presents a crucial challenge for researchers. --> TEL aggregator as part of Europeana Newspapers project
  • 23. Timetable: Early version with limited content added to The European Library website in September 20 More content being added in 2013 and 2014
  • 24. http://theeuropeanlibrary.org will deliver a search interface to help locate 18m pages digitised at European libraries. Users will also be able to search over titles of newspapers. Title metadata will also be forwarded to Europeana
  • 25. Some Issues: Copyright means that some images cannot be shared at all, only metadata (e.g. names and dates of newspapers)
  • 26. Some Issues: OCR and zoning quality will affect search results significantly. E.g. pages with higher-quality OCR will be returned more often in search results
  • 27. Some Issues: Some pages have no OCR whatsoever - more difficult to find
  • 28. Some Issues: Different libraries are willing to share different amounts of content Some libraries happy for full content to be shared; for others it is just snippets of images
  • 29. Last Thoughts and What Next ?: The European Library will sustain access beyond project funding; but adding more content will require membership of TEL How can we allow for transcription? What do non-academic users want? How do we create full-text APIs ?
  • 30. Oh, the results here were all based on the first edition of the project survey. If your library wants to contribute to later editions, see the links below by July 2013: http://www.europeana-newspapers.eu/tell-us-about-your-newspaperdigitisation-project/ http://www.surveymonkey.com/s/BQ28579
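
The collection figures on slides 14 to 16 can be turned into the "share digitised" measure that slide 13 uses. The following is only a minimal sketch of that arithmetic: the page counts are the ones quoted on the slides (the Hamburg figures are approximate), while the dictionary and function names are illustrative and not part of the survey.

```python
# Page counts as quoted on slides 14-16: (pages digitised, pages in collection).
# The Hamburg figures are marked "c." (approximate) on the slide.
collections = {
    "National Library of Luxembourg": (620_000, 4_000_000),
    "National Library of Finland": (620_000, 2_010_246),
    "Hamburg State and University Library": (2_000_000, 12_000_000),
}


def digitised_share(digitised, total):
    """Return digitised pages as a percentage of the whole collection."""
    return 100 * digitised / total


# Hypothetical helper structure: library name -> percentage digitised.
shares = {
    name: digitised_share(digitised, total)
    for name, (digitised, total) in collections.items()
}

for name, share in shares.items():
    print(f"{name}: {share:.1f}% of pages digitised")
```

By page count, each of these three examples sits above the 10% threshold mentioned on slide 13, yet the large majority of every collection is still undigitised.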