SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Downloaden Sie, um offline zu lesen
What libraries can learn from Google –
and what they can do better
Günter Mühlberger
University Innsbruck Library
Agenda
• Introduction
• A story about digitisation
• The continuation of the story
• Some conclusions
Introduction
• Department for Digitisation and Digital Preservation
– Founded in 2002, 14 FTE, R&D and Digitisation Services
– Since 1998 coordinated several R&D EU projects in the digital library
domain
– Currently involved in several projects, e.g.: IMPACT (mass-digitisation
of textual material, text recognition and language technologies),
Prestoprime (long term preservation of audio-visual material), both
projects will set up a CoC
– Coordinator of the library network eBooks on Demand (EOD) with 30
member libraries in 13 countries: Digitisation on Demand service
• Several medium and large scale digitisation projects + respective
applications for searching, browsing, archiving
– Catalogue cards
– Newspapers and newspaper clippings
– Books and journals
• Our mission
– To make a valuable contribution to an up to date digital library
A short story
• January 2007
– Collection of 30.000 books from a monastery
“Servitenbibliothek” as present to the library
– No spare shelves at the library for such a collection since
a collection of German dissertations occupies the best
magazines
– Suggestion to get rid of the dissertations
– Decision to digitize first and than to throw them away
• During 2007
– Several experiments with document scanners, cutting of
the documents, workflows, etc.
Digitisation of dissertations
• 2008 – mid 2010
– Real production process with two parallel document scanners and
up to 70.000 pages per day, 50.000 pages as average
– Average of 2’ per dissertation (110 pages) including ALL steps in
the workflow
– Convincing scan quality: Tests show that OCR will be nearly perfect
– All extra pages (supplements, tables, etc.) are treated extra
– Single cutting of documents too time consuming
– Change of paper quality
• Summer 2010
– We have processed 216.000 dissertations with 24 mill. pages,
1800 shelf meters
– 400 GB image data (TIFF IV bitonal)
– Overall time invested: 8000 hours or 5 person years
– High quality industrial equipment for less than 50.000 EUR
– Tests for OCR processing the 24 mill. pages are encouraging
Continuation of the story
• How can we give access to this large collection?
– Copyright comes in
• Investigations on Austrian copyright
– We are allowed to scan for preservation purposes. O.k!
– We are allowed to store for preservation. O.k!
– We are allowed to print out a copy and use it instead of what we
had before we digitised everything. Hm!
– We are allowed to use this copy for interlibrary loan – but need to
get it back. Uups!
– We are not allowed to make them available to the public. O.k!
– We are not allowed to make them available to our researchers and
students at the university. Uups!
– We are not allowed to make them available to other libraries
owning the same dissertations. Pff!
– We are allowed to provide access on a handful of dedicated
computers at the library. Mmh!
Some more considerations
• “Making available” is a new kind of use
– Copying, distribution, translation, exhibiting, etc. are traditional use
forms and publisher contracts cover this kind of use
– In 2003 (following the EU Directive on Copyright from 2001) a new
kind of use was introduced: “making available”
– Since this is a new right “old” contracts (usually) do not cover this
right.
– The author is therefore the right holder, not the publisher.
– In some countries it is more complicated (e.g. Germany) but as a rule
of thumb most authors in Europe still have the right to decide by
whom, when and how their digitised work will be made available to
the public
• Dissertations
– Even simpler since no publishers or RROs are involved
– Dissertations were printed on behalf of the authors, never distributed
via the book market
Our approach to copyright
• Let’s the social Internet work for us
– Dissertations will be made available online, but only title page,
table of contents and abstract/introduction will be shown to
everyone
– Under discussion: Maybe also some more pages and search
snippets
– Readers will get the chance to write a short “Request”: I would
need this book for my scientific work, etc.
– Readers will be encouraged to contact potential right holders (“Do
the diligent search for us”)
• Registration mechanism
– A big displayer will appear: If you are the author or if you know the
author/right holder – please help us!
– Authors will need to register (personal coordinates), set some
options and confirm their statement
Authorisation
• Copyright options
– They may want to make a general statement: Open Access,
Creative Commons, All rights reserved
– A cooperation with authors organisation (RRO) will make sense
– Or they may want to make a specific statement: This library is
allowed to do that and that. Than it is a simple bilateral, non-
exclusive contract.
• How to identify the right holder?
– Digital signatures or eCards would make life much easier.
• Current plan:
– Author provides address.
– He receives a letter with a list of TAN codes which will be needed
for any action within the system.
– If he chooses to “reserve all rights” the data are transferred to the
RRO(s)
– Minimal risk remains but can be neglected
Our dream
• We hope
– That it becomes a “self-runner” where those who need the
information will convince those who have the rights to
provide free access – or at least provide some access rights
for libraries
– That authors will understand why it is so important that
libraries digitise current material and provide access to
everyone
– That users will understand that authors have rights
(copyright and personal rights) which need to be respected
– That RROs and publishers will understand that not everyone
is interested in “making money with books written 30 years
ago” but that many are also willing to support the idea of
open access
– That thousands and ten-thousands of authors and readers
will take part
What we can learn from Google
• Mission of Google is to organise the information universe based on
technological innovation
– Therefore books are highly important (they contain much better information
than websites)
– Digitisation of books was just one step towards the overall objective
• If you have a mission, do the first step first and afterwards sort out the
problems
– Organise the cheapest way to scan, build your own machines, workflow, etc.
– Make a reasonable compromise between quantity and quality
– Be innovative (take what is here but put it together in a new way)
• Convert problems into chances
– Rather sure that Google underestimated the impact of copyright
– Settlement was probably not foreseen from the very beginning, but now it is a
great business opportunity for them
– If it comes, it will allow them to make a lot of money
• Battle on books is won /lost in the 20th century not in the public
domain
– Who reads books from 19th, 18th or 17th century?
What libraries can do better
• Libraries also need to follow their mission: to preserve the
intellectual heritage of mankind and to provide free access
to everyone
– Google is not a library
– It does many things as if it were a library (and better), but it never
will become a library
– Preservation comprises analogue AND digital preservation (go hand
in hand)
• to digitise (collect) everything
– Libraries are collection holders, not Google or anyone else
– Digitisation (and everything what is connected) has to be part of
the daily business and not only of projects
– Digitisation should be twofold: on-demand AND via mass
digitisation (including cutting of documents and 20th century
material)
– A natural consequence is to also collect modern material in digital
format (right from the beginning, pre-press files)
What libraries can do better
• to cooperate among each other (nationally and internationally)
– Most libraries have the same books, even duplicates within an
institution
– Swedish books in Austria, German books in Sweden, etc.
– Open access material will no longer belong to one library, but to
everyone!
– Therefore it makes definitely sense to cut one book and store the
pages digitally and analogue (acid free box)
• to involve readers (and right holders)
– Libraries have a “natural authority” which needs to be exploited as a
market advantage
– Libraries are much nearer to authors and readers than anyone else, but
they need to give them the chance to express themselves
– They may be slow, old-fashioned and technologically not on the fore-
front but they are trustful organisations and are able to mobilise
thousands or even hundred thousands of users
Let’s go to work!

Weitere ähnliche Inhalte

Was ist angesagt?

Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Vangelis Banos
 
Danish library association and the danish digital library
Danish library association and the danish digital libraryDanish library association and the danish digital library
Danish library association and the danish digital libraryMichel Steen-Hansen
 
What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?Adrian Kingston
 
Hohmann liber2006text
Hohmann liber2006textHohmann liber2006text
Hohmann liber2006textTina Hohmann
 
BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013eimgreece
 

Was ist angesagt? (6)

Danish Library Association - voorstelling door Hellen Niegaard
Danish Library Association - voorstelling door Hellen NiegaardDanish Library Association - voorstelling door Hellen Niegaard
Danish Library Association - voorstelling door Hellen Niegaard
 
Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!
 
Danish library association and the danish digital library
Danish library association and the danish digital libraryDanish library association and the danish digital library
Danish library association and the danish digital library
 
What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?
 
Hohmann liber2006text
Hohmann liber2006textHohmann liber2006text
Hohmann liber2006text
 
BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013
 

Andere mochten auch

Slidecast final draft
Slidecast final draftSlidecast final draft
Slidecast final drafthmzucker
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbsIsra1976
 
Aperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMAperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMJoke Hadermann
 
Tem for incommensurately modulated materials
Tem for incommensurately modulated materialsTem for incommensurately modulated materials
Tem for incommensurately modulated materialsJoke Hadermann
 
Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Joke Hadermann
 
Complementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesComplementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesJoke Hadermann
 
Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Joke Hadermann
 
Mapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsMapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsJoke Hadermann
 
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Joke Hadermann
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbsIsra1976
 
New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"Joke Hadermann
 
TEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionTEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionJoke Hadermann
 
Scheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperScheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperJoke Hadermann
 
TEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionTEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionJoke Hadermann
 

Andere mochten auch (16)

Slidecast final draft
Slidecast final draftSlidecast final draft
Slidecast final draft
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbs
 
Slide
SlideSlide
Slide
 
Slidecast
SlidecastSlidecast
Slidecast
 
Aperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMAperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEM
 
Tem for incommensurately modulated materials
Tem for incommensurately modulated materialsTem for incommensurately modulated materials
Tem for incommensurately modulated materials
 
Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...
 
Complementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesComplementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniques
 
Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...
 
Mapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsMapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compounds
 
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbs
 
New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"
 
TEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionTEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffraction
 
Scheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperScheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paper
 
TEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionTEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffraction
 

Ähnlich wie Muehlberger umea google

You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?The European Library
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02The European Library
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?David Haskiya
 
The Public Domain Charter
The Public Domain CharterThe Public Domain Charter
The Public Domain CharterPaul Keller
 
Save this book: posterity’s challenge
Save this book: posterity’s challengeSave this book: posterity’s challenge
Save this book: posterity’s challengeBookrepublic
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital libraryJakobheide
 
eReading talk SJSU 2012 01
eReading talk SJSU 2012 01eReading talk SJSU 2012 01
eReading talk SJSU 2012 01TAPintoIT
 
Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Phonothèque MMSH
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mbguestd67478
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...The European Library
 

Ähnlich wie Muehlberger umea google (20)

You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
Proyecto Arrow. Ana Manchado Mangas
Proyecto Arrow. Ana Manchado MangasProyecto Arrow. Ana Manchado Mangas
Proyecto Arrow. Ana Manchado Mangas
 
Save This Book
Save This BookSave This Book
Save This Book
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Europeana en CARARE
Europeana en CARAREEuropeana en CARARE
Europeana en CARARE
 
31 36
31 3631 36
31 36
 
Sistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLCSistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLC
 
Luca Martinelli Europeana
Luca Martinelli EuropeanaLuca Martinelli Europeana
Luca Martinelli Europeana
 
Digitallibrary
DigitallibraryDigitallibrary
Digitallibrary
 
The Public Domain Charter
The Public Domain CharterThe Public Domain Charter
The Public Domain Charter
 
Save this book: posterity’s challenge
Save this book: posterity’s challengeSave this book: posterity’s challenge
Save this book: posterity’s challenge
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital library
 
eReading talk SJSU 2012 01
eReading talk SJSU 2012 01eReading talk SJSU 2012 01
eReading talk SJSU 2012 01
 
Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mb
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
 

Kürzlich hochgeladen

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 

Kürzlich hochgeladen (20)

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Muehlberger umea google

  • 1. What libraries can learn from Google – and what they can do better Günter Mühlberger University Innsbruck Library
  • 2. Agenda • Introduction • A story about digitisation • The continuation of the story • Some conclusions
  • 3. Introduction • Department for Digitisation and Digital Preservation – Founded in 2002, 14 FTE, R&D and Digitisation Services – Since 1998 coordinated several R&D EU projects in the digital library domain – Currently involved in several projects, e.g.: IMPACT (mass-digitisation of textual material, text recognition and language technologies), Prestoprime (long term preservation of audio-visual material), both projects will set up a CoC – Coordinator of the library network eBooks on Demand (EOD) with 30 member libraries in 13 countries: Digitisation on Demand service • Several medium and large scale digitisation projects + respective applications for searching, browsing, archiving – Catalogue cards – Newspapers and newspaper clippings – Books and journals • Our mission – To make a valuable contribution to an up to date digital library
  • 4. A short story • January 2007 – Collection of 30.000 books from a monastery “Servitenbibliothek” as present to the library – No spare shelves at the library for such a collection since a collection of German dissertations occupies the best magazines – Suggestion to get rid of the dissertations – Decision to digitize first and than to throw them away • During 2007 – Several experiments with document scanners, cutting of the documents, workflows, etc.
  • 5. Digitisation of dissertations • 2008 – mid 2010 – Real production process with two parallel document scanners and up to 70.000 pages per day, 50.000 pages as average – Average of 2’ per dissertation (110 pages) including ALL steps in the workflow – Convincing scan quality: Tests show that OCR will be nearly perfect – All extra pages (supplements, tables, etc.) are treated extra – Single cutting of documents too time consuming – Change of paper quality • Summer 2010 – We have processed 216.000 dissertations with 24 mill. pages, 1800 shelf meters – 400 GB image data (TIFF IV bitonal) – Overall time invested: 8000 hours or 5 person years – High quality industrial equipment for less than 50.000 EUR – Tests for OCR processing the 24 mill. pages are encouraging
  • 6. Continuation of the story • How can we give access to this large collection? – Copyright comes in • Investigations on Austrian copyright – We are allowed to scan for preservation purposes. O.k! – We are allowed to store for preservation. O.k! – We are allowed to print out a copy and use it instead of what we had before we digitised everything. Hm! – We are allowed to use this copy for interlibrary loan – but need to get it back. Uups! – We are not allowed to make them available to the public. O.k! – We are not allowed to make them available to our researchers and students at the university. Uups! – We are not allowed to make them available to other libraries owning the same dissertations. Pff! – We are allowed to provide access on a handful of dedicated computers at the library. Mmh!
  • 7. Some more considerations • “Making available” is a new kind of use – Copying, distribution, translation, exhibiting, etc. are traditional use forms and publisher contracts cover this kind of use – In 2003 (following the EU Directive on Copyright from 2001) a new kind of use was introduced: “making available” – Since this is a new right “old” contracts (usually) do not cover this right. – The author is therefore the right holder, not the publisher. – In some countries it is more complicated (e.g. Germany) but as a rule of thumb most authors in Europe still have the right to decide by whom, when and how their digitised work will be made available to the public • Dissertations – Even simpler since no publishers or RROs are involved – Dissertations were printed on behalf of the authors, never distributed via the book market
  • 8. Our approach to copyright • Let’s the social Internet work for us – Dissertations will be made available online, but only title page, table of contents and abstract/introduction will be shown to everyone – Under discussion: Maybe also some more pages and search snippets – Readers will get the chance to write a short “Request”: I would need this book for my scientific work, etc. – Readers will be encouraged to contact potential right holders (“Do the diligent search for us”) • Registration mechanism – A big displayer will appear: If you are the author or if you know the author/right holder – please help us! – Authors will need to register (personal coordinates), set some options and confirm their statement
  • 9. Authorisation • Copyright options – They may want to make a general statement: Open Access, Creative Commons, All rights reserved – A cooperation with authors organisation (RRO) will make sense – Or they may want to make a specific statement: This library is allowed to do that and that. Than it is a simple bilateral, non- exclusive contract. • How to identify the right holder? – Digital signatures or eCards would make life much easier. • Current plan: – Author provides address. – He receives a letter with a list of TAN codes which will be needed for any action within the system. – If he chooses to “reserve all rights” the data are transferred to the RRO(s) – Minimal risk remains but can be neglected
  • 10. Our dream • We hope – That it becomes a “self-runner” where those who need the information will convince those who have the rights to provide free access – or at least provide some access rights for libraries – That authors will understand why it is so important that libraries digitise current material and provide access to everyone – That users will understand that authors have rights (copyright and personal rights) which need to be respected – That RROs and publishers will understand that not everyone is interested in “making money with books written 30 years ago” but that many are also willing to support the idea of open access – That thousands and ten-thousands of authors and readers will take part
  • 11. What we can learn from Google • Mission of Google is to organise the information universe based on technological innovation – Therefore books are highly important (they contain much better information than websites) – Digitisation of books was just one step towards the overall objective • If you have a mission, do the first step first and afterwards sort out the problems – Organise the cheapest way to scan, build your own machines, workflow, etc. – Make a reasonable compromise between quantity and quality – Be innovative (take what is here but put it together in a new way) • Convert problems into chances – Rather sure that Google underestimated the impact of copyright – Settlement was probably not foreseen from the very beginning, but now it is a great business opportunity for them – If it comes, it will allow them to make a lot of money • Battle on books is won /lost in the 20th century not in the public domain – Who reads books from 19th, 18th or 17th century?
  • 12. What libraries can do better • Libraries also need to follow their mission: to preserve the intellectual heritage of mankind and to provide free access to everyone – Google is not a library – It does many things as if it were a library (and better), but it never will become a library – Preservation comprises analogue AND digital preservation (go hand in hand) • to digitise (collect) everything – Libraries are collection holders, not Google or anyone else – Digitisation (and everything what is connected) has to be part of the daily business and not only of projects – Digitisation should be twofold: on-demand AND via mass digitisation (including cutting of documents and 20th century material) – A natural consequence is to also collect modern material in digital format (right from the beginning, pre-press files)
  • 13. What libraries can do better • to cooperate among each other (nationally and internationally) – Most libraries have the same books, even duplicates within an institution – Swedish books in Austria, German books in Sweden, etc. – Open access material will no longer belong to one library, but to everyone! – Therefore it makes definitely sense to cut one book and store the pages digitally and analogue (acid free box) • to involve readers (and right holders) – Libraries have a “natural authority” which needs to be exploited as a market advantage – Libraries are much nearer to authors and readers than anyone else, but they need to give them the chance to express themselves – They may be slow, old-fashioned and technologically not on the fore- front but they are trustful organisations and are able to mobilise thousands or even hundred thousands of users
  • 14. Let’s go to work!