SlideShare ist ein Scribd-Unternehmen logo
1 von 47
Downloaden Sie, um offline zu lesen
digital projects best
     practices

        Frederick Zarndt

   frederick@frederickzarndt.com




                                   1
how’s and what’s of a
          digital archive / library

•    what is a (good) digital library ?
•    how should a digital library be designed ?
•    how should a digital library be created ?
•    how is a digital library measured ?
•    how should a digital project be executed ?
•    how should a digital library or a digital project be
     managed ?




                                                            2
why a digital project?


•  to enhance accessibility of the content in libraries
   and archives
•  to increase collaboration and cooperation between
   libraries and archives around the world
•  to promote research
•  to provide opportunities for entrepreneurs




                                                          3
digital projects overview


•  collections: organized groups of digital
   objects




                                              4
digital collections
Library and Archives Canada




                              5
digital projects overview


•  collections: organized groups of digital
   objects
•  objects: digital materials




                                              6
digital object
issue from the California Digital Newspaper Collection




                                                   7
digital projects overview


•  collections: organized groups of digital
   objects
•  objects: digital materials
•  metadata: information about objects and
   collections




                                              8
digital object metadata
metadata from the Singapore National Library




                                               9
project phases


•    assess
•    design
•    implement
•    measure
•    preserve
•    manage



                      10
assess

•    select the collection or content
•    define the goals
•    identify the users
•    identify ownership and legal risks
•    identify applicable standards
•    evaluate capabilities




                                          11
design: standards

•  METS XML for descriptive, structural, technical,
   and administrative metadata
•  descriptive metadata
    •  Metadata Object Description Standard
       (MODS) selected metadata from MARC
    •  Dublin Core fundamental group of text
       elements for describing and cataloging
•  technical metadata
    •  ALTO for OCR text
    •  PREMIS for digital preservation
    •  MIX for images
                                                      12
design: standards

•  image standards
    •  TIFF
    •  JPEG2000
    •  JPEG
    •  ANSI/NISO Z39.87
•  file standards
    •  PDF, PDF/A, PDF/A-1b, PDF/A-1a
    •  TEI
•  record standards
    •  ISAD(G)
    •  ERA
                                        13
design: access

•  user community
•  user interface (UI)
•  search
•  authentication and user
   management
•  digital object presentation
•  portability
•  administration




                                 14
implement: pilot


  create requirements and acceptance criteria
  repeat
  {
     digitize (small) pilot batch
     test data against acceptance criteria
     adjust requirements and acceptance criteria
  }
  until (no more adjustments are necessary)
  digitize more data


NB: pilot batches are VERY VERY important!!
                                                   15
implement: in-house


reasons for in-house production

   •    collection cannot be moved
   •    collection is badly organized
   •    digitization must be done slowly over a long
        period
   •    digitization is very simple




                                                       16
implement: outsource


reasons for outsourced production

   •  originals can’t be scanned in-house because…
      •  equipment is too expensive
      •  output data is beyond staff experience
      •  labor is too expensive
   •  large volume of work in a short time
   •  insufficient space, infrastructure, or staff




                                                     17
implement: software


•    commercial off-the-shelf (COTS)
•    open source
•    customized COTS
•    customized open source
•    custom in-house




                                       18
implement: crowd sourcing


 •    FamilySearch.org
 •    National Library of Australia
      Newspapers Digitisation Program
 •    Library and Archives Canada
 •    Wikipedia




                                        19
measure: acceptance criteria

•  automatic quality checks
    •  is the digital object complete?
    •  is the digital object verifiable?
    •  is the digital object uncorrupted?
•  manual quality checks
    •  does the metadata meet accuracy
       specifications?
    •  does the text meet accuracy
       specifications?
    •  is the image quality satisfactory?

                                            20
measure: image quality


    “…images which are ultimately to be viewed by human
    beings, the only “correct” method of quantifying visual image
    quality is through subjective evaluation. in practice,
    however, subjective evaluation is usually too inconvenient,
    time-consuming and expensive…”

    “…best way to assess the quality of an image is to look at it
    because human eyes are the ultimate viewers of most
    images…”


Zhou Wang and Hamid R. Sheikh. Image Quality Assessment: From Error Visibility to Structural Similarity.
IEEE Transactions on Image Processing. April 2004

Zhou Wang, Alan Bovick, and Ligang Lu. Why is image quality assessment so difficult? IEEE Transactions
on Image Processing. April 2004
                                                                                                           21
measure: use


•  who is using the collection?
•  what is the collection being used for?
•  how many page views per day / week /
   month?
•  how long do visitors to the collection stay?
•  how many repeat visitors to the collection?




                                                  22
preserve


•    bit rot
•    format obsolescence
•    media obsolescence / decay
•    migration to new media or hardware
•    standards obsolescence




                                          23
preserve: bit rot

gradual decay of …
   •  storage media because of media quality
   •  storage media because of improper storage
   •  data due to random events (bit-flip,
   •  software due to interface changes
   •  software due to non-obvious or inadvertent
   configuration changes




                                                   24
preserve: media decay

a report by NIST and the Library of Congress says
   that
   •  virtually all CD-Rs tested indicated an
   estimated life expectancy beyond 15 years
   •  only 47 percent of recordable DVDs indicated
   an estimated life expectancy beyond 15 years,
   some had a life expectancy as short as 1.9 years
   •  in practice actual lifetimes may be considerably
   shorter



                                                         25
preserve: media obsolescence


  •    5 ¼” floppy disks
  •    8 track tapes
  •    3 ½” floppy disks
  •    ZIP drives
  •    CD-R, CD-RW, Blu-Ray
  •    microfilm




                               26
preserve: migration


•  file format changes
•  file name differences: case sensitive /
   insensitive
•  extended file attributes
•  file permissions
•  soft links / hard links




                                             27
preserve: standards obsolescence


   remember …
     •  WordPerfect ?
     •  MARC records ?
     •  Adobe Flash ?




                                   28
preservation
Open Archival Information System (OAIS)
            reference model




                                      29
the problem



              30
the problem


 the 2009 CHAOS Report (The Standish Group)
reports that of all software projects surveyed, 44%
   are “challenged”, 24% failed, and only 32%
                      succeeded




                                                      31
the problem



  Roger Sessions estimates that the worldwide cost
    of IT failure is USD $500 billion per month




Roger Sessions: CTO of ObjectWatch. He has written seven books including
Simple Architectures for Complex Enterprises and many articles. He is a
founding member of the Board of Directors of the International Association of
Software Architects.                                                            32
the problem


     in a recent survey of 1230 IT professionals
conducted by Embarcadero Technologies, 2 of the
  3 biggest project challenges cited by the IT pros
are “poor planning” and “poor or no requirements”




                                                      33
the problem


     in a March 2007 web poll conducted by the
Computing Technology Industry Association "nearly
   28 percent of the more than 1,000 respondents
singled out poor communications as the number one
              cause of project failure"




                                                    34
the problem

in a white paper written for Project Perfect by Taimour al
Neimat, he lists

   • poor planning
   • unclear goals and objectives
   • objectives changing during the project
   • unrealistic time or resource estimates
   • lack of executive support and user involvement
   • failure to communicate and act as a team
   • inappropriate skills

as primary causes for the failure of complex IT projects

                                                             35
the problem

a recent tender from an (anonymous) government agency

   •  project to convert ~ 170,000 text images to xml
   •  value of project ~ USD $180,000
   •  19 pages of definitions, governing law, proposal
   evaluation criteria, contractual conditions, instructions
   about tender response format, etc
   •  technical requirements description? < 1 page
   •  data acceptance criteria? “a high level of accuracy”

                                                               36
the problem

a recent program established by a prominent national
library
   •  digitize more than 20 million text pages
   •  high level image and xml requirements
   •  value of work awarded? > USD $5,000,000
   •  after award of work, technical requirements
   expand to 43+ pages from ~3 pages
   •  acceptance criteria? added as an afterthought
   and not well defined

                                                       37
the problem

typical tender evaluation criteria in priority order

   1. understanding of requirements
   2. reputation of service bureau
   3. price




                                                       38
39
the problem


requirements




               40
requirements
Library of Congress JPEG2000 profile




                                       41
the problem


requirements
 acceptance




               42
acceptance
National Library of Australia NDP




                                    43
the problem


 requirements
  acceptance
communication




                44
communication



                           “projects are about
                      communication, communication,
                          and communication”




Elenbass,	
  B.	
  (2000).	
  “Staging	
  a	
  Project:	
  Are	
  You	
  Se>ng	
  Your	
  Project	
  Up	
  for	
  Success?”.	
  	
  
Proceedings	
  of	
  the	
  Project	
  Management	
  InsItute	
  Annual	
  Seminars	
  &	
  Symposiums.	
  
                                                                                                                                  45
references


•  METS, MODS, ALTO, PRISM, etc :
   http://www.loc.gov/standards
•  OAIS : http://public.ccsds.org/publications/RefModel.aspx
•  NISO standards and guidelines :
   http://www.niso.org/publications/rp
•  good practice guides : http://www.ukoln.ac.uk
•  And many, many more




                                                               46
preguntas?
            Frederick Zarndt

      frederick@frederickzarndt.com



This work is licensed under the Creative Commons
        Attribution-ShareAlike (CC by SA)
    License. To view a copy of this license visit
 http://creativecommons.org/licenses/by-sa/3.0/

                                                    47

Weitere ähnliche Inhalte

Andere mochten auch

Digar - Digital archive of the National Library / Estonian Design Awards 2014
Digar - Digital archive of the National Library / Estonian Design Awards 2014Digar - Digital archive of the National Library / Estonian Design Awards 2014
Digar - Digital archive of the National Library / Estonian Design Awards 2014Designawards
 
RDFa-deployed Multimedia Metadata - Tutorial, Part 2
RDFa-deployed Multimedia Metadata - Tutorial, Part 2RDFa-deployed Multimedia Metadata - Tutorial, Part 2
RDFa-deployed Multimedia Metadata - Tutorial, Part 2Michael Hausenblas
 
Noblessner / Estonian Design Awards 2014
Noblessner / Estonian Design Awards 2014Noblessner / Estonian Design Awards 2014
Noblessner / Estonian Design Awards 2014Designawards
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projectsac2182
 

Andere mochten auch (7)

Digar - Digital archive of the National Library / Estonian Design Awards 2014
Digar - Digital archive of the National Library / Estonian Design Awards 2014Digar - Digital archive of the National Library / Estonian Design Awards 2014
Digar - Digital archive of the National Library / Estonian Design Awards 2014
 
RDFa-deployed Multimedia Metadata - Tutorial, Part 2
RDFa-deployed Multimedia Metadata - Tutorial, Part 2RDFa-deployed Multimedia Metadata - Tutorial, Part 2
RDFa-deployed Multimedia Metadata - Tutorial, Part 2
 
Noblessner / Estonian Design Awards 2014
Noblessner / Estonian Design Awards 2014Noblessner / Estonian Design Awards 2014
Noblessner / Estonian Design Awards 2014
 
Management of library and archive in digital era
Management of library and archive in digital eraManagement of library and archive in digital era
Management of library and archive in digital era
 
Making a Case for Photo Metadata
Making a Case for Photo MetadataMaking a Case for Photo Metadata
Making a Case for Photo Metadata
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
Metadata Workshop
Metadata WorkshopMetadata Workshop
Metadata Workshop
 

Ähnlich wie Digital projects best practices [xxxiii reunión nacional de archivos 201111]

"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
 
A Digitization Primer for Botanical and Horticultural Librarians
A Digitization Primer for Botanical and Horticultural LibrariansA Digitization Primer for Botanical and Horticultural Librarians
A Digitization Primer for Botanical and Horticultural LibrariansChris Freeland
 
Importance of Developers to HE in the UK
Importance of Developers to HE in the UKImportance of Developers to HE in the UK
Importance of Developers to HE in the UKPaul Walk
 
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIBICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIBDr. Haxel Consult
 
Pitts Library Digitization Initiatives
Pitts Library Digitization InitiativesPitts Library Digitization Initiatives
Pitts Library Digitization Initiativesjbweave
 
Moving an Archive from Tape to Disk: A Case-Study at ICPSR
Moving an Archive from Tape to Disk: A Case-Study at ICPSRMoving an Archive from Tape to Disk: A Case-Study at ICPSR
Moving an Archive from Tape to Disk: A Case-Study at ICPSRBryan Beecher
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital LibraryEd Fay
 
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoed
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoedGAMA - Europeana en de digitale ontsluiting van cultureel erfgoed
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoedEuropeanaLocal Project
 
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004Born Again: The Digitisation of the Anthropology Photographic Archive. 2004
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004Rose Holley
 
2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
 
Tooling for the JavaScript Era
Tooling for the JavaScript EraTooling for the JavaScript Era
Tooling for the JavaScript Eramartinlippert
 
Presentation on the Warsaw Conference on National Bibliographies August 2012
Presentation on the Warsaw Conference on National Bibliographies August 2012Presentation on the Warsaw Conference on National Bibliographies August 2012
Presentation on the Warsaw Conference on National Bibliographies August 2012nw13
 
Digital practice guidelines : the new generation presented by Scott Wajon
Digital practice guidelines : the new generation presented by Scott WajonDigital practice guidelines : the new generation presented by Scott Wajon
Digital practice guidelines : the new generation presented by Scott WajonPublicLibraryServices
 
Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopMarcus Hanwell
 
CONTENTdm Presentation 060711
CONTENTdm Presentation 060711CONTENTdm Presentation 060711
CONTENTdm Presentation 060711Buttes
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Using Omeka as a Gateway to Digital Projects
Using Omeka as a Gateway to Digital ProjectsUsing Omeka as a Gateway to Digital Projects
Using Omeka as a Gateway to Digital Projectslibrarianrafia
 
Introduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfIntroduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfJaberRad1
 
A community of developers stimulating innovation in uk higher education
A community of developers stimulating innovation in uk higher educationA community of developers stimulating innovation in uk higher education
A community of developers stimulating innovation in uk higher educationDevCSI
 

Ähnlich wie Digital projects best practices [xxxiii reunión nacional de archivos 201111] (20)

"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 
A Digitization Primer for Botanical and Horticultural Librarians
A Digitization Primer for Botanical and Horticultural LibrariansA Digitization Primer for Botanical and Horticultural Librarians
A Digitization Primer for Botanical and Horticultural Librarians
 
Importance of Developers to HE in the UK
Importance of Developers to HE in the UKImportance of Developers to HE in the UK
Importance of Developers to HE in the UK
 
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIBICIC 2013 Conference Proceedings Uwe Rosemann TIB
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
 
Pitts Library Digitization Initiatives
Pitts Library Digitization InitiativesPitts Library Digitization Initiatives
Pitts Library Digitization Initiatives
 
Moving an Archive from Tape to Disk: A Case-Study at ICPSR
Moving an Archive from Tape to Disk: A Case-Study at ICPSRMoving an Archive from Tape to Disk: A Case-Study at ICPSR
Moving an Archive from Tape to Disk: A Case-Study at ICPSR
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital Library
 
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoed
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoedGAMA - Europeana en de digitale ontsluiting van cultureel erfgoed
GAMA - Europeana en de digitale ontsluiting van cultureel erfgoed
 
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004Born Again: The Digitisation of the Anthropology Photographic Archive. 2004
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004
 
2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities
 
Tooling for the JavaScript Era
Tooling for the JavaScript EraTooling for the JavaScript Era
Tooling for the JavaScript Era
 
Presentation on the Warsaw Conference on National Bibliographies August 2012
Presentation on the Warsaw Conference on National Bibliographies August 2012Presentation on the Warsaw Conference on National Bibliographies August 2012
Presentation on the Warsaw Conference on National Bibliographies August 2012
 
Digital practice guidelines : the new generation presented by Scott Wajon
Digital practice guidelines : the new generation presented by Scott WajonDigital practice guidelines : the new generation presented by Scott Wajon
Digital practice guidelines : the new generation presented by Scott Wajon
 
Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the Desktop
 
CONTENTdm Presentation 060711
CONTENTdm Presentation 060711CONTENTdm Presentation 060711
CONTENTdm Presentation 060711
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Using Omeka as a Gateway to Digital Projects
Using Omeka as a Gateway to Digital ProjectsUsing Omeka as a Gateway to Digital Projects
Using Omeka as a Gateway to Digital Projects
 
Long Term Preservation Dale Peters
Long Term Preservation Dale PetersLong Term Preservation Dale Peters
Long Term Preservation Dale Peters
 
Introduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfIntroduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdf
 
A community of developers stimulating innovation in uk higher education
A community of developers stimulating innovation in uk higher educationA community of developers stimulating innovation in uk higher education
A community of developers stimulating innovation in uk higher education
 

Mehr von Frederick Zarndt

Digitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesDigitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesFrederick Zarndt
 
2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and PracticesFrederick Zarndt
 
e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017Frederick Zarndt
 
Project Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesProject Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesFrederick Zarndt
 
What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]Frederick Zarndt
 
Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Frederick Zarndt
 
Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Frederick Zarndt
 
What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]Frederick Zarndt
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Frederick Zarndt
 
20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]Frederick Zarndt
 
What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...Frederick Zarndt
 
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...Frederick Zarndt
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]Frederick Zarndt
 
20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...Frederick Zarndt
 
20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]Frederick Zarndt
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...Frederick Zarndt
 

Mehr von Frederick Zarndt (20)

Digitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesDigitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum Archives
 
2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices
 
e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017
 
Project Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesProject Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin Principles
 
What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]
 
Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...
 
Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]
 
What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital News
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital News
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
 
20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]
 
What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...
 
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]
 
20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...
 
20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...
 

Kürzlich hochgeladen

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 

Kürzlich hochgeladen (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Digital projects best practices [xxxiii reunión nacional de archivos 201111]

  • 1. digital projects best practices Frederick Zarndt frederick@frederickzarndt.com 1
  • 2. how’s and what’s of a digital archive / library •  what is a (good) digital library ? •  how should a digital library be designed ? •  how should a digital library be created ? •  how is a digital library measured ? •  how should a digital project be executed ? •  how should a digital library or a digital project be managed ? 2
  • 3. why a digital project? •  to enhance accessibility of the content in libraries and archives •  to increase collaboration and cooperation between libraries and archives around the world •  to promote research •  to provide opportunities for entrepreneurs 3
  • 4. digital projects overview •  collections: organized groups of digital objects 4
  • 6. digital projects overview •  collections: organized groups of digital objects •  objects: digital materials 6
  • 7. digital object issue from the California Digital Newspaper Collection 7
  • 8. digital projects overview •  collections: organized groups of digital objects •  objects: digital materials •  metadata: information about objects and collections 8
  • 9. digital object metadata metadata from the Singapore National Library 9
  • 10. project phases •  assess •  design •  implement •  measure •  preserve •  manage 10
  • 11. assess •  select the collection or content •  define the goals •  identify the users •  identify ownership and legal risks •  identify applicable standards •  evaluate capabilities 11
  • 12. design: standards •  METS XML for descriptive, structural, technical, and administrative metadata •  descriptive metadata •  Metadata Object Description Standard (MODS) selected metadata from MARC •  Dublin Core fundamental group of text elements for describing and cataloging •  technical metadata •  ALTO for OCR text •  PREMIS for digital preservation •  MIX for images 12
  • 13. design: standards •  image standards •  TIFF •  JPEG2000 •  JPEG •  ANSI/NISO Z39.87 •  file standards •  PDF, PDF/A, PDF/A-1b, PDF/A-1a •  TEI •  record standards •  ISAD(G) •  ERA 13
  • 14. design: access •  user community •  user interface (UI) •  search •  authentication and user management •  digital object presentation •  portability •  administration 14
  • 15. implement: pilot create requirements and acceptance criteria repeat { digitize (small) pilot batch test data against acceptance criteria adjust requirements and acceptance criteria } until (no more adjustments are necessary) digitize more data NB: pilot batches are VERY VERY important!! 15
  • 16. implement: in-house reasons for in-house production •  collection cannot be moved •  collection is badly organized •  digitization must be done slowly over a long period •  digitization is very simple 16
  • 17. implement: outsource reasons for outsourced production •  originals can’t be scanned in-house because… •  equipment is too expensive •  output data is beyond staff experience •  labor is too expensive •  large volume of work in a short time •  insufficient space, infrastructure, or staff 17
  • 18. implement: software •  commercial off-the-shelf (COTS) •  open source •  customized COTS •  customized open source •  custom in-house 18
  • 19. implement: crowd sourcing •  FamilySearch.org •  National Library of Australia Newspapers Digitisation Program •  Library and Archives Canada •  Wikipedia 19
  • 20. measure: acceptance criteria •  automatic quality checks •  is the digital object complete? •  is the digital object verifiable? •  is the digital object uncorrupted? •  manual quality checks •  does the metadata meet accuracy specifications? •  does the text meet accuracy specifications? •  is the image quality satisfactory? 20
  • 21. measure: image quality “…images which are ultimately to be viewed by human beings, the only “correct” method of quantifying visual image quality is through subjective evaluation. in practice, however, subjective evaluation is usually too inconvenient, time-consuming and expensive…” “…best way to assess the quality of an image is to look at it because human eyes are the ultimate viewers of most images…” Zhou Wang and Hamid R. Sheikh. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Transactions on Image Processing. April 2004 Zhou Wang, Alan Bovick, and Ligang Lu. Why is image quality assessment so difficult? IEEE Transactions on Image Processing. April 2004 21
  • 22. measure: use •  who is using the collection? •  what is the collection being used for? •  how many page views per day / week / month? •  how long do visitors to the collection stay? •  how many repeat visitors to the collection? 22
  • 23. preserve •  bit rot •  format obsolescence •  media obsolescence / decay •  migration to new media or hardware •  standards obsolescence 23
  • 24. preserve: bit rot gradual decay of … •  storage media because of media quality •  storage media because of improper storage •  data due to random events (bit-flip, •  software due to interface changes •  software due to non-obvious or inadvertent configuration changes 24
  • 25. preserve: media decay a report by NIST and the Library of Congress says that •  virtually all CD-Rs tested indicated an estimated life expectancy beyond 15 years •  only 47 percent of recordable DVDs indicated an estimated life expectancy beyond 15 years, some had a life expectancy as short as 1.9 years •  in practice actual lifetimes may be considerably shorter 25
  • 26. preserve: media obsolescence •  5 ¼” floppy disks •  8 track tapes •  3 ½” floppy disks •  ZIP drives •  CD-R, CD-RW, Blu-Ray •  microfilm 26
  • 27. preserve: migration •  file format changes •  file name differences: case sensitive / insensitive •  extended file attributes •  file permissions •  soft links / hard links 27
  • 28. preserve: standards obsolescence remember … •  WordPerfect ? •  MARC records ? •  Adobe Flash ? 28
  • 29. preservation Open Archival Information System (OAIS) reference model 29
  • 31. the problem the 2009 CHAOS Report (The Standish Group) reports that of all software projects surveyed, 44% are “challenged”, 24% failed, and only 32% succeeded 31
  • 32. the problem Roger Sessions estimates that the worldwide cost of IT failure is USD $500 billion per month Roger Sessions: CTO of ObjectWatch. He has written seven books including Simple Architectures for Complex Enterprises and many articles. He is a founding member of the Board of Directors of the International Association of Software Architects. 32
  • 33. the problem in a recent survey of 1230 IT professionals conducted by Embarcadero Technologies, 2 of the 3 biggest project challenges cited by the IT pros are “poor planning” and “poor or no requirements” 33
  • 34. the problem in a March 2007 web poll conducted by the Computing Technology Industry Association "nearly 28 percent of the more than 1,000 respondents singled out poor communications as the number one cause of project failure" 34
  • 35. the problem in a white paper written for Project Perfect by Taimour al Neimat, he lists • poor planning • unclear goals and objectives • objectives changing during the project • unrealistic time or resource estimates • lack of executive support and user involvement • failure to communicate and act as a team • inappropriate skills as primary causes for the failure of complex IT projects 35
  • 36. the problem a recent tender from an (anonymous) government agency •  project to convert ~ 170,000 text images to xml •  value of project ~ USD $180,000 •  19 pages of definitions, governing law, proposal evaluation criteria, contractual conditions, instructions about tender response format, etc •  technical requirements description? < 1 page •  data acceptance criteria? “a high level of accuracy” 36
  • 37. the problem a recent program established by a prominent national library •  digitize more than 20 million text pages •  high level image and xml requirements •  value of work awarded? > USD $5,000,000 •  after award of work, technical requirements expand to 43+ pages from ~3 pages •  acceptance criteria? added as an afterthought and not well defined 37
  • 38. the problem typical tender evaluation criteria in priority order 1. understanding of requirements 2. reputation of service bureau 3. price 38
  • 39. 39
  • 41. requirements Library of Congress JPEG2000 profile 41
  • 43. acceptance National Library of Australia NDP 43
  • 44. the problem requirements acceptance communication 44
  • 45. communication “projects are about communication, communication, and communication” Elenbass,  B.  (2000).  “Staging  a  Project:  Are  You  Se>ng  Your  Project  Up  for  Success?”.     Proceedings  of  the  Project  Management  InsItute  Annual  Seminars  &  Symposiums.   45
  • 46. references •  METS, MODS, ALTO, PRISM, etc : http://www.loc.gov/standards •  OAIS : http://public.ccsds.org/publications/RefModel.aspx •  NISO standards and guidelines : http://www.niso.org/publications/rp •  good practice guides : http://www.ukoln.ac.uk •  And many, many more 46
  • 47. preguntas? Frederick Zarndt frederick@frederickzarndt.com This work is licensed under the Creative Commons Attribution-ShareAlike (CC by SA) License. To view a copy of this license visit http://creativecommons.org/licenses/by-sa/3.0/ 47

Hinweis der Redaktion

  1. digital libraries are (relatively) new. best practices are still (rapidly) evolving. computing technologies, storage media, communication protocols, and standards are changing.iArchives story.
  2. this talk will probably not give you answers but rather a bunch of questions that you should ask as you undertake a digitization project. it will also give you a list of things to do before, during, after a digitization project, but not tell you how to do them.mention communications, requirements, acceptance criteria
  3. primarily to enhance access. access to a digital collection is not restricted to 1 user in 1 place. now it is possible for many users in many places to concurrently access the collection.may also be to preserve a deteriorating collection
  4. digital collections are similar to analog collections – books, newspapers, magazines, photographs, records – only in digital form. digital collections differ from analog collections in that they are more flexible.A digital collection consists of digital objects that are selected and organized to facilitate their discovery, access, and use.Digital objects, metadata, and the user interface together create the user experience of a collection.
  5. A digital object represents a discrete unit and is comprised of a digital file or files as well as descriptive metadata. Digital objects begin life in one of two ways: As a digitized file produced as a surrogate for materials that exist in analog format.As a &quot;born digital&quot; entity, with no analog counterpart.digital objects are either digital surrogates for analog objects or born digital objects scanned text, scanned photos born digital text, digital photos archived websites census records, land records
  6. metadata is similar to a card catalog but more flexible. richer descriptive and administrative metadata. may contain data about the digital objects themselves.metadata is structured information associated with an object for purposes of discovery, description, use, management, and preservation.
  7. phases implies separation / sequential. not necessarily sequential! more about this later…
  8. digital collection users may be different from analog collection users (genealogists)digital collection users may be different from analog collection users (genealogists)copyright holders are generally not happy about digital surrogates! know Turkish / EU copyright law! collaborate with copyright holder if possible.examples: Singapore, Australia, USA
  9. METS XML since version 1.1 ~2001. administered by LOC but developed by libraries around the world. METS editorial board. METS now at version 1.9METSsections:header, descriptive, administrative, files, structural map (heart of METSstructural links (between elements of structural map), behaviorMARC not often used with digital collections. replaced by MODS (administered by LOC) and / or Dublin Core (administered by OCLC)
  10. TIFF since 1986. last update (version 6.0) 1992. now under control of AdobeJPEG2000 since 2000. intended to supersede JPEG.PDF, PDF/A under control of Adobe. PDF/A subset of PDF version 1.4 and an ISO standard. latest PDF version is 1.7. In 2008 Adobe granted a royalty-free rights for all patents owned by Adobe that are necessary to make, use, sell and distribute PDF compliant implementationslook for open, community developed, tried and tested standard formats
  11. Crowdsourcing is a distributed problem-solving and production model. Problems are broadcast to an unknown group of solvers in the form of an open call for solutions. Users—also known as the crowd—typically form into online communities, and the crowd submits solutions. The crowd also sorts through the solutions, finding the best ones.FamilySearch documents are drawn primarily from a collection of 2.4 million microfilms made of historical documents from 110 countries.130,000+ volunteers from around the world. Records based data.Australia NDP 5,800,000 newspaper pages online. 50,000,000+ lines of newspaper text corrected, 2,000,000+ per month in 2011.Wikipedia founded 2001. 90,000 active contributors. Website ranks 6th in the world usage according to Alexa. Editions in 282 languages.
  12. Recognizing that MARC is no longer fit for the purpose, work with the library and other interested communities to specify and implement a carrier for bibliographic information that is capable of representing the full range of data of interest to libraries, and of facilitating the exchange of such data both within the library community and with related communities.
  13. NB: pilot batches are VERY VERY important!!