The Biodiversity Heritage Library: Corn-fed, Missouri Raised, Going Global. Martin R. Kalfatovic. Missouri Botanical Garden Staff Meeting. August 19, 2009. Saint Louis, MO.
The Biodiversity Heritage Library: Corn-fed, Missouri Raised, Going Global
1. The Biodiversity Heritage Library Corn-fed, Missouri Raised, Going Global Martin R. Kalfatovic Smithsonian Institution Libraries Missouri Botanical Garden August 19, 2009
2.
3.
4. American Museum of Natural History (New York) Academy of Natural Sciences Philadelphia California Academy of Sciences (San Francisco) Field Museum (Chicago) Natural History Museum (London) Smithsonian Institution Libraries (Washington) Missouri Botanical Garden (St. Louis) New York Botanical Garden (New York) Royal Botanic Garden, Kew Botany Libraries, Harvard University Ernst Mayr Library of the Museum of Comparative Zoology, Harvard University Marine Biological Laboratory / Woods Hole Oceanographic Institution
5. Scanning Partner Internet Archive Contributor University of Illinois, Urbana-Champaign Contributor California Digital Library Contributor Library of Congress
6. Initial grant from the MacArthur and Sloan Foundations (as part of the Encyclopedia of Life grant) Additional support from parent institutions Supplemental grants in place for specific development (e.g. Moore Foundation for Fedora) Additional grants being actively pursued by BHL and individual members
8. H Informatics Marine Biological Laboratory Missouri Botanical Garden Species Pages & Secretariat Smithsonian Education and Outreach Smithsonian & Harvard Synthesis Center Field Museum
9.
10.
11. How much is there: Core literature pre-1923: 100 million pages (?) All pre-1923: 120-150 million pages All literature: 280-320 million pages
12. More than: 36,000 volumes 15 million pages Only 290 million to go! Avg. monthly growth rate 1,500 volumes 600,000 pages See you in 2048! Now Online
13.
14.
15.
16.
17. Collections Coordinator on board in February 2009. Bianca Lipscomb, based at the Smithsonian, will coordinate material selection across the BHL and contributing partners Rough Selection
19. Selection Tools: Combined Serial list for selection of title to scan to avoid duplication of effort Monographic “de-duping” algorithm OCLC Collection Analysis
20.
21. Select Book ~Pull from Shelf Review Physically and Metadata Establish viability and create Wonderfetch Send to IA scanning center Book is scanned & QA Page images loaded Derivatives created Book returned to library Files harvested from IA portal to BHL Taxonomic Intelligence Added Available through BHL
30. Ingest of other collections 12,000,000 pages+ from other Internet Archive scanning partners
31. Biodiversity Heritage Library Permission Process Working with non-profit publishers for sharing with the BHL To digitize and mount works under copyright BHL must obtain permission from the copyright holders. Many biodiversity journals and monographs are published by non-profit institutions or learned societies whose mission is to promote research and learning. Some of these institutions have not sold their rights to commercial publishers and are open to sharing with the BHL.
32. Current Permission Agreement: The agreement is non-exclusive. The copyright holder can use the content for other purposes. It does not involve any transfer of copyright to the BHL or its member institutions. It “grants to the current and future member Participating Institutions of the BHL a world-wide, nonexclusive, perpetual, irrevocable, royalty-free, sublicenseable license to digitize and use the Titles (as identified above) in connection with the BHL, including the right to make reproductions in digital form, publicly display, and disseminate the Titles via the BHL and related websites, and create derivative works in digital form based on the Titles. The scope of this license is equivalent to an open source license, which permits others to use, reproduce, supplement, modify, create derivatives, and otherwise use the Titles, for any and all non-commercial purposes, with proper attribution to the Licensor as the source.”
33.
34. Process: There is room for some modification of the wording of the draft permission document. When it is finalized, BHL Director sign for the BHL and the Editor-in-chief or Chairperson of the society signs. The process is usually very smooth. >60 titles to date, many published in the US, some of which are published in Europe and Asia.
45. Built from a variety of new and existing sources Views available for varying levels of expertise from novice to expert Legacy literature a key component of the EOL species pages Encyclopedia of Life Species Pages
56. The Biodiversity Heritage Library Corn-fed, Missouri Raised, Going Global Martin R. Kalfatovic Smithsonian Institution Libraries Missouri Botanical Garden August 19, 2009
Editor's Notes
A Botanical Introduction to The Biodiversity Heritage Library Botany & Mycology 2009 Snowbird, Utah July 26, 2009 Martin R. Kalfatovic Smithsonian Institution Libraries
The Biodiversity Heritage Library
2003. Telluride. Encyclopedia of Life meeting February 2005. London. Library and Laboratory: the Marriage of Research, Data and Taxonomic Literature May 2005. Washington. Ground work for the Biodiversity Heritage Library June 2006. Washington. Organizational and Technical meeting August 2006. New York Botanical Garden. BHL Director’s Meeting. October 2006. St. Louis/San Francisco. Technical meetings February 2007. Museum of Comparative Zoology. Organizational meeting May 2007. Encyclopedia of Life and BHL Portal Launch. Washington DC.
American Museum of Natural History (New York) Academy of Natural Sciences Philadelphia California Academy of Sciences (San Francisco) Field Museum (Chicago) Natural History Museum (London) Smithsonian Institution Libraries (Washington) Missouri Botanical Garden (St. Louis) New York Botanical Garden (New York) Royal Botanic Garden, Kew Botany Libraries, Harvard University Ernst Mayr Library of the Museum of Comparative Zoology, Harvard University Marine Biological Laboratory / Woods Hole Oceanographic Institution
Scanning Partner Internet Archive Contributor University of Illinois, Urbana-Champaign Contributor California Digital Library Contributor Library of Congress
Initial grant from the MacArthur and Sloan Foundations (as part of the Encyclopedia of Life grant) Additional support from parent institutions Supplemental grants in place for specific development (e.g. Moore Foundation for Fedora) Additional grants being actively pursued by BHL and individual members
Structure of EOL
The Museum
Focus on how to get the books more … useful
BHL Focus: Literature Core literature pre-1923: 100 million pages (?) All pre-1923: 120-150 million pages All literature: 280-320 million pages
Marine Biological Laboratory/WHOI Marine monographs General Science Museum of Comparative Zoology MCZ publications Herpetology monographs and serials Ichthyology monographs and serials
University of Illinois Fieldiana Natural history of Illinois American Museum of Natural History AMNH publications Ornithology Natural History Museum NHM publications Major natural history general serials
Botany Collections Missouri Botanical Garden, New York Botanical Garden, Harvard Botany Libraries, and Royal Botanic Garden, Kew will cooperatively develop a methodology for botanical publications and botanical collections from other BHL members will fill in gaps
Collections Coordinator on board in February 2009. Bianca Lipscomb, based at the Smithsonian, will coordinate material selection across the BHL and contributing partners
How to make THIS into 0’s and 1’s
Selection Tools: Combined Serial list for selection of title to scan to avoid duplication of effort Monographic “de-duping” algorithm OCLC Collection Analysis
Mass Scanning Workflow Bid Lists Serials Management Pick Lists Packing Lists Monographic Management Local data flow WonderFetch (tm) Return of data Return of material Billing
BHL Scanning
Internet Archive 501(c)(3) organization Dedicated to “Universal Access to Human Knowledge” Founder of the Open Content Alliance Provides: Mass scanning Archival storage of files Image processing Technology development
Scribe Machine Single Scribe Machine Custom built by the Internet Archive Human operated 3,500 page per shift per day
Internet Archive/BHL scanning centers Northeast Regional Scanning Center 10 Scribe machines MBL/WHOI Harvard New York Public Library 10 Scribe machines AMNH NYBG
Internet Archive/BHL scanning centers University of Illinois 2 Scribe machines Natural History Museum, London 1 Scribe machine Missouri Botanical Garden Non-Scribe operation
Internet Archive/BHL scanning centers Washington, DC 1 Scribe machine at Smithsonian Libraries 10 Scribe facility at Library of Congress with Fedlink
Acquiring Other Content
Missouri Botanical Garden Library continuing in-house scanning process Other BHL members also have non-Internet Archive scanning operations Ingest of other interested libraries
Missouri Botanical Garden Library continuing in-house scanning process Other BHL members also have non-Internet Archive scanning operations Ingest of other interested libraries
Biodiversity Heritage Library Permission Process Working with non-profit publishers for sharing with the BHL To digitize and mount works under copyright BHL must obtain permission from the copyright holders. Many biodiversity journals and monographs are published by non-profit institutions or learned societies whose mission is to promote research and learning. Some of these institutions have not sold their rights to commercial publishers and are open to sharing with the BHL.
Current Permission Agreement: The agreement is non-exclusive. The copyright holder can use the content for other purposes. It does not involve any transfer of copyright to the BHL or its member institutions. It “grants to the current and future member Participating Institutions of the BHL a world-wide, nonexclusive, perpetual, irrevocable, royalty-free, sublicenseable license to digitize and use the Titles (as identified above) in connection with the BHL, including the right to make reproductions in digital form, publicly display, and disseminate the Titles via the BHL and related websites, and create derivative works in digital form based on the Titles. The scope of this license is equivalent to an open source license, which permits others to use, reproduce, supplement, modify, create derivatives, and otherwise use the Titles, for any and all non-commercial purposes, with proper attribution to the Licensor as the source.”
BHL License Agreement
Process: There is room for some modification of the wording of the draft permission document. When it is finalized, BHL Director sign for the BHL and the Editor-in-chief or Chairperson of the society signs. The process is usually very smooth. >60 titles to date, many published in the US, some of which are published in Europe and Asia.
Permissions List
Example permissions title
Title in the BHL Portal
But what’s this all mean to me!?
But what’s this all mean to me!?
But since our model prototype was MOBOT’s Botanicus …
BHL Portal
CiteBank
Dataflow
Bibliographic Metadata sharing between BHL and EOL/LifeDesks/Scratchpads
Structure of EOL Built from a variety of new and existing sources Views available for varying levels of expertise from novice to expert Legacy literature a key component of the EOL species pages
BHL 2.0
The world
The world
Species going extinct as we talk
Species going extinct as we talk
Species going extinct as we talk
Bluescreen
BHL Portal http://www.biodiversitylibrary.org Cite http://cite.biodiversitylibrary.org Internet Archive http://www.archive.org Ubio http://www.ubio.org
Chris Freeland Suzanne Pilsk Tom Garnett Cathy Norton David Remsen Henning Scholz
A Botanical Introduction to The Biodiversity Heritage Library Botany & Mycology 2009 Snowbird, Utah July 26, 2009 Martin R. Kalfatovic Smithsonian Institution Libraries