Collection Assessment in a Collaborative
Environment: BHL
Connie Rinaldo, Bianca Crowley,
Trish Rose-Sandler & William Ulate
The BHL is…
• A consortium of 15 natural history libraries, botanical libraries and
research institutions
• An open access, full-text digital library for legacy
biodiversity literature.
• An open data repository of taxonomic names and
bibliographic information
• An expanding global effort
• Mission: The Biodiversity Heritage Library improves &
makes more efficient the methodology of research in
biodiversity studies by collaboratively making biodiversity
literature openly available to the world as part of a
global biodiversity community.
BHL Goals
• Goal 1: Relevant Content: Build & maintain the BHL as the largest reliable,
reputable, & responsive repository of biodiversity literature & archival
materials.
• Goal 2: Tools & Services: Develop services & tools which facilitate
discovery & improve research efficiency of BHL content.
• Goal 3: User Engagement: Increase global awareness about the BHL
through outreach, learning & education, & branding through engagement
& collaboration with existing & new user communities.
• Goal 4: Membership & Partnerships: Grow BHL consortia membership &
partnerships while fostering cross-institutional collaboration that
continues to serve as a model for digital library development
• Goal 5: Financial Sustainability: Ensure sustainability & relevance by being
flexible, adaptable, & financially sound while the content & services
remain openly & freely available.
Core BHL Member Institutions
Global Partners
http://biodiversitylibrary.org

Now online
64,188 titles

120,461 volumes
42 million+ pages
BHL Overview
• New user interface launched in March
• Search by title, author, article, subjects and scientific
names
• Various download options, including high resolution
• Taxonomic name finding algorithm
• Machine-to-machine services
• Full-text search being tested
Core Principles
• Open access
• Open data
• Deconstruct the silo and deliver content where users are
already working
– Via other biodiversity websites and taxonomic
resources
– Via social media platforms: blog, Flickr, Facebook,
Twitter, Pinterest, etc.
• Involve users in collection and technical development
activities
Scanning Locally, Coordinating Globally

[Diagram: scanning split across institutions (Vols. 1-5; Vols. 6, 8, 10; Vols. 7, 9, 11-21), coordinated through issue tracking software]
Beyond the Silo: Open Data
• Stable URLs
• Open Data Policy
• APIs (Application Programming Interfaces)
• Data Exports
• OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting)
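The OAI-PMH channel listed above can be sketched as a plain HTTP request plus XML parsing. This is a minimal sketch, not BHL's documented client: the base URL is an assumption, the sample response is hand-written for illustration, and `build_list_records_url` and `parse_titles` are hypothetical helper names. The `verb`, `metadataPrefix` and `resumptionToken` parameters come from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Assumed endpoint; check BHL's developer documentation for the real one.
BASE_URL = "https://www.biodiversitylibrary.org/oai"

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def build_list_records_url(base_url, resumption_token=None):
    """Build a ListRecords request URL for Dublin Core metadata."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return base_url + "?" + urlencode(params)


def parse_titles(xml_text):
    """Return (titles, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    titles = [el.text for el in root.iterfind(".//dc:title", NS)]
    token_el = root.find(".//oai:resumptionToken", NS)
    token = token_el.text if token_el is not None and token_el.text else None
    return titles, token


# Hand-written sample response, for illustration only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example title</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-123</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

titles, token = parse_titles(SAMPLE)
print(build_list_records_url(BASE_URL))
print(titles, token)
```

A harvester would fetch the first URL, parse the response, and repeat with the returned token until none is left.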
User Feedback is Critical
General feedback form

http://biodiversitylibrary.org/contact

Scan request form
Impact
• “BHL came to the rescue when a planned trip to work in the Mertz Library at The New
York Botanical Garden had to be cancelled due to Hurricane Sandy. Thanks to the online
resources available through BHL I was able to source most of the key works I needed,
with their supporting bibliographic information. Further use of BHL occurred when
building work at the Linnean Society of London limited access to some of the book I had
been able to use from that collection.”

• “I would like thank you all very much for invaluable work and support you do. I just got a
pdf-file from more than century old (1893) journal paper (regional naturalist society
paper, published in Finland), to get copy I should take 500 mile drive to our university
library. Now I am got it fastly in high-quality pdf-copy. Cordial thanks and all success in
continuing your highly valuable mission.” [conservation biologist from Estonia]

• “You are a wonderful resource. I maintain a Website that describes the plant genus
Opuntia (prickly pear cacti). There is no way I could maintain such a site without access to
literature from 100-200 years ago. Most of the cactus species were discovered long ago; I
find it invaluable to put up PDF files to document each species in the literature as I
document them photographically. I am a botanist, but I work in the pharmaceutical field
(not so many botanical jobs out there). Your library makes it possible for me to continue
working with plants in a meaningful and scientific manner.”
[Diagram labels: Biodiversity Literature, BHL, EOL, Scientific Names, Researchers, Publications, Datasets, Collecting Events, Specimens, Localities, Field Notes, Phylogenies, Nomenclators, Name & Species Checklists, Indexes, Content Aggregators]
Questions about BHL Content
• How many books in BHL are there about....?
• How can we identify areas of weakness in BHL
in order to prioritize what materials to scan
next?
• Rod Page has one suggestion:
http://iphylo.blogspot.com/2013/10/whichtaxonomic-journals-should-be.html
Questions about BHL Content
• What are scalable solutions to content
analysis?
• Can we provide creative & meaningful
visualizations?
Why do we care about taxonomic
names?
• Scientists use taxonomic names to organize
their research
• Biodiversity literature breaks down by
discipline & by specific taxon
Extracted Scientific Names
What is “Taxonomic Intelligence”?
• Global Names Recognition & Discovery tool
– Locate, verify, record scientific names from each
page
– Text is uncorrected OCR
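The locate, verify and record steps above can be illustrated with a toy example. This is a minimal sketch and not the Global Names algorithm: the regular expression is a deliberately naive heuristic, the reference list is a tiny hand-made assumption, and `find_names` and `record_names` are hypothetical names. It also shows why uncorrected OCR matters: a misread name fails exact verification.

```python
import re

# Candidate binomial: capitalized genus followed by a lowercase epithet.
# A naive heuristic for illustration, not the Global Names method.
BINOMIAL = re.compile(r"\b([A-Z][a-z]{2,})\s+([a-z]{3,})\b")

# Tiny illustrative reference list; a real system would query a name index.
KNOWN_NAMES = {"Pinus banksiana", "Pinus strobus"}


def find_names(page_text, known_names=KNOWN_NAMES):
    """Locate candidate binomials and keep those present in the reference list."""
    verified = []
    for genus, epithet in BINOMIAL.findall(page_text):
        name = f"{genus} {epithet}"
        if name in known_names and name not in verified:
            verified.append(name)
    return verified


def record_names(pages):
    """Map each page identifier to the verified names found on that page."""
    return {page_id: find_names(text) for page_id, text in pages.items()}


pages = {
    "p1": "The Jack pine, Pinus banksiana, grows on sandy soils.",
    "p2": "Uncorrected OCR may read Pinus banlcsiana instead.",
}
print(record_names(pages))  # {'p1': ['Pinus banksiana'], 'p2': []}
```

Tolerating OCR errors would need fuzzy matching against the reference list, which this sketch leaves out.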
Overview of available BHL (meta) data
http://biodivlib.wikispaces.com/Data+Exports
• Title metadata: contributed from MARC records of
hundreds of library catalogs (BHL consortium libraries &
non-BHL IA contributors)
• Volume/item metadata: provides information about the
actual objects & pieces digitized
• Subject
• Creator/author data
• Segment/part/“article” metadata (separate table for
segment/part creators?)
• Page metadata which includes our algorithmically
identified scientific name data
• OCR text available at the item/volume level but not overall
for corpus of BHL
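A question such as "How many books in BHL are there about...?" can be approximated by joining the page-level name data to the item and title tables. The sketch below assumes tab-delimited exports; the column names (`NameConfirmed`, `PageID`, `ItemID`, `TitleID`) and the sample rows are assumptions made for illustration, so the real export schema should be checked before use.

```python
import csv
import io

# Invented sample rows. Column names are assumptions, not the real export schema.
PAGE_NAMES_TSV = "\n".join([
    "NameConfirmed\tPageID\tItemID",
    "Pinus banksiana\t1\t10",
    "Pinus banksiana\t2\t10",
    "Pinus banksiana\t7\t11",
    "Pinus strobus\t9\t12",
])
ITEMS_TSV = "\n".join([
    "ItemID\tTitleID",
    "10\t100",
    "11\t100",
    "12\t101",
])


def read_tsv(text):
    """Parse tab-delimited text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def summarize_name(name, page_rows, item_rows):
    """Count pages, items and titles on which a scientific name was found."""
    item_to_title = {row["ItemID"]: row["TitleID"] for row in item_rows}
    hits = [row for row in page_rows if row["NameConfirmed"] == name]
    items = {row["ItemID"] for row in hits}
    titles = {item_to_title[i] for i in items if i in item_to_title}
    return {"pages": len(hits), "items": len(items), "titles": len(titles)}


summary = summarize_name(
    "Pinus banksiana", read_tsv(PAGE_NAMES_TSV), read_tsv(ITEMS_TSV)
)
print(summary)  # {'pages': 3, 'items': 2, 'titles': 1}
```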
Data Exports
Visualization of BHL Data for Pinus banksiana
Source Data Sample
Sample BHL & Nomenclatural Data
• Google Refine reconciled list of BHL subject keywords
• List of vetted BHL subject targets from collection
development policy
• Taxonomic name data set for trees of North America
(link out)
• http://www.fs.fed.us/database/feis/plants/tree/index.html
• http://www.treesofnorthamerica.net/
• Subject terms associated with BHL titles where Pinus
banksiana occurs
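The last item, subject terms for the titles where a name occurs, is in effect a frequency count. The sketch below uses invented title identifiers and subject terms; `subject_profile` is a hypothetical helper, and the input is assumed to be the set of titles already identified from the name data.

```python
from collections import Counter

# Invented data: title identifier -> subject terms from the title metadata.
title_subjects = {
    "100": ["Forests and forestry", "Botany"],
    "101": ["Botany", "Trees"],
    "102": ["Birds"],
}

# Assumed to come from the page-level name data for one scientific name.
titles_with_name = {"100", "101"}


def subject_profile(title_ids, subjects_by_title):
    """Count how many of the given titles carry each subject term."""
    counts = Counter()
    for title_id in title_ids:
        # set() so a term repeated within one title is counted once.
        counts.update(set(subjects_by_title.get(title_id, [])))
    return counts


profile = subject_profile(titles_with_name, title_subjects)

# Sort by descending count, then alphabetically, for a stable listing.
ranked = sorted(profile.items(), key=lambda kv: (-kv[1], kv[0]))
print(ranked)
```

A ranked list like this is the kind of input a word cloud or bar chart of subjects would take.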
Other Tools & Process
• Bibliographies (discipline & more)
• Index Animalium: identifies first appearance of 400,000 animals
from 1758-1850
• Researcher supplied specific taxon bibliographies
• Zoological Record: Taxonomic references back to 1864.
• Taxonomic Literature II: a selective guide to botanical publications
with dates, commentaries and scientific types
• Compare universe of biodiversity literature to BHL
• Unknown dataset for full universe
• Compared BHL member collections to BHL content for gap-filling
before content expanded (lists automated but gap identification
manual)
• REST especies: a way to collate species metadata? http://dopaservices.jrc.ec.europa.eu/services/especies/
• DOPA Explorer http://ehabitat-wps.jrc.ec.europa.eu/dopasimple/
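Comparing a reference bibliography against BHL holdings, as described in the list above, reduces to a set difference once titles are normalized. This is a minimal sketch with invented titles: `normalize` and `find_gaps` are hypothetical helpers, and exact matching on normalized strings is an assumption that real reconciliation would likely replace with identifiers or fuzzy matching.

```python
import re
import unicodedata


def normalize(title):
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", title)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(text.split())


def find_gaps(reference_titles, held_titles):
    """Return reference titles with no normalized match among held titles."""
    held = {normalize(t) for t in held_titles}
    return [t for t in reference_titles if normalize(t) not in held]


# Invented titles for illustration only.
reference = [
    "Journal of Example Botany.",
    "Bulletin of the Sample Society",
    "Revista de Ejemplo Zoológico",
]
held = ["journal of example botany", "Revista de ejemplo zoologico"]

gaps = find_gaps(reference, held)
print(gaps)  # ['Bulletin of the Sample Society']
```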
SAMPLE VISUALIZATIONS
Core & Supporting Keywords for BHL Collections
Wordle for BHL Content
http://public.tableausoftware.com/views/BHLViz/DigitizedSubjects
Visualization Opportunities
• JournalMap (geo tagging scientific
literature) http://www.journalmap.org/
• Visualizing article performance http://bit.ly/1c4TJfn
• Better Life Index
http://www.oecd.org/statistics/datalab/bli.htm
• Altmetric: http://www.altmetric.com/
• Tableau http://www.tableausoftware.com/public/
• Worth it:
http://www.wired.com/wiredscience/2013/11/wireddata-life-martin-krzywinski/?viewall=true
Taxon Data Manipulation
Opportunities
• Euler Project: Reasoning with Taxonomies:
http://euler.cs.ucdavis.edu/
• REST & Taxonomy:
https://drupal.org/project/taxonomy_api
SUMMARY
• Metadata reconciliation
• Gap analysis
• Visualizations
• All automated!
Thank you for your
Help!
http://biodiversitylibrary.org
Connie Rinaldo
crinaldo@oeb.harvard.edu

 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 

Collection assessment in a collaborative environment: Biodiversity Heritage Library

  • 1. Collection Assessment in a Collaborative Environment: BHL Connie Rinaldo, Bianca Crowley, Trish Rose Sandler & William Ulate
  • 2. The BHL is… • A consortium of 15 natural history, botanical libraries and research institutions • An open access, full-text digital library for legacy biodiversity literature. • An open data repository of taxonomic names and bibliographic information • An expanding global effort • Mission: The Biodiversity Heritage Library improves & makes more efficient the methodology of research in biodiversity studies by collaboratively making biodiversity literature openly available to the world as part of a global biodiversity community.
  • 3. BHL Goals • Goal 1: Relevant Content: Build & maintain the BHL as the largest reliable, reputable, & responsive repository of biodiversity literature & archival materials. • Goal 2: Tools & Services: Develop services & tools which facilitate discovery & improve research efficiency of BHL content. • Goal 3: User Engagement: Increase global awareness about the BHL through outreach, learning & education, & branding through engagement & collaboration with existing & new user communities. • Goal 4: Membership & Partnerships: Grow BHL consortia membership & partnerships while fostering cross-institutional collaboration that continues to serve as a model for digital library development • Goal 5: Financial Sustainability: Ensure sustainability & relevance by being flexible, adaptable, & financially sound while the content & services remain openly & freely available.
  • 4. Core BHL Member Institutions
  • 7. BHL Overview • New user interface launched in March • Search by title, author, article, subjects and scientific names • Various download options, including high resolution • Taxonomic name finding algorithm • Machine-to-machine services • Full-text search being tested
  • 8. Core Principles • Open access • Open data • Deconstruct the silo and deliver content where users are already working – Via other biodiversity websites and taxonomic resources – Via social media platforms: blog, Flickr, Facebook, Twitter, Pinterest, etc. • Involve users in collection and technical development activities
  • 9. Scanning Locally, Coordinating Globally [diagram: Vols. 1-5; Vols. 6, 8, 10; Vols. 7, 9, 11-21; Issue Tracking Software]
  • 10. Beyond the Silo: Open Data • Stable URLs • Open Data Policy • APIs (Application Programming Interfaces) • Data Exports • OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting)
  • 11. User Feedback is Critical • General feedback form: http://biodiversitylibrary.org/contact • Scan request form
  • 12. Impact • “BHL came to the rescue when a planned trip to work in the Mertz Library at The New York Botanical Garden had to be cancelled due to Hurricane Sandy. Thanks to the online resources available through BHL I was able to source most of the key works I needed, with their supporting bibliographic information. Further use of BHL occurred when building work at the Linnean Society of London limited access to some of the book I had been able to use from that collection.” • “I would like thank you all very much for invaluable work and support you do. I just got a pdf-file from more than century old (1893) journal paper (regional naturalist society paper, published in Finland), to get copy I should take 500 mile drive to our university library. Now I am got it fastly in high-quality pdf-copy. Cordial thanks and all success in continuing your highly valuable mission.” [conservation biologist from Estonia] • “You are a wonderful resource. I maintain a Website that describes the plant genus Opuntia (prickly pear cacti). There is no way I could maintain such a site without access to literature from 100-200 years ago. Most of the cactus species were discovered long ago; I find it invaluable to put up PDF files to document each species in the literature as I document them photographically. I am a botanist, but I work in the pharmaceutical field (not so many botanical jobs out there). Your library makes it possible for me to continue working with plants in a meaningful and scientific manner.”
  • 14. Questions about BHL Content • How many books in BHL are there about ...? • How can we identify areas of weakness in BHL in order to prioritize what materials to scan next? • Rod Page has one suggestion: http://iphylo.blogspot.com/2013/10/which-taxonomic-journals-should-be.html
  • 16. Questions about BHL Content • What are scalable solutions to content analysis? • Can we provide creative & meaningful visualizations?
  • 17. Why do we care about taxonomic names? • Scientists use taxonomic names to organize their research • Biodiversity literature breaks down by discipline & by specific taxon
  • 19. What is “Taxonomic Intelligence”? • Global Names Recognition & Discovery tool – Locate, verify, record scientific names from each page – Text is uncorrected OCR
  • 20. Overview of available BHL (meta)data http://biodivlib.wikispaces.com/Data+Exports • Title metadata: contributed from MARC records of hundreds of library catalogs (BHL consortium libraries & non-BHL IA contributors) • Volume/item metadata: provides information about the actual objects & pieces digitized • Subject • Creator/author data • Segment/part/“article” metadata (separate table for segment/part creators?) • Page metadata, which includes our algorithmically identified scientific name data • OCR text available at the item/volume level but not for the BHL corpus as a whole
  • 23. Visualization of BHL Data for Pinus banksiana
  • 25. Sample BHL & Nomenclatural Data • Google Refine reconciled list of BHL subject keywords • List of vetted BHL subject targets from collection development policy • Taxonomic name data set for trees of North America (link out) • http://www.fs.fed.us/database/feis/plants/tree/index.html • http://www.treesofnorthamerica.net/ • Subject terms associated with BHL titles where Pinus banksiana occurs
  • 26. Other Tools & Process • Bibliographies (discipline & more) • Index Animalium: identifies first appearance of 400,000 animals from 1758-1850 • Researcher-supplied specific taxon bibliographies • Zoological Record: Taxonomic references back to 1864. • Taxonomic Literature II: a selective guide to botanical publications with dates, commentaries and scientific types • Compare universe of biodiversity literature to BHL • Unknown dataset for full universe • Compared BHL member collections to BHL content for gap-filling before content expanded (lists automated but gap identification manual) • REST especies: a way to collate species metadata? http://dopaservices.jrc.ec.europa.eu/services/especies/ • DOPA Explorer http://ehabitat-wps.jrc.ec.europa.eu/dopasimple/
  • 28. Core & Supporting Keywords for BHL Collections
  • 29. Wordle for BHL Content
  • 31. Visualization Opportunities • JournalMap (geo tagging scientific literature) http://www.journalmap.org/ • Visualizing article performance http://bit.ly/1c4TJfn • Better Life Index http://www.oecd.org/statistics/datalab/bli.htm • Altmetric: http://www.altmetric.com/ • Tableau http://www.tableausoftware.com/public/ • Worth it: http://www.wired.com/wiredscience/2013/11/wireddata-life-martin-krzywinski/?viewall=true
  • 33. Taxon Data Manipulation Opportunities • Euler Project: Reasoning with Taxonomies: http://euler.cs.ucdavis.edu/ • REST & Taxonomy: https://drupal.org/project/taxonomy_api
  • 35. Thank you for your Help! http://biodiversitylibrary.org Connie Rinaldo crinaldo@oeb.harvard.edu
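
Slide 19 summarizes name finding as locate, verify, record, run against uncorrected OCR text. The Python sketch below illustrates that three-step idea on a toy scale. It is not the Global Names Recognition & Discovery tool: the regular expression, the `KNOWN_NAMES` reference list, and the function names `find_names` and `index_pages` are all assumptions made for illustration.

```python
import re

# Hypothetical reference list used for the "verify" step.
# A real service checks candidates against large name indexes.
KNOWN_NAMES = {"Pinus banksiana", "Pinus divaricata"}

# "Locate" step: a capitalized genus followed by a lowercase epithet.
# This deliberately over-matches (e.g. "Jack pine"); verification filters it.
CANDIDATE = re.compile(r"\b([A-Z][a-z]+) ([a-z]{3,})\b")


def find_names(page_text, known_names=KNOWN_NAMES):
    """Return verified names in order of first appearance on the page."""
    found = []
    for match in CANDIDATE.finditer(page_text):
        name = f"{match.group(1)} {match.group(2)}"
        if name in known_names and name not in found:
            found.append(name)
    return found


def index_pages(pages):
    """'Record' step: map each page number to the names verified on it."""
    return {number: find_names(text) for number, text in pages.items()}
```

Because verification is an exact match in this sketch, a single OCR misreading (for example "banksiaua" for "banksiana") makes the name disappear from the index, which is the limitation the slide flags for uncorrected OCR.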

Editor's Notes

  1. GOALS:
  2. A free & open access digital library for biodiversity literature and primary source materials (field books). A consortium of 15 libraries working together to run a virtual library branch. A collection of content from the 15-member BHL consortium and other Internet Archive contributors. Anyone is free to access & download BHL materials.
  3. SEARCH: Subject searching in BHL is available via the advanced search (http://biodiversitylibrary.org/advsearch). The "subjects" tab searches the table of subject keywords we have in BHL, derived from the LCSH. It does NOT search titles or scientific names. If you do a basic keyword search via the homepage for a subject term, say "Birds", you will pull hits across all titles, articles, authors, subjects and scientific names, broken out by tabs. Notice that the subjects tab shows all search results where "birds" is a part of the subject keyword string, such as "Birds of prey" or "Cage birds".
  4. COLLABORATION!
  5. Add images… Also add DOIs?
  6. User feedback is key; we rely on the many eyes of the crowd to help us direct our curation activities to the content people are actually using. Users can let us know if they find a problem with something in our collection through our general feedback form, and can request that something be scanned through our scanning request form.
  7. The trees of North America, entomology, or bears: metadata, right? BUT LCSH doesn’t adequately describe the biodiversity literature. Scientists organize around scientific names, articles, and parts of articles (species descriptions). Rod Page did this: constructed a table listing all the journals in BioNames that have an ISSN, ordered by the number of articles in BioNames (i.e., mostly articles that publish new names). In his words: "The full table is here, I've reproduced part of it below (limited to those journals with at least 500 articles in BioNames)."
  8. From Rod Page, who constructed a table listing all the journals in BioNames that have an ISSN, ordered by the number of articles in BioNames (i.e., mostly articles that publish new names): "The full table is here, I've reproduced part of it below (limited to those journals with at least 500 articles in BioNames)."
  9. The trees of North America, entomology, or bears: metadata, right? BUT LCSH doesn’t adequately describe the biodiversity literature. Scientists organize around scientific names, articles, and parts of articles (species descriptions).
  10. The Biodiversity Heritage Library uses taxonomic intelligence tools, including Global Names Recognition and Discovery (GNRD) developed by Global Names Architecture, to locate, verify, and record scientific names located within the text of each digitized page. Note: The text used for this identification is uncorrected OCR, so it may not include all results expected or visible on the page. This names-based index is an incredibly valuable tool for organismal research, and is easily incorporated into external web sites through two different methods of access.
  11. Bold = focus for this session: what we have provided on Library Box. Names are extracted from OCR text. Each dataset has its own complexity: - taxonomic names a. have a hierarchy (the second-to-last name is at an infraspecific taxonomic level: forma), b. change over time (the 4th one in the list, Pinus divaricata, is a synonym), c. and have all sorts of exceptions to the rules (the last one, Pinus × murraybanksiana, is a hybrid); - common names a. are subjective and biased towards organisms of well-known groups only, b. are dependent on language, region and time; - subjects are a. language dependent, b. hierarchical, c. at title level.
  12. These have all been provided on Library Box, in addition to some more specific sets. We also have MODS, EndNote and BibTeX files for titles, items/volumes and parts.
  13. A visualization of BHL data for Pinus banksiana as it appears on page 140 of v. 78 of The Canadian Field-Naturalist. How do we reconcile all of this to find out what content covers our question? How can we map the more specific terms to LCSH/call numbers when we have limited resources? We need to automate as much as possible. We want consistent language. The BHL uses LC for the volumes but also pulls out scientific names. How do we get them incorporated into the consistent language of LC in an automated way that can scale? We want to know what we have so we can compare it to an (as yet) unidentified universe (bibliographies, Index Animalium, TL-2). Each dataset has its own complexity: - taxonomic names a. have a hierarchy (the second-to-last name is at an infraspecific taxonomic level: forma), b. change over time (the 4th one in the list, Pinus divaricata, is a synonym), c. and have all sorts of exceptions to the rules (the last one, Pinus × murraybanksiana, is a hybrid); - common names a. are subjective and biased towards organisms of well-known groups only, b. are dependent on language, region and time; - subjects are a. language dependent, b. hierarchical, c. at title level.
  14. To show that name data come from multiple sources
  15. BOLD means in Library Box. Google Refine: what they are and implications for collection analysis. These are links to
  16. Index Animalium, TL-2. Literature breaks down by discipline and even by specific taxon. Scientific names and bibliographic structure are different, and we are trying to merge the two: we are looking at scientific data next to library data but have to make sense of the merger in the library world (see the collection development chart). Scientists work at the level of the article, name, or article part; we work at the level of the volume. Taxonomic Literature: A selective guide to botanical publications and collections with dates, commentaries and types (Stafleu et al.), known as TL-2, is the premier publication of the International Association for Plant Taxonomy (IAPT). It is a 15-volume guide to the literature of systematic botany published between 1753 and 1940, organized by author, with numbered entries for each author's publications. How can we map back to LCSH/call numbers when we have limited resources? We need to automate as much as possible. We want consistent language. The BHL uses LC for the volumes but also pulls out scientific names. How do we get them incorporated into the consistent language of LC in an automated way that can scale? We want to know what we have so we can compare it to an (as yet) unidentified universe (bibliographies, Index Animalium, TL-2). Index Animalium is Sherborn’s life’s work: a 9,000-page bibliography identifying the first book in which over 400,000 organisms appeared; it covers 1758-1850. All of this is a LENGTHY process and needs more automation! Zoological Record is the world's oldest continuing database of animal biology. It is considered the world's leading taxonomic reference and, with coverage back to 1864, has long acted as the world's unofficial register of animal names. Early on we compared the universe of what is in the big libraries to what was in BHL, and that allowed us to fill gaps: https://bhl.wikispaces.com/BHL+Priority+Titles
  17. These are keywords that we use to describe how we collect for BHL. They are adapted from LC but are not necessarily actual subject headings. We modified some terms to make the language clear and to bring in some of the scientific naming conventions (Ornithology instead of birds). This was meant to merge appropriate parts of the library and scientific worlds. This is the consistent language against which we want to compare BHL content.
  18. Many irrelevant features; breaks up phrases (united states). At least it shows that we have lots of BOTANY (but we would want to merge that with plants).
  19. This shows the distribution of keywords for items scanned by the Ernst Mayr Library of the Museum of Comparative Zoology (good thing zoology shows up as a big piece). This was made using Tableau software. All of the tiny items can be identified but, as with Wordle, there is lots of irrelevant stuff. How can we automate the improvement and appropriate merging of metadata? http://public.tableausoftware.com/views/BHLViz/DigitizedSubjects
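
Notes 17 to 19 raise two problems with word-cloud style counts: multi-word subjects such as "united states" get split into separate words, and variant terms (plants and botany, birds and ornithology) are counted separately. One possible way to automate part of that cleanup is sketched below in Python. The `PREFERRED` mapping and the function names are hypothetical and stand in for a vetted keyword list such as the one from the collection development policy; this is a sketch under those assumptions, not the process BHL uses.

```python
from collections import Counter

# Hypothetical mapping from variant subject terms to a preferred keyword.
# The two pairs mirror examples given in the notes above.
PREFERRED = {"plants": "botany", "birds": "ornithology"}


def normalize(subject):
    """Lowercase, collapse whitespace, and map a variant to its preferred term.

    The whole subject string is kept as one unit, so a phrase such as
    "United States" is never split into separate words.
    """
    term = " ".join(subject.lower().split())
    return PREFERRED.get(term, term)


def keyword_counts(subjects):
    """Count normalized subject keywords across a list of subject strings."""
    return Counter(normalize(subject) for subject in subjects)
```

Counting whole subject strings rather than individual words keeps phrases intact, and routing every term through one mapping keeps the merge decisions in a single place that a librarian could review.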