SlideShare a Scribd company logo
1 of 50
Society for Scholarly
Publishing Conference
Arlington, VA
Zen and the Art of
Metadata Maintenance
John W. Warren
Head, Mason Publishing Group &
George Mason University Press
May 29, 2015
The magnitudes of metadata
Library of Congress:
Eames
Metadata and the Powers of Ten
• In the 1968 film Powers of Ten by Charles and
Ray Eames, we start with a picnic by the
lakeside in Chicago, and are transported every
ten seconds by a magnitude of ten to the outer
edges of the universe. Returning to earth, we
move into the hand of the sleeping picnicker—
and with ten times more magnification every ten
seconds we end inside a proton of a carbon
atom within a DNA molecule in a white blood cell
• Metadata is like that: it is both a universe and
DNA
Metadata—“data about data”—is the
lifeblood of publishing in the digital
age and the key to discovery
Metadata is a universe of standards
across the information spectrum
The goal of good book metadata is to
make your book discoverable
…and ultimately to be purchased
Metadata begins with basic information
Product basics: Title, author, and jacket cover
Product description: Catalog copy, backcover
copy
Product details: including identifiers (ISBN),
page counts, price(s), carton quantity
Additional information: author bios, blurbs,
reviews, awards received, table of contents,
keywords, audience code, rights information,
other currencies
And even more information: book trailers,
sample chapters, excerpts, tags, other books,
Metadata is integral to the entire publishing
ecosystem
• Book metadata commonly thought of as descriptive
data or product information (title, author, etc.)
• Can be defined broadly as information that
describes content, or an item that describes another
item
• In its broader sense, a book review becomes part of
a book’s metadata, while the book review itself is
content with its own metadata
• A book becomes part of the metadata of each book
or article it references, the author becomes part of
the book’s metadata and the book part of the
author’s metadata
© 2012, the Book Industry Study Group, Inc.
Metadata is information flow
Metadata management services (not always used)
Large Publishers Medium-size
Publishers
Smaller Publishers
Distributors and aggregators (e.g.
wholesalers, data companies, etc.)
Retailers (e.g., B&N, Amazon, indie stores, etc.)
© 2012, the Book Industry Study Group, Inc.
Metadata is messy
Metadata management services (not always used)
Distributors and aggregators (e.g.
wholesalers, data companies, etc.)
Retailers (e.g., B&N, Amazon, indie stores, etc.)
Other Consumer Facing Companies (e.g., social sites, etc.)
Content
converters
Large Publishers Medium-size
Publishers
Smaller Publishers
Metadata, though
not called by that
name, has been
important to
booksellers,
libraries, and
collectors nearly
as long as the
book itself
Metadata is clearly on the rise
metadata hell
o
ONIX is a standard for representing
book, serial and video product
information in electronic form
<Product>
<RecordReference>1234567890</RecordReference>
<Notification Type>03</NotificationType>
<ISBN>9781626160132</ISBN>
<Product Form>BB</ProductForm>
<DistinctiveTitle>Hezbollah</DistinctiveTitle>
<Contributor>
<ContributorRole>A01</ContributorRole>
<PersonNameInverted>Levitt, Matthew</PersonNameInverted>
<BiographicalNote>Matthew Levitt is a senior fellow and director
of the Washington Institute for Near East Policy’s Stein Program
on Counterterrorism and Intelligence</BiographicalNote>
</Contributor>
<LanguageOfText>eng</LanguageOfText>
<NumberOfPages>368<.NumberOfPages>
<BISACMainSubject>POL012000</BISACMainSubject>
• ONIX was developed and is maintained by
EDItEUR, jointly with Book Industry Communication
(UK) and Book Industry Study Group (US), in
cooperation of industry stakeholders
ONIX 1.0 released January 2000
ONIX 2.1 released 2004
ONIX 3.0 released 2009
ONIX 3.01 released January 2012
• Many industry partners are still not accepting ONIX
3, although EDItEUR/BISG plan to sunset ONIX 2.1
• User groups in 13 different countries
• Best practices guides and code specifications
available on BISG and EDItEUR websites
ONIX—ONline Information Exchange—
has evolved and is evolving
• 32 “core” metadata elements; 27 of these
required
• Enriched ONIX data can include hundreds of
fields of data such as description, rights
information, reviews
• Trading partners include:
• Wholesalers (Ingram, Baker & Taylor)
• Online retailers (Amazon, Powells)
• Retail chains (Barnes and Noble)
• Databases (Library of Congress, Bowker Books in
Print, Serials Solutions, OCLC/WorldCat)
ONIX feeds are sent to book industry
trading partners
Using the ONIX code to indicate your
audience is one example
ONIX Code List 28 Audience Code List (Audience
Code, Description)
• 01- General/Trade
• 02- Children/Juvenile
• 03- Young Adult
• 04- Primary & Secondary/Elementary & High School
• 05- College/Higher Education
• 06- Professional and Scholarly
• 07- English as a Second Language
• 08- Adult Education
ONIX is preferred, but most trading
partners will take other methods
• Other common formats include excel or comma
delimited file formats
• Some trading partners will NOT accept ONIX
• Some vendors have “special needs” —be prepared
to accommodate important trading partners
• Cover files are also sent in a variety of formats/sizes
• According to BISG, subject code is the most
common error in publisher metadata
• Publishers, for their part, also claim that BISAC
subject headings are the most frequent element
modified in the supply chain by recipients
Metadata matters
Metadata is a process of
continuous improvement
• Improvement efforts should include increasing the
quality of metadata as well as the quantity of
accounts to which you feed metadata directly in
ONIX
• Effort never ends—there’s always something to add,
improve, clean up
• Data constantly changes; errors are introduced
regularly
• Limit the number of different databases you store
data
• Not everybody should be able to edit it, but
everyone should be able to see it/access it
• Determine the “Goldilocks Zone” between
control/dictatorship and anarchy/everybody does
anything
Metadata flows
through
numerous
systems to
trading partners,
and ultimately to
thousands of
destinations
Metadata is sent to Library of Congress,
Bowker, and direct trading partners…
… In order for your titles be discovered on
retail sites and in various databases
Metadata holds a central role in the
publishing organization
• Administration: rights and royalty information
• Interoperability: transmission of data between or within
internal and external systems and standards
• Production: flow of products through creation to
dissemination
• Marketing and distribution: data exchange with distribution,
retail, and other vendors and partners
• Repurposing: chunking content into chapters or micro
segments
• Learning: integration of content into learning management
systems and databases, from Blackboard to SCORM
• Preservation: digital assets management, version control
• Use: sales tracking, search logs, and ebook usage
BISG certification validates your
metadata’s accuracy and timeliness
• Qualitative and quantitative look at publisher
data
• Automatic validation of certain fields
• Panel of industry experts manually reviews other
fields
• Score of 95 quantitative / 90 on qualitative gives
Silver rating
– Ingram gives preferential treatment to rated presses
• BISG working on retooling program to be
scalable
• Feedback gives you clear idea of what to
Six publishers
are currently
certified
gold—four big
ones and two
University
Presses
To meta-infinity…
and beyond!
OCLC’s WorldCat shows what libraries near
the user hold your title in stock
• MARC Records (library standard
bibliographic data) can be sent directly to
libraries, though most libraries get MARC
Records from OCLC or other sources
• MARC Records help libraries catalog
titles quickly and accurately
• OCLC has an initiative to merge ONIX
and WorldCat/MARC Records
• Metadata “crosswalks” map and
exchange data between two metadata
schemas
MARC Records are the library
world’s key metadata format
Metadata is becoming
increasingly interconnected
• Google Scholar, Web of Science, and Scopus:
multidisciplinary knowledge databases
• Focus on journals — also indexes books in database
• Serials Solutions Summon Service and EBSCO
Discovery Service: popular discovery engines
• PubMed Central, LexisNexis, ERIC: specialized
sites ubiquitous in certain subject areas
• These tools enable Linked Data Repositories,
improving architecture of the semantic web
Metadata can be supplied to social sites
such as Goodreads and LibraryThing
New standards emerge constantly,
improving discovery and accuracy
What is the ISNI?
• ISO Standard, published in 2012
• International Standard Name Identifier
• Numerical representation of a name
– Assigned to public figures, content contributors (i.e.
researchers, authors, musicians, actors, publishers,
research institutions) and subjects of that content (if
they are people or institutions, i.e. Harry Potter)
– 16 digits
– Example: 0000 0004 1029 5439
ISNI is a tool for linking data sets and
improving disambiguation
Brian May,
guitarist in
Queen, is an
astrophysicist,
AND is the
editor of an
obscure
photography
book
(yes, an interesting
fellow!)
Researchers (and organizations) can
sign up for ORCID IDs
• ORCID provides a persistent digital identifier that
distinguishes researchers from others (i.e. of the
same or similar name)
• Integrated into key research workflows such as
manuscript and grant submission
• Supports automated linkages between
researchers and their professional activities,
ensuring that their work is recognized
DOI can improve discovery, sales, and
citation of scholarly content
• DOI—Digital Object Identifier—developed by the AAP in
1996 and approved as an ISO standard in 2010
• Few publishers have used DOI beyond realm of journals
• Can identify a work, product, component of a work such
as a chapter, table, or chart
• DOI is actionable, links to whatever resource the
publisher wants it to link to
• DOI itself is permanent but its metadata, as well as the
link that it resolves to, can be changed by its owner at
any time
• Can link to multiple resources: publisher website, content
itself, book reviews or social media, and product on retail
sites such as Amazon or Powells
Metadata is a work in progress
• Despite your best attempts at “clean” and robust
metadata, vendors get metadata from many
sources—some which have inaccurate data
• Ebook metadata is still the wild frontier
• Still difficult to link print and ebook metadata;
YBP is good at this for library market
• Increasing complexities of multiple formats,
multiple currencies, chapter-level sales, etc.
“95% of publishers have had the
experience of creating their e-books
with one set of metadata and seeing
an altered set of metadata at the
point of sale, online booksellers like
Amazon, Barnes & Noble and
Apple.”
― Jeremy Greenfield, Editorial
Director, Digital Book World
Keywords are a practical and still
evolving application in ONIX
• Keywords are words and phrases that describe the
content or theme of the work that are:
– Relevant to the work
– Used to supplement the title, subtitle, author name,
description
• Keywords are not required but can be useful for
discovery and Search Engine Optimization (SEO)
• Use of keywords is fluid; not all data recipients
support use but many utilize at least some keywords
• Publishers should use much discretion before
repeating information within multiple data points
Use of keywords can improve
discoverability
• Useful when consumer’s search term is jargon,
new, distinctive, specific
• When consumer does not know exact title or
author of book, or title differs from theme
• When many different and distinctive terms
describe the topic
• When book includes topics that are outside the
BISAC subjects used or consumer use terms
outside metadata
Choosing set of effective keywords is
an art and a science
• Be specific, but not so specific people won’t use the
term
• Choose unique and relevant keywords
• Avoid terms already in title or author fields
• Choose consumer-oriented keywords
• Single words and phrases of 2–5 words are
acceptable
• Use as many synonyms as appropriate
• Prioritize—500 character max recommended, but
not all vendors use the entire field
• Semicolons instead of commas are preferred, and
best with no spaces
• Title: A Dance with Dragons
Author: George R. R. Martin
ISBN: 9780553801477
Series: A Song of Ice and Fire (Book 5) BISAC Subject
Headings: FIC009020; FIC002000; FIC028010
Keywords: Game of Thrones; kingdoms; kings; magic;
dragons; HBO series; medieval; saga; Targaryen
Description: In the aftermath of a colossal battle, the
future of the Seven Kingdoms hangs in the balance—
beset by newly emerging threats from every direction. In
the east, Daenerys Targaryen, the last scion of House
Targaryen, rules with her three dragons as queen of a
city built on dust and death. But Daenerys has
thousands of enemies, and many have set out to find
her. As they gather, one young man embarks upon his
own quest for the queen, with an entirely different goal in
mind.
Overheard at
Book Expo
America
“Metadata is the
content”
(a few years ago)
Social metadata is on the rise
Metadata
can even
be a
game—
fun!
Social reading is metadata’s
next frontier
• Shared and user-created bookmarks, tags, highlights,
annotations, comments, notes
• Social Book, developed by Bob Stein at Institute for the
Future of the Book, “turns each document into a
conversation”
• Hypothes.is is an open, collaborative platform for
critique and community peer-review of books, scientific
articles, web pages, data, and other knowledge materials
• Publishers interested in receiving data on ebook reading
habits and student interaction with etextbooks
– Potential to increase quality of textbooks and perhaps
improve learning
– Privacy concerns are not insignificant
When
metadata
met
content
The novel becomes an API
• For Landing Gear, Kate Pullinger’s new novel,
Random House created an API (application
programming interface) that allows programmers
to get content in multiple ways
• Excerpt-length section of Landing Gear stored in
a content management system is tagged to
define characters, locations, events and times
• Programmers can access this data and build
new products with it, such as plot locations on a
map or events on a timeline, or even character
locations during key events
In this example, the API shows you what
happens to Yacub at the Karachi airport
Discussion/Questions
• Please contact me with any questions:
John W. Warren
Head, Mason Publishing Group/George Mason
University Press, George Mason University
Email: jwarre13@gmu.edu
Twitter: @john_w_warren
• Based on the article: “Zen and the Art of
Metadata Maintenance,” John W. Warren, June
17, 2015, Journal of Electronic Publishing
http://dx.doi.org/10.3998/3336451.0018.305
Sources and Further Reading (1)
BISG: https://www.bisg.org/publications
Best Practices for Metadata: https://www.bisg.org/product-metadata-best-
practices
ONIX: https://www.bisg.org/onix-books
Product Data Certification Program: https://www.bisg.org/product-data-
certification-program-1
Best Practices for Keywords for Metadatahttps://www.bisg.org/best-
practices-keywords-metadata
CrossRef: DOI for books:
http://www.crossref.org/02publishers/dois_for_books.html
EDItEUR, Codelists Issue 28 for Release 2.1 and 3.0:
http://www.editeur.org/14/Code-Lists
Hypothes.is: https://hypothes.is/
ISNI database: http://www.isni.org/search
Linked Data: Evolving the Web into a Global Data Space, Heath, Tom and
Christian Bizer, Synthesis Lectures on the Semantic Web: Theory and
Sources and Further Reading (2)
Metadatagames: http://www.metadatagames.org/
ORCID: http://orcid.org/
Open Utopia on Social Book: http://theopenutopia.org/social-book/
Seeing Standards: A Visualization of the Metadata Universe, Riley, Jenn and Devin
Becker. 2009–2010: http://www.dlib.indiana.edu/~jenlrile/metadatamap/
“Taxonomy of Social Reading: A Proposal,” Bob Stein, Institute for the Future of the
Book, 2010: http://futureofthebook.org/social-reading/.
“Teacher Knows if You’ve Done the E-Reading,” David Streitfeld, New York Times,
April 8 2013: http://www.nytimes.com/2013/04/09/technology/coursesmart-e-
textbooks-track-students-progress-for-teachers.html
Uncharted: Big Data as a Lens on Human Culture, Aiden, Erez and Jean-Baptiste
Michel, 2013, New York: Riverhead
“White Paper: The Link Between Metadata and Sales,” Nielsen Book, Prepared by
Andre Breedt and David Walter, 2012:
http://www.nielsenbookdata.co.uk/controller.php?page=1129

More Related Content

What's hot

Finding books in the online catalog
Finding books in the online catalogFinding books in the online catalog
Finding books in the online catalogJohn B. Cade Library
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsSusanMRob
 
Ecp 11 created by hedley. hendricks and presented by gerald Louw
Ecp 11 created by hedley. hendricks and presented by gerald LouwEcp 11 created by hedley. hendricks and presented by gerald Louw
Ecp 11 created by hedley. hendricks and presented by gerald LouwGerald Louw
 
Enterprise Content Management (ECM) in the Cloud
Enterprise Content Management (ECM) in the CloudEnterprise Content Management (ECM) in the Cloud
Enterprise Content Management (ECM) in the CloudKashif Imran
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
The mad world of ebooks apla 2008
The mad world of ebooks apla 2008The mad world of ebooks apla 2008
The mad world of ebooks apla 2008Tony Horava
 
Beginning Research Graduate Education
Beginning Research Graduate EducationBeginning Research Graduate Education
Beginning Research Graduate Educationguest3c5326
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? Debra Shapiro
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYChris Okiki
 
Role of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic LibraryRole of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic LibraryNew York University
 
What flavor of metadata is best for your collection?
What flavor of metadata is best for your collection?What flavor of metadata is best for your collection?
What flavor of metadata is best for your collection?Debra Shapiro
 
Many flavors of linked data
Many flavors of linked dataMany flavors of linked data
Many flavors of linked dataDebra Shapiro
 
Using the library for research
Using the library for researchUsing the library for research
Using the library for researchRoddy MacLeod
 

What's hot (15)

Finding books in the online catalog
Finding books in the online catalogFinding books in the online catalog
Finding books in the online catalog
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
Ecp 11 created by hedley. hendricks and presented by gerald Louw
Ecp 11 created by hedley. hendricks and presented by gerald LouwEcp 11 created by hedley. hendricks and presented by gerald Louw
Ecp 11 created by hedley. hendricks and presented by gerald Louw
 
NISO Altmetrics Recommended Practice 3:AM Conference presentation
NISO Altmetrics Recommended Practice 3:AM Conference presentationNISO Altmetrics Recommended Practice 3:AM Conference presentation
NISO Altmetrics Recommended Practice 3:AM Conference presentation
 
Enterprise Content Management (ECM) in the Cloud
Enterprise Content Management (ECM) in the CloudEnterprise Content Management (ECM) in the Cloud
Enterprise Content Management (ECM) in the Cloud
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
The mad world of ebooks apla 2008
The mad world of ebooks apla 2008The mad world of ebooks apla 2008
The mad world of ebooks apla 2008
 
Beginning Research Graduate Education
Beginning Research Graduate EducationBeginning Research Graduate Education
Beginning Research Graduate Education
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
 
Why we need oa infrastructure - STM Association Beyond Open Access Seminar
Why we need oa infrastructure - STM Association Beyond Open Access SeminarWhy we need oa infrastructure - STM Association Beyond Open Access Seminar
Why we need oa infrastructure - STM Association Beyond Open Access Seminar
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
 
Role of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic LibraryRole of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic Library
 
What flavor of metadata is best for your collection?
What flavor of metadata is best for your collection?What flavor of metadata is best for your collection?
What flavor of metadata is best for your collection?
 
Many flavors of linked data
Many flavors of linked dataMany flavors of linked data
Many flavors of linked data
 
Using the library for research
Using the library for researchUsing the library for research
Using the library for research
 

Viewers also liked

Paraire wk 3 term 2
Paraire wk 3 term 2Paraire wk 3 term 2
Paraire wk 3 term 2takp
 
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...An analysis of the role played by the ‘China Import and Export Fair’ in Canto...
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...wangqiongapp
 
Taite week 7 term 3 pdf
Taite week 7 term 3 pdfTaite week 7 term 3 pdf
Taite week 7 term 3 pdftakp
 
Mane wk9 term 1 13pdf
Mane wk9 term 1 13pdfMane wk9 term 1 13pdf
Mane wk9 term 1 13pdftakp
 
Everyone has a plan until... Automacon16
Everyone has a plan until...  Automacon16Everyone has a plan until...  Automacon16
Everyone has a plan until... Automacon16Pete Cheslock
 
Wenerei wk 10 term 2 2013 pdf1
Wenerei wk 10 term 2 2013 pdf1Wenerei wk 10 term 2 2013 pdf1
Wenerei wk 10 term 2 2013 pdf1takp
 
Mane week 4 term 3
Mane week 4 term 3Mane week 4 term 3
Mane week 4 term 3takp
 
Wenerei wk 10 term 2 2013 pdf
Wenerei wk 10 term 2 2013 pdfWenerei wk 10 term 2 2013 pdf
Wenerei wk 10 term 2 2013 pdftakp
 
Userguide eeeims en_v1.0
Userguide eeeims en_v1.0Userguide eeeims en_v1.0
Userguide eeeims en_v1.0Abetu Bope
 
Green (r)evolution – action against climate change
Green (r)evolution – action against climate changeGreen (r)evolution – action against climate change
Green (r)evolution – action against climate changeAnuj Kumar Dwivedi
 
Mane wk 7 term 2
Mane wk 7 term 2Mane wk 7 term 2
Mane wk 7 term 2takp
 
ANZ Law Seminar 2015
ANZ Law Seminar 2015ANZ Law Seminar 2015
ANZ Law Seminar 2015tinaarg
 
Mane week 5 term 3
Mane week 5 term 3Mane week 5 term 3
Mane week 5 term 3takp
 
Wenerei wk8 term 1
Wenerei wk8 term 1 Wenerei wk8 term 1
Wenerei wk8 term 1 takp
 
Mane week 9 term 3
Mane week 9 term 3Mane week 9 term 3
Mane week 9 term 3takp
 

Viewers also liked (20)

MODAL VERBS
MODAL VERBSMODAL VERBS
MODAL VERBS
 
T301
T301T301
T301
 
Rujak buah
Rujak buahRujak buah
Rujak buah
 
Paraire wk 3 term 2
Paraire wk 3 term 2Paraire wk 3 term 2
Paraire wk 3 term 2
 
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...An analysis of the role played by the ‘China Import and Export Fair’ in Canto...
An analysis of the role played by the ‘China Import and Export Fair’ in Canto...
 
Taite week 7 term 3 pdf
Taite week 7 term 3 pdfTaite week 7 term 3 pdf
Taite week 7 term 3 pdf
 
Mane wk9 term 1 13pdf
Mane wk9 term 1 13pdfMane wk9 term 1 13pdf
Mane wk9 term 1 13pdf
 
Everyone has a plan until... Automacon16
Everyone has a plan until...  Automacon16Everyone has a plan until...  Automacon16
Everyone has a plan until... Automacon16
 
Wenerei wk 10 term 2 2013 pdf1
Wenerei wk 10 term 2 2013 pdf1Wenerei wk 10 term 2 2013 pdf1
Wenerei wk 10 term 2 2013 pdf1
 
Mane week 4 term 3
Mane week 4 term 3Mane week 4 term 3
Mane week 4 term 3
 
Wenerei wk 10 term 2 2013 pdf
Wenerei wk 10 term 2 2013 pdfWenerei wk 10 term 2 2013 pdf
Wenerei wk 10 term 2 2013 pdf
 
Userguide eeeims en_v1.0
Userguide eeeims en_v1.0Userguide eeeims en_v1.0
Userguide eeeims en_v1.0
 
Green (r)evolution – action against climate change
Green (r)evolution – action against climate changeGreen (r)evolution – action against climate change
Green (r)evolution – action against climate change
 
Mane wk 7 term 2
Mane wk 7 term 2Mane wk 7 term 2
Mane wk 7 term 2
 
Pre.con.ex.
Pre.con.ex.Pre.con.ex.
Pre.con.ex.
 
ANZ Law Seminar 2015
ANZ Law Seminar 2015ANZ Law Seminar 2015
ANZ Law Seminar 2015
 
Mane week 5 term 3
Mane week 5 term 3Mane week 5 term 3
Mane week 5 term 3
 
Wenerei wk8 term 1
Wenerei wk8 term 1 Wenerei wk8 term 1
Wenerei wk8 term 1
 
Lilianagbiu
LilianagbiuLilianagbiu
Lilianagbiu
 
Mane week 9 term 3
Mane week 9 term 3Mane week 9 term 3
Mane week 9 term 3
 

Similar to Zen and the Art of Metadata Maintenance

Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in CatalogingWilliam Worford
 
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...Renee Register
 
Library Simplified at ALA 2015
Library Simplified at ALA 2015Library Simplified at ALA 2015
Library Simplified at ALA 2015jamesenglish
 
Data mining concept and methods for basic
Data mining concept and methods for basicData mining concept and methods for basic
Data mining concept and methods for basicNivaTripathy2
 
Thema webinar from BookNet Canada, June 2014
Thema webinar from BookNet Canada, June 2014Thema webinar from BookNet Canada, June 2014
Thema webinar from BookNet Canada, June 2014BookNet Canada
 
Library Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibrary Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibraries Thriving
 
Finding and Managing Information
Finding and Managing InformationFinding and Managing Information
Finding and Managing InformationNeny Isharyanti
 
James English, The New York Public Library @European Digital Distributors Me...
James English,  The New York Public Library @European Digital Distributors Me...James English,  The New York Public Library @European Digital Distributors Me...
James English, The New York Public Library @European Digital Distributors Me...TISP Project
 
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Inc
 
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Inc
 
Social networks and collaborative tools: connecting informations in the Googl...
Social networks and collaborative tools: connecting informations in the Googl...Social networks and collaborative tools: connecting informations in the Googl...
Social networks and collaborative tools: connecting informations in the Googl...Bonaria Biancu
 
Social networks and collaborative tool: connecting information in the Googlez...
Social networks and collaborative tool: connecting information in the Googlez...Social networks and collaborative tool: connecting information in the Googlez...
Social networks and collaborative tool: connecting information in the Googlez...guest9fb557
 
How BiblioShare Supports Bookselling
How BiblioShare Supports BooksellingHow BiblioShare Supports Bookselling
How BiblioShare Supports BooksellingBookNet Canada
 
Practical Information Architecture
Practical Information ArchitecturePractical Information Architecture
Practical Information ArchitectureRob Bogue
 

Similar to Zen and the Art of Metadata Maintenance (20)

Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in Cataloging
 
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...
Creating powerful metadata. Workshop. Tools of Change for Publishers (TOC) 20...
 
Library Simplified at ALA 2015
Library Simplified at ALA 2015Library Simplified at ALA 2015
Library Simplified at ALA 2015
 
Data mining concept and methods for basic
Data mining concept and methods for basicData mining concept and methods for basic
Data mining concept and methods for basic
 
Thema webinar from BookNet Canada, June 2014
Thema webinar from BookNet Canada, June 2014Thema webinar from BookNet Canada, June 2014
Thema webinar from BookNet Canada, June 2014
 
Library Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibrary Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern Librarian
 
Finding and Managing Information
Finding and Managing InformationFinding and Managing Information
Finding and Managing Information
 
Why metadata (English version)
Why metadata (English version)Why metadata (English version)
Why metadata (English version)
 
James English, The New York Public Library @European Digital Distributors Me...
James English,  The New York Public Library @European Digital Distributors Me...James English,  The New York Public Library @European Digital Distributors Me...
James English, The New York Public Library @European Digital Distributors Me...
 
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
 
Payton Eliminating Conflicts in Ebook Metadata
Payton Eliminating Conflicts in Ebook MetadataPayton Eliminating Conflicts in Ebook Metadata
Payton Eliminating Conflicts in Ebook Metadata
 
1 d.1
1 d.11 d.1
1 d.1
 
Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"
 
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
 
Metadata
MetadataMetadata
Metadata
 
Social networks and collaborative tools: connecting informations in the Googl...
Social networks and collaborative tools: connecting informations in the Googl...Social networks and collaborative tools: connecting informations in the Googl...
Social networks and collaborative tools: connecting informations in the Googl...
 
Social networks and collaborative tool: connecting information in the Googlez...
Social networks and collaborative tool: connecting information in the Googlez...Social networks and collaborative tool: connecting information in the Googlez...
Social networks and collaborative tool: connecting information in the Googlez...
 
How BiblioShare Supports Bookselling
How BiblioShare Supports BooksellingHow BiblioShare Supports Bookselling
How BiblioShare Supports Bookselling
 
Practical Information Architecture
Practical Information ArchitecturePractical Information Architecture
Practical Information Architecture
 
Vlahakis From the Perspective of the Platform Provider
Vlahakis From the Perspective of the Platform ProviderVlahakis From the Perspective of the Platform Provider
Vlahakis From the Perspective of the Platform Provider
 

Recently uploaded

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 

Recently uploaded (20)

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 

Zen and the Art of Metadata Maintenance

  • 1. Society for Scholarly Publishing Conference Arlington, VA Zen and the Art of Metadata Maintenance John W. Warren Head, Mason Publishing Group & George Mason University Press May 29, 2015
  • 2. The magnitudes of metadata Library of Congress: Eames
  • 3. Metadata and the Powers of Ten • In the 1968 film Powers of Ten by Charles and Ray Eames, we start with a picnic by the lakeside in Chicago, and are transported every ten seconds by a magnitude of ten to the outer edges of the universe. Returning to earth, we move into the hand of the sleeping picnicker— and with ten times more magnification every ten seconds we end inside a proton of a carbon atom within a DNA molecule in a white blood cell • Metadata is like that: it is both a universe and DNA
  • 4. Metadata—“data about data”—is the lifeblood of publishing in the digital age and the key to discovery
  • 5. Metadata is a universe of standards across the information spectrum
  • 6. The goal of good book metadata is to make your book discoverable …and ultimately to be purchased
  • 7. Metadata begins with basic information Product basics: Title, author, and jacket cover Product description: Catalog copy, backcover copy Product details: including identifiers (ISBN), page counts, price(s), carton quantity Additional information: author bios, blurbs, reviews, awards received, table of contents, keywords, audience code, rights information, other currencies And even more information: book trailers, sample chapters, excerpts, tags, other books,
  • 8. Metadata is integral to the entire publishing ecosystem • Book metadata commonly thought of as descriptive data or product information (title, author, etc.) • Can be defined broadly as information that describes content, or an item that describes another item • In its broader sense, a book review becomes part of a book’s metadata, while the book review itself is content with its own metadata • A book becomes part of the metadata of each book or article it references, the author becomes part of the book’s metadata and the book part of the author’s metadata
  • 9. © 2012, the Book Industry Study Group, Inc. Metadata is information flow Metadata management services (not always used) Large Publishers Medium-size Publishers Smaller Publishers Distributors and aggregators (e.g. wholesalers, data companies, etc.) Retailers (e.g., B&N, Amazon, indie stores, etc.)
  • 10. © 2012, the Book Industry Study Group, Inc. Metadata is messy Metadata management services (not always used) Distributors and aggregators (e.g. wholesalers, data companies, etc.) Retailers (e.g., B&N, Amazon, indie stores, etc.) Other Consumer Facing Companies (e.g., social sites, etc.) Content converters Large Publishers Medium-size Publishers Smaller Publishers
  • 11. Metadata, though not called by that name, has been important to booksellers, libraries, and collectors nearly as long as the book itself
  • 12. Metadata is clearly on the rise metadata hell o
  • 13. ONIX is a standard for representing book, serial and video product information in electronic form <Product> <RecordReference>1234567890</RecordReference> <Notification Type>03</NotificationType> <ISBN>9781626160132</ISBN> <Product Form>BB</ProductForm> <DistinctiveTitle>Hezbollah</DistinctiveTitle> <Contributor> <ContributorRole>A01</ContributorRole> <PersonNameInverted>Levitt, Matthew</PersonNameInverted> <BiographicalNote>Matthew Levitt is a senior fellow and director of the Washington Institute for Near East Policy’s Stein Program on Counterterrorism and Intelligence</BiographicalNote> </Contributor> <LanguageOfText>eng</LanguageOfText> <NumberOfPages>368<.NumberOfPages> <BISACMainSubject>POL012000</BISACMainSubject>
  • 14. • ONIX was developed and is maintained by EDItEUR, jointly with Book Industry Communication (UK) and Book Industry Study Group (US), in cooperation of industry stakeholders ONIX 1.0 released January 2000 ONIX 2.1 released 2004 ONIX 3.0 released 2009 ONIX 3.01 released January 2012 • Many industry partners are still not accepting ONIX 3, although EDItEUR/BISG plan to sunset ONIX 2.1 • User groups in 13 different countries • Best practices guides and code specifications available on BISG and EDItEUR websites ONIX—ONline Information Exchange— has evolved and is evolving
  • 15. • 32 “core” metadata elements; 27 of these required • Enriched ONIX data can include hundreds of fields of data such as description, rights information, reviews • Trading partners include: • Wholesalers (Ingram, Baker & Taylor) • Online retailers (Amazon, Powells) • Retail chains (Barnes and Noble) • Databases (Library of Congress, Bowker Books in Print, Serials Solutions, OCLC/WorldCat) ONIX feeds are sent to book industry trading partners
  • 16. Using the ONIX code to indicate your audience is one example ONIX Code List 28 Audience Code List (Audience Code, Description) • 01- General/Trade • 02- Children/Juvenile • 03- Young Adult • 04- Primary & Secondary/Elementary & High School • 05- College/Higher Education • 06- Professional and Scholarly • 07- English as a Second Language • 08- Adult Education
  • 17. ONIX is preferred, but most trading partners will take other methods • Other common formats include excel or comma delimited file formats • Some trading partners will NOT accept ONIX • Some vendors have “special needs” —be prepared to accommodate important trading partners • Cover files are also sent in a variety of formats/sizes • According to BISG, subject code is the most common error in publisher metadata • Publishers, for their part, also claim that BISAC subject headings are the most frequent element modified in the supply chain by recipients
  • 19. Metadata is a process of continuous improvement • Improvement efforts should include increasing the quality of metadata as well as the quantity of accounts to which you feed metadata directly in ONIX • Effort never ends—there’s always something to add, improve, clean up • Data constantly changes; errors are introduced regularly • Limit the number of different databases you store data • Not everybody should be able to edit it, but everyone should be able to see it/access it • Determine the “Goldilocks Zone” between control/dictatorship and anarchy/everybody does anything
  • 20. Metadata flows through numerous systems to trading partners, and ultimately to thousands of destinations
  • 21. Metadata is sent to Library of Congress, Bowker, and direct trading partners…
  • 22. … In order for your titles be discovered on retail sites and in various databases
  • 23. Metadata holds a central role in the publishing organization • Administration: rights and royalty information • Interoperability: transmission of data between or within internal and external systems and standards • Production: flow of products through creation to dissemination • Marketing and distribution: data exchange with distribution, retail, and other vendors and partners • Repurposing: chunking content into chapters or micro segments • Learning: integration of content into learning management systems and databases, from Blackboard to SCORM • Preservation: digital assets management, version control • Use: sales tracking, search logs, and ebook usage
  • 24. BISG certification validates your metadata’s accuracy and timeliness • Qualitative and quantitative look at publisher data • Automatic validation of certain fields • Panel of industry experts manually reviews other fields • Score of 95 quantitative / 90 on qualitative gives Silver rating – Ingram gives preferential treatment to rated presses • BISG working on retooling program to be scalable • Feedback gives you clear idea of what to
  • 25. Six publishers are currently certified gold—four big ones and two University Presses
  • 27. OCLC’s WorldCat shows what libraries near the user hold your title in stock
  • 28. • MARC Records (library standard bibliographic data) can be sent directly to libraries, though most libraries get MARC Records from OCLC or other sources • MARC Records help libraries catalog titles quickly and accurately • OCLC has an initiative to merge ONIX and WorldCat/MARC Records • Metadata “crosswalks” map and exchange data between two metadata schemas MARC Records are the library world’s key metadata format
  • 29. Metadata is becoming increasingly interconnected • Google Scholar, Web of Science, and Scopus: multidisciplinary knowledge databases • Focus on journals — also indexes books in database • Serials Solutions Summon Service and EBSCO Discovery Service: popular discovery engines • PubMed Central, LexisNexis, ERIC: specialized sites ubiquitous in certain subject areas • These tools enable Linked Data Repositories, improving architecture of the semantic web
  • 30. Metadata can be supplied to social sites such as Goodreads and LibraryThing
  • 31. New standards emerge constantly, improving discovery and accuracy What is the ISNI? • ISO Standard, published in 2012 • International Standard Name Identifier • Numerical representation of a name – Assigned to public figures, content contributors (i.e. researchers, authors, musicians, actors, publishers, research institutions) and subjects of that content (if they are people or institutions, i.e. Harry Potter) – 16 digits – Example: 0000 0004 1029 5439
  • 32. ISNI is a tool for linking data sets and improving disambiguation Brian May, guitarist in Queen, is an astrophysicist, AND is the editor of an obscure photography book (yes, an interesting fellow!)
  • 33. Researchers (and organizations) can sign up for ORCID IDs • ORCID provides a persistent digital identifier that distinguishes researchers from others (i.e. of the same or similar name) • Integrated into key research workflows such as manuscript and grant submission • Supports automated linkages between researchers and their professional activities, ensuring that their work is recognized
  • 34. DOI can improve discovery, sales, and citation of scholarly content • DOI—Digital Object Identifier—developed by the AAP in 1996 and approved as an ISO standard in 2010 • Few publishers have used DOI beyond realm of journals • Can identify a work, product, component of a work such as a chapter, table, or chart • DOI is actionable, links to whatever resource the publisher wants it to link to • DOI itself is permanent but its metadata, as well as the link that it resolves to, can be changed by its owner at any time • Can link to multiple resources: publisher website, content itself, book reviews or social media, and product on retail sites such as Amazon or Powells
  • 35. Metadata is a work in progress • Despite your best attempts at “clean” and robust metadata, vendors get metadata from many sources—some which have inaccurate data • Ebook metadata is still the wild frontier • Still difficult to link print and ebook metadata; YBP is good at this for library market • Increasing complexities of multiple formats, multiple currencies, chapter-level sales, etc.
  • 36. “95% of publishers have had the experience of creating their e-books with one set of metadata and seeing an altered set of metadata at the point of sale, online booksellers like Amazon, Barnes & Noble and Apple.” ― Jeremy Greenfield, Editorial Director, Digital Book World
  • 37. Keywords are a practical and still evolving application in ONIX • Keywords are words and phrases that describe the content or theme of the work that are: – Relevant to the work – Used to supplement the title, subtitle, author name, description • Keywords are not required but can be useful for discovery and Search Engine Optimization (SEO) • Use of keywords is fluid; not all data recipients support use but many utilize at least some keywords • Publishers should use much discretion before repeating information within multiple data points
  • 38. Use of keywords can improve discoverability • Useful when consumer’s search term is jargon, new, distinctive, specific • When consumer does not know exact title or author of book, or title differs from theme • When many different and distinctive terms describe the topic • When book includes topics that are outside the BISAC subjects used or consumer use terms outside metadata
  • 39. Choosing set of effective keywords is an art and a science • Be specific, but not so specific people won’t use the term • Choose unique and relevant keywords • Avoid terms already in title or author fields • Choose consumer-oriented keywords • Single words and phrases of 2–5 words are acceptable • Use as many synonyms as appropriate • Prioritize—500 character max recommended, but not all vendors use the entire field • Semicolons instead of commas are preferred, and best with no spaces
  • 40. • Title: A Dance with Dragons Author: George R. R. Martin ISBN: 9780553801477 Series: A Song of Ice and Fire (Book 5) BISAC Subject Headings: FIC009020; FIC002000; FIC028010 Keywords: Game of Thrones; kingdoms; kings; magic; dragons; HBO series; medieval; saga; Targaryen Description: In the aftermath of a colossal battle, the future of the Seven Kingdoms hangs in the balance— beset by newly emerging threats from every direction. In the east, Daenerys Targaryen, the last scion of House Targaryen, rules with her three dragons as queen of a city built on dust and death. But Daenerys has thousands of enemies, and many have set out to find her. As they gather, one young man embarks upon his own quest for the queen, with an entirely different goal in mind.
  • 41. Overheard at Book Expo America “Metadata is the content” (a few years ago)
  • 42. Social metadata is on the rise
  • 44. Social reading is metadata’s next frontier • Shared and user-created bookmarks, tags, highlights, annotations, comments, notes • Social Book, developed by Bob Stein at Institute for the Future of the Book, “turns each document into a conversation” • Hypothes.is is an open, collaborative platform for critique and community peer-review of books, scientific articles, web pages, data, and other knowledge materials • Publishers interested in receiving data on ebook reading habits and student interaction with etextbooks – Potential to increase quality of textbooks and perhaps improve learning – Privacy concerns are not insignificant
  • 46. The novel becomes an API • For Landing Gear, Kate Pullinger’s new novel, Random House created an API (application programming interface) that allows programmers to get content in multiple ways • Excerpt-length section of Landing Gear stored in a content management system is tagged to define characters, locations, events and times • Programmers can access this data and build new products with it, such as plot locations on a map or events on a timeline, or even character locations during key events
  • 47. In this example, the API shows you what happens to Yacub at the Karachi airport
  • 48. Discussion/Questions • Please contact me with any questions: John W. Warren Head, Mason Publishing Group/George Mason University Press, George Mason University Email: jwarre13@gmu.edu Twitter: @john_w_warren • Based on the article: “Zen and the Art of Metadata Maintenance,” John W. Warren, June 17, 2015, Journal of Electronic Publishing http://dx.doi.org/10.3998/3336451.0018.305
  • 49. Sources and Further Reading (1) BISG: https://www.bisg.org/publications Best Practices for Metadata: https://www.bisg.org/product-metadata-best- practices ONIX: https://www.bisg.org/onix-books Product Data Certification Program: https://www.bisg.org/product-data- certification-program-1 Best Practices for Keywords for Metadatahttps://www.bisg.org/best- practices-keywords-metadata CrossRef: DOI for books: http://www.crossref.org/02publishers/dois_for_books.html EDItEUR, Codelists Issue 28 for Release 2.1 and 3.0: http://www.editeur.org/14/Code-Lists Hypothes.is: https://hypothes.is/ ISNI database: http://www.isni.org/search Linked Data: Evolving the Web into a Global Data Space, Heath, Tom and Christian Bizer, Synthesis Lectures on the Semantic Web: Theory and
  • 50. Sources and Further Reading (2) Metadatagames: http://www.metadatagames.org/ ORCID: http://orcid.org/ Open Utopia on Social Book: http://theopenutopia.org/social-book/ Seeing Standards: A Visualization of the Metadata Universe, Riley, Jenn and Devin Becker. 2009–2010: http://www.dlib.indiana.edu/~jenlrile/metadatamap/ “Taxonomy of Social Reading: A Proposal,” Bob Stein, Institute for the Future of the Book, 2010: http://futureofthebook.org/social-reading/. “Teacher Knows if You’ve Done the E-Reading,” David Streitfeld, New York Times, April 8 2013: http://www.nytimes.com/2013/04/09/technology/coursesmart-e- textbooks-track-students-progress-for-teachers.html Uncharted: Big Data as a Lens on Human Culture, Aiden, Erez and Jean-Baptiste Michel, 2013, New York: Riverhead “White Paper: The Link Between Metadata and Sales,” Nielsen Book, Prepared by Andre Breedt and David Walter, 2012: http://www.nielsenbookdata.co.uk/controller.php?page=1129

Editor's Notes

  1. Zen and the Art of Metadata Maintenance Metadata is the lifeblood of publishing in the digital age and the key to discovery. Metadata is a universe, a continuum of standards, and a process of information flow; the process of creating and disseminating metadata involves both art and science. This workshop we will examine the theory and practice of metadata, from the basics of quality metadata construction and management, steps for analyzing your metadata process and instilling continuous improvement steps, programs such as BISG’s Product Certification Program, practical applications for publishers and authors such as keywords, metadata issues for ebooks, and the “frontiers” of the ever-evolving metadata universe. Metadata permeates and enables all aspects of publishing, from information creation and production to marketing and dissemination and an understanding of metadata is essential for publishers, authors, and all those involved in the publishing industry.
  2. http://www.loc.gov/exhibits/eames/science.html
  3. Metadata and the Eames “Powers of Ten” http://www.powersof10.com/film
  4. “Seeing Standards: A Visualization of the Metadata Universe” : Zooming within the domain of scholarly texts, we note a plethora of standards, such as ONIX, used within the book trade. This is a sampling, as the authors point out, “the standards represented here are among those most heavily used or publicized in the cultural heritage community, though certainly not all standards that might be relevant are included” http://www.dlib.indiana.edu/~jenlrile/metadatamap/ Jenn Riley and Devin Becker of the Indiana University Libraries
  5. SOURCE: BISG Generally speaking, metadata is created by publishers, and either directly or through intermediaries, sent to retail partners. The retailers then parse the metadata and display product to their customers, sometimes online, sometimes in store. This chart shows how it’s supposed to work. ____________________ This is the ideal system: metadata is never changed between leaving the publisher and reaching the retailers
  6. And this chart shows how it actually works. The key thing to note here is what were two-way arrows are now one-way arrows. And the red arrows show where metadata is being altered. But more on this later. ______________________ Actual flow; touched by a lot of hands; messy; hard to tell where problems originate
  7. In the early age of print, libraries, collectors, and book dealers managed their inventories in correspondence and account books; the Frankfurt Book Fair emerged by 1475 as an important trading place for books; and in the early 16th century the development of the book trade led to the rise of printer’s guilds and to the creation of standards (such as the book’s title page)—all these factors led to the creation of metadata although it wasn’t yet referred to by that name
  8. Google N Gram viewer Digitized millions of books, can do search on keywords or phrases to see their use over time https://books.google.com/ngrams/graph?content=metadata&year_start=1960&year_end=2000&corpus=15&smoothing=3&share=&direct_url=t1%3B%2Cmetadata%3B%2Cc0
  9. This is what a typical ONIX record looks like. At first blush, it’s pretty technical. But really, it’s eminently logical. One important thing to note: ONIX is undergoing a transformation. The current version of the standard, 2.1, is being phased out. The new version, 3.0, is more robust, more nimble, and particularly well-suited for dealing with digital content. BISG is encouraging American publishers to have a migration plan before the sunset date. ______________ According to BISG: Subject code is what is most frequently wrong in publishers’ metadata https://www.bisg.org/onix-books http://www.booksonix.info/what-is-onix
  10. Best Practices for Metadata: https://www.bisg.org/publications/best-practices-product-metadata Registration required Best Practices for Keywords for Metadata https://www.bisg.org/best-practices-keywords-metadata Latest set of code lists Codelists Issue 24 for Release 2.1 and 3.0 http://www.editeur.org/14/Code-Lists/#Code%20lists%20issue%20latest
  11. BISG’s guidelines identify 32 core metadata elements—of which 27 are listed as mandatory, albeit some mandatory only if physical products, some if digital, some in special circumstances. This does not mean, however, that trading partners will reject publisher metadata with missing “mandatory” elements. The majority of these core elements have various sub-elements and iterations. The full ONIX specification (release 26; July 2014) comprises 156 code-lists that include a whopping 4,105 different values, and that does not take into account the nearly innumerable iterations within those values, such as the 3,000-odd BISAC subject codes to be used for code list 26, value 10.
  12. https://www.bisg.org/audience-codes
  13. According to BISG, the element most frequently in error within publishers’ metadata is the subject code. This might seem odd—after all, one would expect publishers to know the subjects of the books they publish—except when considering that the subject codes used, BISAC Subject Headings (in the US) include approximately 3,000 “minor” subject headings grouped under 51 “major” subjects, while BIC codes (used in the UK), consist of 2,600 different subject categories arranged in 18 sections defining broad subject areas, plus approximately 1000 qualifiers used to refine the meaning of the subject categories. That, and it may be an intern or a lower-level marketing coordinator that is assigning subject codes for a publisher’s titles. Publishers, for their part, also claim that BISAC subject headings are the most frequent element modified by metadata recipients.
  14. Nielsen BookScan conducted a study on the impact of metadata on sales. The study, conducted by Andre Breedt and David Walter of Nielsen, tracked ISBNs in the British market, but the results are clearly analogous here. Reports that those titles in Nielsen’s top-selling 85,000 with complete data records sold 70% more copies on average than those with incomplete metadata.   In this chart, you can see that complete metadata directly impacts sales. Titles with images compared with those that did not have images sold six times as many units. ___________________ Nielsen website >> “metadata whitepaper” >> contains complete, extensive results of the study (including more graphs) See also http://publishingperspectives.com/metadata-conference-2011/
  15. Start with “low hanging fruit” Move toward bigger projects (ie turning 4 different databases with assoc metadata into 1) Access– important to control inadvertent changes/mistakes “Goldilocks zone” (Bob Oeste) zone between control/dictatorship and anarchy/everybody does anything
  16. Metadata permeates and enables all aspects of publishing, from information creation and production to marketing and dissemination and an understanding of metadata is essential for publishers, authors, and all those involved in the publishing industry.
  17. BISG also has a program to certify product data before it goes into the supply chain. Some end users provide incentives for metadata that meets minimum standards. Until now, our Product Data Certification Program has been a largely manual effort. We’re in the process of retooling the PDCP program to make it more automated and more scalable. All of these efforts taken together are designed to improve the quality of metadata. Because, if we want this: _____________________________ Quantitative and qualitative look at publisher’s data Score 95 quantitative / 90 on qualitative >> Silver (Ingram gives preferential treatment to rated presses) Currently 2 to 3 month rating process, want to set up continuous certification Goal is to increase efficiency Industry would benefit from single data repository BISG is a 4-person office, and 1 employee is currently moving to Germany >> ratings schedule is backed up, so they are not accepting new data at the moment (hope to be back up to speed with a few months)
  18. https://www.bisg.org/product-data-certification-program-1
  19. Publishers can send metadata directly, though these sites get metadata from a variety of sources. Sending directly makes it quicker and more accurate. This is also an important source of CROWD-SOURCED metadata—Goodreads for example sends crowd-generated metadata to other sites such as Google
  20. Helps a publisher distinguish between two authors with the same, or similar names, for royalty purposes. You might also be familiar with ORCID –another way to represent a researchers name with numerical identifier. 16 digits – final one is a check digit so it is sometimes an X In machine-to-machine communication, the ISNI is rendered without the spaces. We break it up into four sections just so it’s more human-readable on the web.
  21. Examples: Brian May, noted guitarist for the band Queen, is also an astrophysicist who has published his dissertation “A survey of radial velocities in the zodiacal dust cloud”. He has ALSO annotated a seminal collection of stereoscopic photographs. A true Renaissance man – and the ISNI allows for these works to be linked together under his unique, persistent identifier. Another example: Publisher has two author with the same name but must distinguish between them for royalty purposes Links databases such as Books in Print and Musicbrainz together via ISNI A music professor can be identified as a session musician for Wynton Marsalis and as the author of monographs on the evolution of jazz keyboard styles.
  22. http://orcid.org/
  23. In my experience—let’s call it Warren’s Law—the author is invariably the one to find errors in the publisher’s metadata, indubitably due to authors’ predilections to checking their Amazon rankings. May 3, 2012 http://www.digitalbookworld.com/2012/nearly-100-of-publishers-have-seen-e-booksellers-get-their-metadata-wrong/
  24. Example– jargon iPad Mini for Dummies (BISAC Subject: COMPUTERS / Hardware / Tablets) Possible keywords: Apple Let Me Off at the Top!: My Classy Life and Other Musings (BISAC Subject: HUMOR / Form / Parodies) Possible keywords: Will Ferrell; Anchorman; movies Example- exact title unknown, Please Kill Me: The Uncensored Oral History of Punk (BISAC Subject: MUSIC / Genres & Styles / Punk) Possible Keywords: CBGBs; Velvet Underground Example- different terms The Daniel Plan: 40 Days to a Healthier Life (BISAC Subject: HEALTH & FITNESS / Healthy Living) Possible keywords: weight loss; fitness; Saddleback Church; Southern Baptist Convention Example, outside BISAC Making Toast: A Family Story (BISAC Subject: BIOGRAPHY & AUTOBIOGRAPHY / Personal Memoirs) Possible keywords: widowers; grandparents; grief; caring for children; domestic life; alternative
  25. For example, a book with World War II as the setting, fiction or nonfiction, you may not have these words in the title or description but you can use as keywords: World War 2 Second World War WWII European Theater Pacific Theater Military history D-Day
  26. A well-known series in its own right, this title is part of the basis for HBO’s TV show Game of Thrones. The official series, A Song of Ice and Fire, may not be immediately recognizable to consumers searching for books related to the show. Therefore the keywords section is a great way to make this connection while maintaining the integrity of the book’s descriptive content, etc.
  27. See “Open Utopia” on SocialBook, using CommentPress http://www.livemargin.com/socialbook/client/user_home.html
  28. See “Open Utopia” on SocialBook, using CommentPress http://www.livemargin.com/socialbook/client/user_home.html
  29. See “Utopia” on SocialBook, using CommentPress SocialBook- (BOB STEIN) Civic Participation among undocumented immigrants in an age of Deportation, Alvaro Corral, DRAFT http://www.livemargin.com/socialbook/client/reader.html#bookId=5241fa66e4b02b4c7eafc633&mode=community&chunk=1&offset=27
  30. To get access to the Landing Gear API or see some of the resulting projects, please visit http://www.randomhouse.ca/LandingGearAPI See also http://www.katepullinger.com/digital/ and http://www.randomhouse.ca/landing-gear http://bookcontentapi.devcloud.acquia-sites.com/
  31. Metadata is full of paradox. Publishers must exert control and perfect metadata for their titles, while unleashing it into the world, inviting loss of control and imperfection. Metadata is highly technical yet eminently creative. Even as metadata has become a key component in propelling discovery and sales of books and other content, it is evolving from the enterprise of illuminating meaning and significance in an individual title to enabling linkages between a broad network of objects and resources.