Linked Data is exploding in the library world, but the biggest problems libraries have are coming up with the time or money involved in converting their records, looking into Linked Data programs, finding community support, and all the various other issues that arise as part of developing new methods. Likewise, one of the biggest hurdles for libraries and linked data is that they do not know what to do to get involved. As we have fewer people available and smaller budgets each year, we would like to explore ways in which libraries can get involved in the process without expending an undue amount of their already dwindling resources. To see how linked data can be applied, we will look at the example of the Smithsonian Libraries (SIL). Over the past 18 months, SIL has been preparing for the transition from MARC to linked open data. This session will talk about various SIL projects and initiatives (such as the FAST headings project and the introduction of Wikidata and WikiBase); how to incorporate linked data elements into MARC records; and how to develop staff and give them proficiency with new tools and workflows.
Heidy Berthoud, Head, Resource Description, Smithsonian Libraries
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Linked Data at Smithsonian Libraries
1. Click to edit Master title
style
Edit Master text styles
Linked data at
Smithsonian Libraries
Heidy Berthoud, Head of Resource Description
NASIG 2020 Practical approaches to linked data
2. Click to edit Master title style
The Smithsonian Libraries and Archives
stand with the Secretary of the
Smithsonian, Dr. Lonnie Bunch, in
expressing our deepest sympathy to the
families and communities impacted by
discrimination and violence.
3. Current linked data projects
• Participation in Program for Cooperative Cataloging’s (PCC) URIs in
MARC pilot
• Experimentation with Wikidata and creating a local Wikibase
• Adjacent projects that we have built into this work:
• FAST headings, both adding to bibliographic records and populating authority
file
• Redoing import profiles to allow vocabularies beyond LCSH
• Implementing guidelines for minimally punctuated records
4. Linked data (LD) import macro
• Created by Jackie Shieh, Descriptive Data Management
Librarian
• (Partially) developed for PCC URI pilot project
• Developed in phases:
• June-December 2019: LD import macro revisions; used only by Head
of Resource Description + Head of Serials and Electronic Resources
• January 2020-: Final LD import macro; used by all catalogers
5. LD import macro final
• Runs three tasks in MarcEdit
• RDA helper
• Build linked records
• Strips punctuation / adds 758
• Works on bib records only
• Adds tracking field
6. Roll out and training
• Team already accustomed to using an import macro
• MarcEdit needed on all machines used in cataloging workflows
• Task file for punctuation / 758 needed to be saved on all
machines
• Did change the way we do authority work
• No longer imported at point of cataloging
• Authorities are now updated and processed in a batch procedure
8. Click to edit Master title style
Lost motels
Experimental phase
9. Click to edit Master title style
Lost motels
Experimental phase
10. Click to edit Master title style
Lost motels
Experimental phase
11. Click to edit Master title style
Eggs
Macro final version
12. Click to edit Master title style
Eggs
Macro final version
13. Click to edit Master title style
Eggs
Macro final version
14. Click to edit Master title style
Eggs
Macro final version
15. Wikidata—a brief introduction
• Stores structured data in the form of simple relationship statements
• Great way to educate team members on triples
(subject/predicate/object)
• Relationships are built between two items (represented in Wikidata
by Q numbers) linked by a property (represented in Wikidata by P
numbers)
• Anyone can sign up for an account and start editing data
17. Click to edit Master title style
Lonnie Bunch (Q12053846)
subject
employer (P108)
predicate
• Smithsonian Institution
(Q131626)
• California African
American Museum
(Q5020212)
• Chicago History Museum
(Q2963315)
objects
18. Wikidata—early days
• Already have experienced Wikimedian on staff—Diane Shaw,
Special Collections Cataloger
• Particular goals and areas of interest:
• Looking for new ways to manage identities
• Looking for new ways to represent collections that aren’t handled well
in ILS
• Local Wikibase is under development
19. Wikidata—early victories
• Found support for and interest in this project across the
Smithsonian
• Office of the Chief Information Officer (OCIO)
• Multiple colleagues from Smithsonian archival units
• Creation of the Smithsonian P number in Wikidata : P7851
• Onsite training with Andrew Lih and development of online tools
20. Wikidata and teleworking
• New energy around Wikidata Team
• Highly portable work, so folks from outside metadata and cataloging have
wanted to get involved
• Easy to organize via shared documentation
• Working (so far!) on data modeling discussions, selecting properties
for creation in our Wikibase, creating properties and recording P
numbers
• Workshop with Wikimedia Deutschland happening tomorrow (June
10)
21. Resources
Statement from Secretary of the Smithsonian, Dr. Lonnie Bunch
https://www.si.edu/newsdesk/releases/statement-secretary-lonnie-g-
bunch?fbclid=IwAR19ZCVS4iY2AtA-By5XURDAOuJdDb-
qDP5QMSbyAPayamgfSE0W4TE_zc
“Talking about Race” from the Smithsonian’s National Museum of African American
History & Culture
https://nmaahc.si.edu/learn/talking-about-race
NASIG statement against racism
https://nasig.wordpress.com/2020/06/08/nasig-statement-against-racism/
22. Resources
PCC task group on linked data best practices final report (September
2019)
https://www.loc.gov/aba/pcc/taskgroup/linked-data-best-practices-final-
report.pdf
URI FAQs (September 2018)
https://www.loc.gov/aba/pcc/bibframe/TaskGroups/URI%20FAQs.pdf
23. Resources
Formulating and obtaining URIs: a guide to commonly used vocabularies and
reference sources (updated January 2020)
https://www.loc.gov/aba/pcc/bibframe/TaskGroups/formulate_obtain_URI_guide.pdf
PCC guidelines for minimally punctuated MARC bibliographic records (Approved
September 2019; effective January 2020)
https://www.loc.gov/aba/pcc/documents/PCC-Guidelines-Minimally-Punctuated-
MARC-Data.docx
24. Resources
Wikidata
https://www.wikidata.org
Wikidata : List of properties
https://www.wikidata.org/wiki/Wikidata:List_of_properties
Wikidata property proposal/Generic
https://www.wikidata.org/wiki/Wikidata:Property_proposal/Generic
Wikimedia Toolforge
https://tools.wmflabs.org/admin/
25. Click to edit Master title style
Thank you!
berthoudh@si.edu