A presentation of the strategies used by the New York Art Resources Consortium (NYARC) to provide access to their growing web archive collection by Lily Pregill, NYARC Coordinator & Systems Manager. The presentation demonstrates the integration of the Archive-It Open Search API in the Primo discovery platform and details NYARC's metadata approach . It was delivered to an online meeting organized by the Metropolitan New York Library Council (METRO) on February 29, 2016.
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Web Archiving: Description and Access
1. Web Archiving:
Description and Access
Lily Pregill
NYARC Coordinator & Systems Manager
Metropolitan New York Library Council
Web Archiving Series, Part 3
February 29, 2016
2. Chocolate + peanut butter approach
Descriptive metadata + full-text indexing are both essential
to drive discovery and retrieval of web archives
3. What is NYARC?
2009
2010
2006
2012
2015
2013
Brooklyn Museum + The Frick Collection + MoMA
New York Art Resources Consortium (NYARC) formed
Launched Arcade, shared Millennium ILS
Archive-It and Auction Catalogs Pilot Project
Mellon Grant: Reframing Collection for a Digital Age
Mellon Grant: Making the Black Hole Gray
Launched NYARC Discovery
4. Archive-It
Thematic Collections
Art Resources
Artists’ Websites
Auction Houses
Catalogues Raisonnés
NYC Galleries
Restitution of Lost or Looted Art
Institution-based Collections
Brooklyn Museum
The Frick Collection
MoMA
NYARC
10 collections > 250 websites + growing…
http://nyarc.org/webarchive
5. Accessing Web Archives
URL driven search
Multiple levels of search
Combined full-text and DC metadata search on collection page
12. Metadata Workflow
• Connexion: Begin cataloging in Connexion
• Use Extract Metadata tool
• Apply Local Constant Data built off the metadata profile
• Upload to WorldCat
• Export to local Millennium system (Arcade)
• Millennium records ingested by Primo/NYARC Discovery weekly
13. Metadata Workflow: Constant Data Example
m o d
007 c ǂb r ǂd c ǂe n
040 FXM ǂb eng ǂe rda ǂc FXM
049 FXMA
300 1 online resource : ǂb color illustrations
336 text ǂb txt ǂ2 rdacontent
336 still image ǂb sti ǂ2 rdacontent
337 computer ǂb c ǂ2 rdamedia
338 online resource ǂb cr ǂ2 rdacarrier
520 Summary
583 capture ǂc year ǂh New York Art Resources Consortium ǂ2 pet ǂ5 NyNyARC
588 Description of the resource based on live site viewed on Month, Day, Year,
and archived site; title from home page.
655 7Web sites. ǂ2 aat
85640ǂz Live site
85640ǂu ǂz Archived site
17. Where can I learn more?
Archive-It
• OpenSearch API
https://webarchive.jira.com/wiki/display/search/OpenSearch+API
• Metadata in Archive-It
https://webarchive.jira.com/wiki/display/ARIH/Metadata+in+Archive-It
NYARC Web Archiving Reports
• Archive-It and Online Auction Catalogs (2010)
http://www.nyarc.org/sites/default/files/ait_leahy_report.pdf
• Reframing Collections for a Digital Age: Final Report (2013)
http://www.nyarc.org/sites/default/files/reports/reframing_final_report2013.pdf
• Making the Black Hole Gray: Final Report (2016)
http://www.nyarc.org/sites/default/files/making_the_black_hole_gray_final_report.pdf
NYARC Documentation
• Metadata Application Profile
http://www.nyarc.org/sites/default/files/web-archiving-profile.pdf
• Metadata for Web Archived Resources: Recommendations for Further Exploration
http://www.nyarc.org/sites/default/files/Recommendations%20for%20further%20exploration-final.pdf
• NYARC Wiki
http://wiki.nyarc.org
Website coming soon ….. OCLC Research Partners Web Archiving Metadata Working Group