SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Downloaden Sie, um offline zu lesen
CHASING 

THE FIFTH STAR

Open data at the National Library
Michael Lascarides
User Experience Lead, DigitalNZ
@mlascarides
Kia ora, I am Michael Lascarides. I’m the User Experience Lead at the National Library of New Zealand, where I work as part of the DigitalNZ team.
We make web, mobile, and data interfaces for all kinds of folks to use, from professional and academic researchers to the generally curious, as well
as our own staff. I’d like to share a little bit about a few of the ways our institution creates, uses, and shares our collections data so that folks like you
can turn it into something wonderful.
@mlascarides on twitter
FYI: There are a lot of links in this talk, but I’m going to move pretty quickly past most of them. If you want to download a copy
of this talk, I’ve posted it to Twitter, where you can also ask me anything
You can also request pictures of my new puppy. (Sorry.)
OUR LIBRARY
In case you’re unsure or have forgotten just what a National Library does, here’s a quick overview.
By Pear285 (Own work) [CC BY-SA 4.0 (http://creativecommons.org/licenses/by-sa/4.0)], via Wikimedia Commons
The National Library is New Zealand's legal deposit library. The Act of Parliament that created us charges us with a mission to "enrich the cultural and economic life of
New Zealand and its interchanges with other nations”. We have, roughly speaking, three main collections: the General Collections (encompassing the Legal deposit
services), the Schools Collection (including the largest collection of children’s books in the southern hemisphere), and the collections of the Alexander Turnbull Library,
predominantly comprised of unpublished materials such as manuscripts and photographs.
836K
unpublished
1.4M
published
30M
items
We currently have over 800 thousand items in the catalogue of unpublished materials, 1.4 million in the published, and over 30 million items searchable on our web site,
which are mostly individual digitised newspaper articles from our Papers Past service.
We’ve got books, maps, photographs, recorded music, music scores, newspapers, periodicals, manuscripts, letters, paintings, artefacts, manuals, and more.
HOW WE ORGANISE OUR DIGITAL STUFF
DIGITAL PRESERVATION (NDHA)
PUBLISHED
CATALOGUE
UNPUBLISHED
CATALOGUE
(TIAKI)
DATABASES
&
INDEXES
FULL-TEXT
DIGITAL
OBJECTS
METADATA SERVICE (DIGITALNZ API)
OTHER
INSTITUTIONS
DIGITALNZ.ORG
* GREATLY SIMPLIFIED
PHYSICAL COLLECTIONS
SUBSCRIPTION
SERVICES
NATLIB.GOVT.NZ
PAPERS PAST
This is a very rough diagram of what our digital world looks like. On the bottom, there’s the actual collections, in physical and digital forms, with layers of catalogues and
databases just above in blue. We deliver materials to the world through three main web sites, in red at the top: The National Library site, Papers Past and DigitalNZ. In
between is our metadata service, the DigitalNZ API, which is the secret sauce we use to create ties within our collections and to those in other institutions. We’ll look
more closely at these in a moment.
GOOGLE “NATLIB STRATEGY 2030”
We recently created and published our new guiding strategy, which looks ahead to the year 2030. The basic strategy fits on a single slide…

https://natlib.govt.nz/about-us/strategy-and-policy/strategic-directions
New Zealanders will…
…trust that their documentary heritage and
TAONGA are collected, preserved and accessible
…easily access, share and use New Zealand’s
KNOWLEDGE resources
…have the LITERACY skills to achieve social,
educational and employment success and be
inspired
…to innovate and create new knowledge.
…and I think it’s a pretty good framework for this talk in front of this particular audience. We are going to preserve the nation’s documentary heritage, build a knowledge
network around it, and ensure that the country has the literacy (including digital literacy!) to make full use of it.

https://natlib.govt.nz/about-us/strategy-and-policy/strategic-directions
❤🔬
(WE LOVE RESEARCHERS)
All of our collections are utterly meaningless if people don’t use them. So we’re always keen to have as many people as possible exploring, interrogating, and reusing our
collections. There is a lot of collaboration and co-creation implied in that strategy, so let’s be on this journey together.
OPEN DATA AT NLNZ
One of the ways we try to encourage co-creation and collaboration is to be as open as possible.
As a government agency, we strive to release our data in under open licenses 

https://www.ict.govt.nz/guidance-and-resources/open-government/new-zealand-government-open-access-and-licensing-nzgoal-framework/
and employ the best practices we can when sharing that data. 

https://www.data.govt.nz/toolkit/open-data-in-new-zealand/open-data-nz/
But beyond the basic governmental requirements to be openly available, we aspire to be as interoperable and interconnected as possible. The five-star scale promoted
by Tim Berners-Lee is still the standard in this regard.

http://5stardata.info/en/
5stardata.info
⭐ AVAILABLE
⭐⭐ STRUCTURED
⭐⭐⭐ OPEN
⭐⭐⭐⭐ PERMANENT
⭐⭐⭐⭐⭐ CONNECTED
This would be my summary of what that means.
#lodlam
That elusive 5th star is the “Linked” in “Linked Open Data”. If you’re new to these concepts and you find them interesting, a great hashtaggable term to follow on your
fave social media is LODLAM, which is Linked Open Data for Libraries and Museums. Doing so will connect you to a lovely, smart and interesting community of people,
some of whom are in this room.
natlib.govt.nz/about-us/open-data
We’ve gathered the open data sets that we have available at the National Library on a single page so that you can easily get your hands on them.
OPEN DATA SETS AVAILABLE
Data sets Format ⭐?
PublicationsNZ, IndexNZ, Te Puna Web
Directory, Māori subject headings
CSV, MARC ⭐⭐⭐
Turnbull Library unpublished collections
metadata, Iwi/Hapu Names list XML ⭐⭐⭐
DigitalNZ Metadata, Papers Past
Metadata, Turnbull Library Metadata API (JSON) ⭐⭐⭐⭐
A lot of our open data sets are collections data, and we’re doing all right on the 5-star scale, with mostly threes and fours. But we have a couple of collections that run
quite a bit deeper.
PAPERS PAST
The first of these is Papers Past.
Papers Past is the site where we deliver our full-text digitised materials.
We started with newspapers, and there are over 4 million New Zealand newspaper pages from 1839 to 1949. They’re scanned and automatically transcribed via optical
character recognition, so they are full-text searchable.
It has been expanded to include more than a million pages of magazines,
journals
letters, diaries
and parliamentary papers.
We’ve had researchers mine Papers Past for everything from linguistic analysis training data, to tracking the history of political propaganda, to using old weather reports
to chart historical climate change. (This is a 1912 article about man-made climate change, by the way). And if you’re more maths-y, computer science-y, there’s
opportunities to help us improve machine transcriptions, extract entities like names and places from texts, and a whole lot more.
WANT BULK DATA?
Just ask!
Four million articles 

from 73 titles 

available up to 1878
So, the Papers Past web site is an amazing resource for researchers in its own right. But we often get requests from people who want copies of our raw data. Doing so
previously had been very tricky due to the complexity of copyright—you’d be amazed how many newspaper companies from the 19th century are still around. But we’ve
cleared all the hurdles to release the raw data for newspapers up to 1878. It’s a small part of the collection, but even this small part of Papers Past includes four million
articles.
DIGITALNZ
The other deep, rich digital resource we maintain is DigitalNZ.
digitalnz.org
DigitalNZ is our service that collects the metadata from cultural heritage organisations in New Zealand, and those worldwide that have New Zealand-related content.
digitalnz.org
This year marks our 10th birthday!
We harvest the metadata for over 30 million items from more than 200 institutions, map it to a standard format, and make them all discoverable from a single search.
While we use this data to power the DigitalNZ web site, which is our web front end to the aggregated collection, the real star of the show is the DigitalNZ API, our
machine-readable metadata service.
Anyone who is interested can get a developer key from our website and start hacking with our data. You can build your own products, or automate your research. And of
course, most of the National Library’s own collections are available through the service.
We’ve recently introduced a feature called Stories, which lets you assemble items from across the DigitalNZ content partners’ collections and weave them together with
your own narratives. Or, if you’re feeling less-inspired, you can just use a story as a way to organise your research.
The leading edge of our work with DigitalNZ is getting us really close to that fifth star.
Concepts API
Moving towards 5 Star
Linked Open Data
We’ve recently introduced the Concepts API, which allows you to interrogate our collections for items related to specific places or specific people, rather than just
keyword searches.

Overview: https://digitalnz.org/blog/posts/introducing-the-digitalnz-concepts-api

Documentation: http://digitalnz.github.io/supplejack/api_usage/concepts-api.html
https://digitalnz.org/concepts/4062
For the first time, you can see concepts in action on our recently redesigned website as the Explore Places feature, where we offer a permanent link to each Place
concept along with all of the items in our collections that we determine to be related to that place.
http://digitalnz.github.io/supplejack/
It’s also worth noting that we have freely released the software that powers DigitalNZ as an open source project, so if you’ve got a big metadata harvesting job of your
own, you can benefit from our 10 years of blood, sweat, and tears.
WHAT ARE OUR
INTERESTING PROBLEMS?
I’d like to close with a few of the problems we are working on, which should give you a sense of what we’re thinking about and where we’re headed, but just possibly
also spark some ideas for collaboration with some of you in the future.
How do we connect our
stuff to other peoples’
stuff?
(aka ⭐⭐⭐⭐⭐)
Understanding the tools. Liaison with other institutions. Doing the work. Going from concept to production.
How do we scale up?
Fighting technical debt and scaling issues. Brewster Kahle’s incitement to digitize everything in NZ.
How do we get people
involved?
More content partners. Promoting re-use. Promoting our open source tools. General marketing. Making tools easy. Educating people in digital literacy. Breaking down
barriers.
How do know what cool
things people are m
making with our stuff?
Measuring the impact we have on New Zealand and the world is a HARD. PROBLEM. If you build something with our stuff, it is immensely useful to us (and immensely
persuasive to the folks who allot our funding) if you let us know about it.
TAKE OUR STUFF.
MAKE SOMETHING
WONDERFUL WITH IT.
So please go out and do it.
Thank you!
Michael Lascarides
User Experience Lead, DigitalNZ
@mlascarides

Weitere ähnliche Inhalte

Was ist angesagt?

Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesRichard Wallis
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in LibrariesRichard Wallis
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Vladimir Alexiev, PhD, PMP
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibraryRichard Wallis
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discoveryRichard Wallis
 
WorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgWorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgRichard Wallis
 
Telling the World and Our Users What We Have
Telling the World and Our Users What We HaveTelling the World and Our Users What We Have
Telling the World and Our Users What We HaveRichard Wallis
 
Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending InfluenceRichard Wallis
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending BenefitsRichard Wallis
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationRichard Wallis
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library DataRichard Wallis
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talkDan Brickley
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our OpportunityRichard Wallis
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data StrategyRichard Wallis
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.orgJoshua Shinavier
 
2011 jisc rdtf teresa the womens library
2011 jisc rdtf teresa the womens library2011 jisc rdtf teresa the womens library
2011 jisc rdtf teresa the womens libraryTeresa Doherty
 

Was ist angesagt? (20)

Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for Libraries
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in Libraries
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your Library
 
GLAMs working with Wikidata
GLAMs working with WikidataGLAMs working with Wikidata
GLAMs working with Wikidata
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discovery
 
Extending Schema.org
Extending Schema.orgExtending Schema.org
Extending Schema.org
 
WorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgWorldCat, Works, and Schema.org
WorldCat, Works, and Schema.org
 
Telling the World and Our Users What We Have
Telling the World and Our Users What We HaveTelling the World and Our Users What We Have
Telling the World and Our Users What We Have
 
Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending Influence
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending Benefits
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 
Linked Data and OCLC
Linked Data and OCLCLinked Data and OCLC
Linked Data and OCLC
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talk
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
Situation Dänemark
Situation DänemarkSituation Dänemark
Situation Dänemark
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data Strategy
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
 
2011 jisc rdtf teresa the womens library
2011 jisc rdtf teresa the womens library2011 jisc rdtf teresa the womens library
2011 jisc rdtf teresa the womens library
 

Ähnlich wie Chasing the Fifth Star - Open Data at the National Library of NZ

Libraries & Tech for Good, 11 July 2016 (with notes)
Libraries & Tech for Good, 11 July 2016 (with notes)Libraries & Tech for Good, 11 July 2016 (with notes)
Libraries & Tech for Good, 11 July 2016 (with notes)George Oates
 
Hello islandora building a digital repository nov 30, 2016 v6
Hello islandora  building a digital repository nov 30, 2016 v6Hello islandora  building a digital repository nov 30, 2016 v6
Hello islandora building a digital repository nov 30, 2016 v6eohallor
 
Digital Public Library of America
Digital Public Library of AmericaDigital Public Library of America
Digital Public Library of AmericaLarry Naukam
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...lljohnston
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collectionslljohnston
 
DPLA - an introduction for historians
DPLA  - an introduction for historiansDPLA  - an introduction for historians
DPLA - an introduction for historiansLarry Naukam
 
For the People: Digitizing Hearings from the 60s, 70s, and 80s
For the People: Digitizing Hearings from the 60s, 70s, and 80sFor the People: Digitizing Hearings from the 60s, 70s, and 80s
For the People: Digitizing Hearings from the 60s, 70s, and 80sSonnet Ireland
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryMia
 
Libraries & their digital future
Libraries & their digital futureLibraries & their digital future
Libraries & their digital futureMal Booth
 
For the benefit of all
For the benefit of allFor the benefit of all
For the benefit of allSonnet Ireland
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataLotte Belice Baltussen
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Olaf Janssen
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...PACKED vzw
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataHamilton Public Library
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 
Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011lljohnston
 

Ähnlich wie Chasing the Fifth Star - Open Data at the National Library of NZ (20)

Libraries & Tech for Good, 11 July 2016 (with notes)
Libraries & Tech for Good, 11 July 2016 (with notes)Libraries & Tech for Good, 11 July 2016 (with notes)
Libraries & Tech for Good, 11 July 2016 (with notes)
 
Hello islandora building a digital repository nov 30, 2016 v6
Hello islandora  building a digital repository nov 30, 2016 v6Hello islandora  building a digital repository nov 30, 2016 v6
Hello islandora building a digital repository nov 30, 2016 v6
 
Digital Public Library of America
Digital Public Library of AmericaDigital Public Library of America
Digital Public Library of America
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 
DPLA - an introduction for historians
DPLA  - an introduction for historiansDPLA  - an introduction for historians
DPLA - an introduction for historians
 
WORLD CAT AS BIG DATA
WORLD CAT AS  BIG DATAWORLD CAT AS  BIG DATA
WORLD CAT AS BIG DATA
 
For the People: Digitizing Hearings from the 60s, 70s, and 80s
For the People: Digitizing Hearings from the 60s, 70s, and 80sFor the People: Digitizing Hearings from the 60s, 70s, and 80s
For the People: Digitizing Hearings from the 60s, 70s, and 80s
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British Library
 
Libraries & their digital future
Libraries & their digital futureLibraries & their digital future
Libraries & their digital future
 
Libraries a living hub
Libraries a living hubLibraries a living hub
Libraries a living hub
 
For the benefit of all
For the benefit of allFor the benefit of all
For the benefit of all
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur Data
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with Data
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Digitization and public libraries
Digitization and public librariesDigitization and public libraries
Digitization and public libraries
 
Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011
 

Mehr von mlascarides

Metrics Analysis for GLAMs
Metrics Analysis for GLAMsMetrics Analysis for GLAMs
Metrics Analysis for GLAMsmlascarides
 
Papers Past - A Redesign Case Study
Papers Past - A Redesign Case StudyPapers Past - A Redesign Case Study
Papers Past - A Redesign Case Studymlascarides
 
Digital Strategy for Cultural Heritage Institutions
Digital Strategy for Cultural Heritage InstitutionsDigital Strategy for Cultural Heritage Institutions
Digital Strategy for Cultural Heritage Institutionsmlascarides
 
Design for Crowdsourcing
Design for CrowdsourcingDesign for Crowdsourcing
Design for Crowdsourcingmlascarides
 
2011 National Digital Forum of New Zealand - Keynote
2011 National Digital Forum of New Zealand - Keynote2011 National Digital Forum of New Zealand - Keynote
2011 National Digital Forum of New Zealand - Keynotemlascarides
 
The Emotional Design of Libraries
The Emotional Design of LibrariesThe Emotional Design of Libraries
The Emotional Design of Librariesmlascarides
 
A Digital LIbrarian's Toolkit
A Digital LIbrarian's ToolkitA Digital LIbrarian's Toolkit
A Digital LIbrarian's Toolkitmlascarides
 

Mehr von mlascarides (8)

Metrics Analysis for GLAMs
Metrics Analysis for GLAMsMetrics Analysis for GLAMs
Metrics Analysis for GLAMs
 
Papers Past - A Redesign Case Study
Papers Past - A Redesign Case StudyPapers Past - A Redesign Case Study
Papers Past - A Redesign Case Study
 
Digital Strategy for Cultural Heritage Institutions
Digital Strategy for Cultural Heritage InstitutionsDigital Strategy for Cultural Heritage Institutions
Digital Strategy for Cultural Heritage Institutions
 
Design for Crowdsourcing
Design for CrowdsourcingDesign for Crowdsourcing
Design for Crowdsourcing
 
2011 National Digital Forum of New Zealand - Keynote
2011 National Digital Forum of New Zealand - Keynote2011 National Digital Forum of New Zealand - Keynote
2011 National Digital Forum of New Zealand - Keynote
 
The Emotional Design of Libraries
The Emotional Design of LibrariesThe Emotional Design of Libraries
The Emotional Design of Libraries
 
A Digital LIbrarian's Toolkit
A Digital LIbrarian's ToolkitA Digital LIbrarian's Toolkit
A Digital LIbrarian's Toolkit
 
HathiTrust
HathiTrustHathiTrust
HathiTrust
 

Kürzlich hochgeladen

2024: The FAR, Federal Acquisition Regulations, Part 32
2024: The FAR, Federal Acquisition Regulations, Part 322024: The FAR, Federal Acquisition Regulations, Part 32
2024: The FAR, Federal Acquisition Regulations, Part 32JSchaus & Associates
 
Dating Call Girls inBaloda Bazar Bhatapara 9332606886Call Girls Advance Cash...
Dating Call Girls inBaloda Bazar Bhatapara  9332606886Call Girls Advance Cash...Dating Call Girls inBaloda Bazar Bhatapara  9332606886Call Girls Advance Cash...
Dating Call Girls inBaloda Bazar Bhatapara 9332606886Call Girls Advance Cash...kumargunjan9515
 
Financing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCCFinancing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCCNAP Global Network
 
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...kajalverma014
 
Time, Stress & Work Life Balance for Clerks with Beckie Whitehouse
Time, Stress & Work Life Balance for Clerks with Beckie WhitehouseTime, Stress & Work Life Balance for Clerks with Beckie Whitehouse
Time, Stress & Work Life Balance for Clerks with Beckie Whitehousesubs7
 
NAP Expo - Delivering effective and adequate adaptation.pptx
NAP Expo - Delivering effective and adequate adaptation.pptxNAP Expo - Delivering effective and adequate adaptation.pptx
NAP Expo - Delivering effective and adequate adaptation.pptxNAP Global Network
 
Finance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCCFinance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCCNAP Global Network
 
NGO working for orphan children’s education
NGO working for orphan children’s educationNGO working for orphan children’s education
NGO working for orphan children’s educationSERUDS INDIA
 
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girls
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girlsPakistani Call girls in Sharjah 0505086370 Sharjah Call girls
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girlsMonica Sydney
 
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...31st World Press Freedom Day - A Press for the Planet: Journalism in the face...
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...Christina Parmionova
 
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'IsraëlAntisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'IsraëlEdouardHusson
 
Honasa Consumer Limited Impact Report 2024.pdf
Honasa Consumer Limited Impact Report 2024.pdfHonasa Consumer Limited Impact Report 2024.pdf
Honasa Consumer Limited Impact Report 2024.pdfSocial Samosa
 
Competitive Advantage slide deck___.pptx
Competitive Advantage slide deck___.pptxCompetitive Advantage slide deck___.pptx
Competitive Advantage slide deck___.pptxScottMeyers35
 
Unique Value Prop slide deck________.pdf
Unique Value Prop slide deck________.pdfUnique Value Prop slide deck________.pdf
Unique Value Prop slide deck________.pdfScottMeyers35
 
World Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - PosterWorld Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - PosterChristina Parmionova
 
The NAP process & South-South peer learning
The NAP process & South-South peer learningThe NAP process & South-South peer learning
The NAP process & South-South peer learningNAP Global Network
 
2024: The FAR, Federal Acquisition Regulations, Part 31
2024: The FAR, Federal Acquisition Regulations, Part 312024: The FAR, Federal Acquisition Regulations, Part 31
2024: The FAR, Federal Acquisition Regulations, Part 31JSchaus & Associates
 
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgergMadhuKothuru
 

Kürzlich hochgeladen (20)

tOld settlement register shouldnotaffect BTR
tOld settlement register shouldnotaffect BTRtOld settlement register shouldnotaffect BTR
tOld settlement register shouldnotaffect BTR
 
2024: The FAR, Federal Acquisition Regulations, Part 32
2024: The FAR, Federal Acquisition Regulations, Part 322024: The FAR, Federal Acquisition Regulations, Part 32
2024: The FAR, Federal Acquisition Regulations, Part 32
 
Dating Call Girls inBaloda Bazar Bhatapara 9332606886Call Girls Advance Cash...
Dating Call Girls inBaloda Bazar Bhatapara  9332606886Call Girls Advance Cash...Dating Call Girls inBaloda Bazar Bhatapara  9332606886Call Girls Advance Cash...
Dating Call Girls inBaloda Bazar Bhatapara 9332606886Call Girls Advance Cash...
 
Financing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCCFinancing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCC
 
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...
Cheap Call Girls In Hyderabad Phone No 📞 9352988975 📞 Elite Escort Service Av...
 
Time, Stress & Work Life Balance for Clerks with Beckie Whitehouse
Time, Stress & Work Life Balance for Clerks with Beckie WhitehouseTime, Stress & Work Life Balance for Clerks with Beckie Whitehouse
Time, Stress & Work Life Balance for Clerks with Beckie Whitehouse
 
NAP Expo - Delivering effective and adequate adaptation.pptx
NAP Expo - Delivering effective and adequate adaptation.pptxNAP Expo - Delivering effective and adequate adaptation.pptx
NAP Expo - Delivering effective and adequate adaptation.pptx
 
Finance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCCFinance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCC
 
NGO working for orphan children’s education
NGO working for orphan children’s educationNGO working for orphan children’s education
NGO working for orphan children’s education
 
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girls
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girlsPakistani Call girls in Sharjah 0505086370 Sharjah Call girls
Pakistani Call girls in Sharjah 0505086370 Sharjah Call girls
 
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...31st World Press Freedom Day - A Press for the Planet: Journalism in the face...
31st World Press Freedom Day - A Press for the Planet: Journalism in the face...
 
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'IsraëlAntisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
 
Honasa Consumer Limited Impact Report 2024.pdf
Honasa Consumer Limited Impact Report 2024.pdfHonasa Consumer Limited Impact Report 2024.pdf
Honasa Consumer Limited Impact Report 2024.pdf
 
Panchayath circular KLC -Panchayath raj act s 169, 218
Panchayath circular KLC -Panchayath raj act s 169, 218Panchayath circular KLC -Panchayath raj act s 169, 218
Panchayath circular KLC -Panchayath raj act s 169, 218
 
Competitive Advantage slide deck___.pptx
Competitive Advantage slide deck___.pptxCompetitive Advantage slide deck___.pptx
Competitive Advantage slide deck___.pptx
 
Unique Value Prop slide deck________.pdf
Unique Value Prop slide deck________.pdfUnique Value Prop slide deck________.pdf
Unique Value Prop slide deck________.pdf
 
World Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - PosterWorld Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - Poster
 
The NAP process & South-South peer learning
The NAP process & South-South peer learningThe NAP process & South-South peer learning
The NAP process & South-South peer learning
 
2024: The FAR, Federal Acquisition Regulations, Part 31
2024: The FAR, Federal Acquisition Regulations, Part 312024: The FAR, Federal Acquisition Regulations, Part 31
2024: The FAR, Federal Acquisition Regulations, Part 31
 
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg
2024 asthma jkdjkfjsdklfjsdlkfjskldfgdsgerg
 

Chasing the Fifth Star - Open Data at the National Library of NZ

  • 1. CHASING 
 THE FIFTH STAR
 Open data at the National Library Michael Lascarides User Experience Lead, DigitalNZ @mlascarides Kia ora, I am Michael Lascarides. I’m the User Experience Lead at the National Library of New Zealand, where I work as part of the DigitalNZ team. We make web, mobile, and data interfaces for all kinds of folks to use, from professional and academic researchers to the generally curious, as well as our own staff. I’d like to share a little bit about a few of the ways our institution creates, uses, and shares our collections data so that folks like you can turn it into something wonderful.
  • 2. @mlascarides on twitter FYI: There are a lot of links in this talk, but I’m going to move pretty quickly past most of them. If you want to download a copy of this talk, I’ve posted it to Twitter, where you can also ask me anything
  • 3. You can also request pictures of my new puppy. (Sorry.)
  • 4. OUR LIBRARY In case you’re unsure or have forgotten just what a National Library does, here’s a quick overview.
  • 5. By Pear285 (Own work) [CC BY-SA 4.0 (http://creativecommons.org/licenses/by-sa/4.0)], via Wikimedia Commons The National Library is New Zealand's legal deposit library. The Act of Parliament that created us charges us with a mission to "enrich the cultural and economic life of New Zealand and its interchanges with other nations”. We have, roughly speaking, three main collections: the General Collections (encompassing the Legal deposit services), the Schools Collection (including the largest collection of children’s books in the southern hemisphere), and the collections of the Alexander Turnbull Library, predominantly comprised of unpublished materials such as manuscripts and photographs.
  • 6. 836K unpublished 1.4M published 30M items We currently have over 800 thousand items in the catalogue of unpublished materials, 1.4 million in the published, and over 30 million items searchable on our web site, which are mostly individual digitised newspaper articles from our Papers Past service.
  • 7. We’ve got books, maps, photographs, recorded music, music scores, newspapers, periodicals, manuscripts, letters, paintings, artefacts, manuals, and more.
  • 8. HOW WE ORGANISE OUR DIGITAL STUFF DIGITAL PRESERVATION (NDHA) PUBLISHED CATALOGUE UNPUBLISHED CATALOGUE (TIAKI) DATABASES & INDEXES FULL-TEXT DIGITAL OBJECTS METADATA SERVICE (DIGITALNZ API) OTHER INSTITUTIONS DIGITALNZ.ORG * GREATLY SIMPLIFIED PHYSICAL COLLECTIONS SUBSCRIPTION SERVICES NATLIB.GOVT.NZ PAPERS PAST This is a very rough diagram of what our digital world looks like. On the bottom, there’s the actual collections, in physical and digital forms, with layers of catalogues and databases just above in blue. We deliver materials to the world through three main web sites, in red at the top: The National Library site, Papers Past and DigitalNZ. In between is our metadata service, the DigitalNZ API, which is the secret sauce we use to create ties within our collections and to those in other institutions. We’ll look more closely at these in a moment.
  • 9. GOOGLE “NATLIB STRATEGY 2030” We recently created and published our new guiding strategy, which looks ahead to the year 2030. The basic strategy fits on a single slide… https://natlib.govt.nz/about-us/strategy-and-policy/strategic-directions
  • 10. New Zealanders will… …trust that their documentary heritage and TAONGA are collected, preserved and accessible …easily access, share and use New Zealand’s KNOWLEDGE resources …have the LITERACY skills to achieve social, educational and employment success and be inspired …to innovate and create new knowledge. …and I think it’s a pretty good framework for this talk in front of this particular audience. We are going to preserve the nation’s documentary heritage, build a knowledge network around it, and ensure that the country has the literacy (including digital literacy!) to make full use of it. https://natlib.govt.nz/about-us/strategy-and-policy/strategic-directions
  • 11. ❤🔬 (WE LOVE RESEARCHERS) All of our collections are utterly meaningless if people don’t use them. So we’re always keen to have as many people as possible exploring, interrogating, and reusing our collections. There is a lot of collaboration and co-creation implied in that strategy, so let’s be on this journey together.
  • 12. OPEN DATA AT NLNZ One of the ways we try to encourage co-creation and collaboration is to be as open as possible.
  • 13. As a government agency, we strive to release our data in under open licenses https://www.ict.govt.nz/guidance-and-resources/open-government/new-zealand-government-open-access-and-licensing-nzgoal-framework/
  • 14. and employ the best practices we can when sharing that data. https://www.data.govt.nz/toolkit/open-data-in-new-zealand/open-data-nz/
  • 15. But beyond the basic governmental requirements to be openly available, we aspire to be as interoperable and interconnected as possible. The five-star scale promoted by Tim Berners-Lee is still the standard in this regard. http://5stardata.info/en/
  • 17. ⭐ AVAILABLE ⭐⭐ STRUCTURED ⭐⭐⭐ OPEN ⭐⭐⭐⭐ PERMANENT ⭐⭐⭐⭐⭐ CONNECTED This would be my summary of what that means.
  • 18. #lodlam That elusive 5th star is the “Linked” in “Linked Open Data”. If you’re new to these concepts and you find them interesting, a great hashtaggable term to follow on your fave social media is LODLAM, which is Linked Open Data for Libraries and Museums. Doing so will connect you to a lovely, smart and interesting community of people, some of whom are in this room.
  • 19. natlib.govt.nz/about-us/open-data We’ve gathered the open data sets that we have available at the National Library on a single page so that you can easily get your hands on them.
  • 20. OPEN DATA SETS AVAILABLE Data sets Format ⭐? PublicationsNZ, IndexNZ, Te Puna Web Directory, Māori subject headings CSV, MARC ⭐⭐⭐ Turnbull Library unpublished collections metadata, Iwi/Hapu Names list XML ⭐⭐⭐ DigitalNZ Metadata, Papers Past Metadata, Turnbull Library Metadata API (JSON) ⭐⭐⭐⭐ A lot of our open data sets are collections data, and we’re doing all right on the 5-star scale, with mostly threes and fours. But we have a couple of collections that run quite a bit deeper.
  • 21. PAPERS PAST The first of these is Papers Past.
  • 22. Papers Past is the site where we deliver our full-text digitised materials.
  • 23. We started with newspapers, and there are over 4 million New Zealand newspaper pages from 1839 to 1949. They’re scanned and automatically transcribed via optical character recognition, so they are full-text searchable.
  • 24. It has been expanded to include more than a million pages of magazines,
  • 28. We’ve had researchers mine Papers Past for everything from linguistic analysis training data, to tracking the history of political propaganda, to using old weather reports to chart historical climate change. (This is a 1912 article about man-made climate change, by the way). And if you’re more maths-y, computer science-y, there’s opportunities to help us improve machine transcriptions, extract entities like names and places from texts, and a whole lot more.
  • 29. WANT BULK DATA? Just ask! Four million articles 
 from 73 titles 
 available up to 1878 So, the Papers Past web site is an amazing resource for researchers in its own right. But we often get requests from people who want copies of our raw data. Doing so previously had been very tricky due to the complexity of copyright—you’d be amazed how many newspaper companies from the 19th century are still around. But we’ve cleared all the hurdles to release the raw data for newspapers up to 1878. It’s a small part of the collection, but even this small part of Papers Past includes four million articles.
  • 30. DIGITALNZ The other deep, rich digital resource we maintain is DigitalNZ.
  • 31. digitalnz.org DigitalNZ is our service that collects the metadata from cultural heritage organisations in New Zealand, and those worldwide that have New Zealand-related content.
  • 32. digitalnz.org This year marks our 10th birthday!
  • 33. We harvest the metadata for over 30 million items from more than 200 institutions, map it to a standard format, and make them all discoverable from a single search. While we use this data to power the DigitalNZ web site, which is our web front end to the aggregated collection, the real star of the show is the DigitalNZ API, our machine-readable metadata service.
  • 34. Anyone who is interested can get a developer key from our website and start hacking with our data. You can build your own products, or automate your research. And of course, most of the National Library’s own collections are available through the service.
  • 35. We’ve recently introduced a feature called Stories, which lets you assemble items from across the DigitalNZ content partners’ collections and weave them together with your own narratives. Or, if you’re feeling less-inspired, you can just use a story as a way to organise your research.
  • 36. The leading edge of our work with DigitalNZ is getting us really close to that fifth star.
  • 37. Concepts API Moving towards 5 Star Linked Open Data We’ve recently introduced the Concepts API, which allows you to interrogate our collections for items related to specific places or specific people, rather than just keyword searches. Overview: https://digitalnz.org/blog/posts/introducing-the-digitalnz-concepts-api Documentation: http://digitalnz.github.io/supplejack/api_usage/concepts-api.html
  • 38. https://digitalnz.org/concepts/4062 For the first time, you can see concepts in action on our recently redesigned website as the Explore Places feature, where we offer a permanent link to each Place concept along with all of the items in our collections that we determine to be related to that place.
  • 39. http://digitalnz.github.io/supplejack/ It’s also worth noting that we have freely released the software that powers DigitalNZ as an open source project, so if you’ve got a big metadata harvesting job of your own, you can benefit from our 10 years of blood, sweat, and tears.
  • 40. WHAT ARE OUR INTERESTING PROBLEMS? I’d like to close with a few of the problems we are working on, which should give you a sense of what we’re thinking about and where we’re headed, but just possibly also spark some ideas for collaboration with some of you in the future.
  • 41. How do we connect our stuff to other peoples’ stuff? (aka ⭐⭐⭐⭐⭐) Understanding the tools. Liaison with other institutions. Doing the work. Going from concept to production.
  • 42. How do we scale up? Fighting technical debt and scaling issues. Brewster Kahle’s incitement to digitize everything in NZ.
  • 43. How do we get people involved? More content partners. Promoting re-use. Promoting our open source tools. General marketing. Making tools easy. Educating people in digital literacy. Breaking down barriers.
  • 44. How do know what cool things people are m making with our stuff? Measuring the impact we have on New Zealand and the world is a HARD. PROBLEM. If you build something with our stuff, it is immensely useful to us (and immensely persuasive to the folks who allot our funding) if you let us know about it.
  • 45. TAKE OUR STUFF. MAKE SOMETHING WONDERFUL WITH IT. So please go out and do it.
  • 46. Thank you! Michael Lascarides User Experience Lead, DigitalNZ @mlascarides