SlideShare ist ein Scribd-Unternehmen logo
1 von 18
“Choir attempted that beautiful
anthem “Oh, Radiant Morn” –
made a hash of it”
Making a hash of the Adkin Diary transcriptions
Adrian Kingston
Collections Information Manager, Digital Assets and Development
Museum of New Zealand Te Papa Tongarewa
@adriankingston
Crowdsourcing for the Digital Humanities and Cultural Heritage Sector
Victoria University of Wellington, 23 April 2013
Wed. Apr. 23.
Worked at Swamp–Cow p[addock] fence. Bulliman took
48 heavy fat ewes at 15/-. In evening Father + I drove
down to Levin No L[icense] Democratic Vote Campaign
committee meeting. Father voted to chair + self
appointed secretary. Discussed campaign.
Background
 George Leslie Adkin; Farmer, photographer, geologist, explorer,
archaeologist, ethnologist.
 1 man, 41 diaries, 59 years, Over 21000 days
 Thousands of negatives and prints, some albums
 Initial deadline, launch of @life100yearsago ,a project of
WW100
 Did everything ourselves. We resourced most of this project
with a curator (Kirstie Ross) and a monkey with a keyboard
 Figure out process (imaging, cropping, loading, transcription
guidelines),
 Figure out content (data structure, quirks of Adkin, glossaries
etc.)
 Project? What project?
 Very early days.
Process
 Assess album condition
 Photograph album pages
 Crop pages to days
 Create narrative for day
 Load “day” images to EMu “day” narrative
 Transcribe
 Add associated subjects, people, places (from authority files
and controlled vocabularies)
 Add context to narrative entries for month
 Some parts semi-automated, some completely manual; some
need no special skills, others do
Received a letter + referee’s report from Dr
Chilton, Editor “Trans[actions of the] NZ
Inst[itute], on my paper on Tararuas = “my
theories based on too slender evidence and
debatable evidence + also in part erroneous (?
GLA). I decided to withdraw the paper as it is
evidently unsuitable for publication in
“Transactions”
http://collections.tepapa.govt.nz/theme.aspx?irn=4294
Framework
 Using existing framework; EMu, Collections Online
 CIDOC CRM for building and expressing relationships
 Days are conceptual entities, not physical. Framework allows
for this
 Links to physical entities, diaries, photographs, albums
 Links to people, places, topics
 However, scale of content of really starting to highlight issues of
display in Collections Online.
What we’ve learnt
 So much content, so much data
 More than just one man’s story, a huge data source on NZ life
 So much potential for a number of fields of research
 Our existing data structure works really well
 Transcription only one part
 To get most out of the content, need the links, need the rich
conceptual model
 Context needed, or at least useful, for the reader
 Existing display not so hot
 Enlivens the collection, a step beyond just digitisation and
transcription
Issues
 Size of the project is daunting, but the transcription seems
manageable to do through crowdsourcing
 There are a number of existing platforms that look great, but
how to deal with matching to our structure, vocabularies,
authorities?
 Could use automated in text authority mining, but would need
to then match back to authorities and structure
 Beyond scope of crowdsourcing? But does that diminish the
value of the “data”?
 Could come later though, are we getting too hung up on
quality?
Our potential crowd
 By starting it ourselves, we have some content available to
promote the crowdsourcing.
 Already had unsolicited volunteers
 The content is interesting: NZ history, early 20th Century
courtship, farming, geology, religion, war, politics, weather…
 Horowhenua locals interested in local history, and one of their
famous sons
 History students and educators
 Bring students closer to primary material, work with cursive
handwriting, highlight the importance of accuracy in relation to
data, personal biography
 Learning history through a first hand account
 Plan B is do war years with interns
We decided to go into town to lunch so I piloted
the party to Kirkcaldie + Stains where we had a
good dinner… Will wanted to know if one could
have all the courses for 2/-. I told him it was not
customary to indulge in more than six but that if
he wanted to tackle the lot we would have to
leave him at it. Olive ordered dishes she did not
want + Alice also got a bit mixed up.
http://collections.tepapa.govt.nz/theme.aspx?irn=4095
Where to
 Can’t do with existing (human) resource
 Transcription only one part of the project
 Need to figure what parts need to be crowdsourced, what can’t
 Transcription will enable the adding the contextual and semantic
relationships and links to other sources
 Options for automating the above
 Or, with a focussed crowd and a finite project, maybe we don’t need
a new platform, could provide training and use existing tools
 Can’t crowdsource the display platform. Or can we? Crowdfund it?
 Make data available for analysis, visualisation, research, fun
 Need to formalise the project
 Lots to figure out
In evening rode down to see Maud – showed
her some books but there seemed to be a lack
of sympathy between us + the evening was a
failure.
http://collections.tepapa.govt.nz/theme.aspx?irn=4080
See
 Adkin diaries of Collections Online
 @adkin_diary on Twitter
 @life100yearsago on Twitter
Questions?
 Kirstie Ross, Curator Modern New Zealand
 Adrian Kingston, Collections Information Manager
 Philip Edgar, Manager Digital Collections and Access

Weitere ähnliche Inhalte

Ähnlich wie Making a hash of the Adkin Diary transcriptions

How to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning PresentationsHow to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning Presentations
Joaquim Jorge
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + Context
Stefan Gradmann
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Olaf Janssen
 

Ähnlich wie Making a hash of the Adkin Diary transcriptions (20)

Data+Design
Data+DesignData+Design
Data+Design
 
Repackaging research AASL 2013
Repackaging research AASL 2013Repackaging research AASL 2013
Repackaging research AASL 2013
 
Get An Overall Idea of Digital Humanities
Get An Overall Idea of Digital HumanitiesGet An Overall Idea of Digital Humanities
Get An Overall Idea of Digital Humanities
 
Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
 
SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015
 
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your LibraryNCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
 
Dh presentation 2019
Dh presentation 2019Dh presentation 2019
Dh presentation 2019
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
 
How to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning PresentationsHow to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning Presentations
 
Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)
 
Future of semantic apps
Future of semantic appsFuture of semantic apps
Future of semantic apps
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + Context
 
Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"
 
Steve Knight by Design
Steve Knight by DesignSteve Knight by Design
Steve Knight by Design
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
Tribal libraries and archives panel session - NWILL, September 2021
Tribal libraries and archives  panel session - NWILL, September 2021Tribal libraries and archives  panel session - NWILL, September 2021
Tribal libraries and archives panel session - NWILL, September 2021
 
Digital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & TriumphDigital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & Triumph
 
Computationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceComputationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and Space
 

Mehr von donellemckinley

McKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcingMcKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcing
donellemckinley
 
PhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website designPhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website design
donellemckinley
 
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
donellemckinley
 

Mehr von donellemckinley (12)

McKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcingMcKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcing
 
McLean-letters
McLean-lettersMcLean-letters
McLean-letters
 
PhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website designPhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website design
 
Evaluating crowdsourcing websites
Evaluating crowdsourcing websitesEvaluating crowdsourcing websites
Evaluating crowdsourcing websites
 
Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ
 
This is not a penis: User-generated tags
This is not a penis: User-generated tagsThis is not a penis: User-generated tags
This is not a penis: User-generated tags
 
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud:  Collaborative Frameworks for Virtual DH ProjectsCrowd in the Cloud:  Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
 
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake contentUC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
 
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
 
Lessons from Transcribe Bentham
Lessons from Transcribe BenthamLessons from Transcribe Bentham
Lessons from Transcribe Bentham
 
Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)
 
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Making a hash of the Adkin Diary transcriptions

  • 1. “Choir attempted that beautiful anthem “Oh, Radiant Morn” – made a hash of it” Making a hash of the Adkin Diary transcriptions Adrian Kingston Collections Information Manager, Digital Assets and Development Museum of New Zealand Te Papa Tongarewa @adriankingston Crowdsourcing for the Digital Humanities and Cultural Heritage Sector Victoria University of Wellington, 23 April 2013
  • 2.
  • 3. Wed. Apr. 23. Worked at Swamp–Cow p[addock] fence. Bulliman took 48 heavy fat ewes at 15/-. In evening Father + I drove down to Levin No L[icense] Democratic Vote Campaign committee meeting. Father voted to chair + self appointed secretary. Discussed campaign.
  • 4. Background  George Leslie Adkin; Farmer, photographer, geologist, explorer, archaeologist, ethnologist.  1 man, 41 diaries, 59 years, Over 21000 days  Thousands of negatives and prints, some albums  Initial deadline, launch of @life100yearsago ,a project of WW100  Did everything ourselves. We resourced most of this project with a curator (Kirstie Ross) and a monkey with a keyboard  Figure out process (imaging, cropping, loading, transcription guidelines),  Figure out content (data structure, quirks of Adkin, glossaries etc.)  Project? What project?  Very early days.
  • 5. Process  Assess album condition  Photograph album pages  Crop pages to days  Create narrative for day  Load “day” images to EMu “day” narrative  Transcribe  Add associated subjects, people, places (from authority files and controlled vocabularies)  Add context to narrative entries for month  Some parts semi-automated, some completely manual; some need no special skills, others do
  • 6. Received a letter + referee’s report from Dr Chilton, Editor “Trans[actions of the] NZ Inst[itute], on my paper on Tararuas = “my theories based on too slender evidence and debatable evidence + also in part erroneous (? GLA). I decided to withdraw the paper as it is evidently unsuitable for publication in “Transactions” http://collections.tepapa.govt.nz/theme.aspx?irn=4294
  • 7. Framework  Using existing framework; EMu, Collections Online  CIDOC CRM for building and expressing relationships  Days are conceptual entities, not physical. Framework allows for this  Links to physical entities, diaries, photographs, albums  Links to people, places, topics  However, scale of content of really starting to highlight issues of display in Collections Online.
  • 8.
  • 9.
  • 10. What we’ve learnt  So much content, so much data  More than just one man’s story, a huge data source on NZ life  So much potential for a number of fields of research  Our existing data structure works really well  Transcription only one part  To get most out of the content, need the links, need the rich conceptual model  Context needed, or at least useful, for the reader  Existing display not so hot  Enlivens the collection, a step beyond just digitisation and transcription
  • 11.
  • 12. Issues  Size of the project is daunting, but the transcription seems manageable to do through crowdsourcing  There are a number of existing platforms that look great, but how to deal with matching to our structure, vocabularies, authorities?  Could use automated in text authority mining, but would need to then match back to authorities and structure  Beyond scope of crowdsourcing? But does that diminish the value of the “data”?  Could come later though, are we getting too hung up on quality?
  • 13.
  • 14. Our potential crowd  By starting it ourselves, we have some content available to promote the crowdsourcing.  Already had unsolicited volunteers  The content is interesting: NZ history, early 20th Century courtship, farming, geology, religion, war, politics, weather…  Horowhenua locals interested in local history, and one of their famous sons  History students and educators  Bring students closer to primary material, work with cursive handwriting, highlight the importance of accuracy in relation to data, personal biography  Learning history through a first hand account  Plan B is do war years with interns
  • 15. We decided to go into town to lunch so I piloted the party to Kirkcaldie + Stains where we had a good dinner… Will wanted to know if one could have all the courses for 2/-. I told him it was not customary to indulge in more than six but that if he wanted to tackle the lot we would have to leave him at it. Olive ordered dishes she did not want + Alice also got a bit mixed up. http://collections.tepapa.govt.nz/theme.aspx?irn=4095
  • 16. Where to  Can’t do with existing (human) resource  Transcription only one part of the project  Need to figure what parts need to be crowdsourced, what can’t  Transcription will enable the adding the contextual and semantic relationships and links to other sources  Options for automating the above  Or, with a focussed crowd and a finite project, maybe we don’t need a new platform, could provide training and use existing tools  Can’t crowdsource the display platform. Or can we? Crowdfund it?  Make data available for analysis, visualisation, research, fun  Need to formalise the project  Lots to figure out
  • 17. In evening rode down to see Maud – showed her some books but there seemed to be a lack of sympathy between us + the evening was a failure. http://collections.tepapa.govt.nz/theme.aspx?irn=4080
  • 18. See  Adkin diaries of Collections Online  @adkin_diary on Twitter  @life100yearsago on Twitter Questions?  Kirstie Ross, Curator Modern New Zealand  Adrian Kingston, Collections Information Manager  Philip Edgar, Manager Digital Collections and Access