“Choir attempted that beautiful
anthem “Oh, Radiant Morn” –
made a hash of it”
Making a hash of the Adkin Diary transcriptions
Adrian Kingston
Collections Information Manager, Digital Assets and Development
Museum of New Zealand Te Papa Tongarewa
@adriankingston
Crowdsourcing for the Digital Humanities and Cultural Heritage Sector
Victoria University of Wellington, 23 April 2013
Wed. Apr. 23.
Worked at Swamp–Cow p[addock] fence. Bulliman took
48 heavy fat ewes at 15/-. In evening Father + I drove
down to Levin No L[icense] Democratic Vote Campaign
committee meeting. Father voted to chair + self
appointed secretary. Discussed campaign.
Background
• George Leslie Adkin: farmer, photographer, geologist, explorer, archaeologist, ethnologist.
• 1 man, 41 diaries, 59 years, over 21,000 days
• Thousands of negatives and prints, some albums
• Initial deadline: launch of @life100yearsago, a project of WW100
• Did everything ourselves. We resourced most of this project with a curator (Kirstie Ross) and a monkey with a keyboard
• Figure out process (imaging, cropping, loading, transcription guidelines)
• Figure out content (data structure, quirks of Adkin, glossaries etc.)
• Project? What project?
• Very early days.
Process
• Assess album condition
• Photograph album pages
• Crop pages to days
• Create narrative for day
• Load “day” images to EMu “day” narrative
• Transcribe
• Add associated subjects, people, places (from authority files and controlled vocabularies)
• Add context to narrative entries for month
• Some parts semi-automated, some completely manual; some need no special skills, others do
Received a letter + referee’s report from Dr
Chilton, Editor “Trans[actions of the] NZ
Inst[itute], on my paper on Tararuas = “my
theories based on too slender evidence and
debatable evidence + also in part erroneous (?
GLA). I decided to withdraw the paper as it is
evidently unsuitable for publication in
“Transactions”
http://collections.tepapa.govt.nz/theme.aspx?irn=4294
Framework
• Using existing framework: EMu, Collections Online
• CIDOC CRM for building and expressing relationships
• Days are conceptual entities, not physical. Framework allows for this
• Links to physical entities: diaries, photographs, albums
• Links to people, places, topics
• However, the scale of the content is really starting to highlight issues of display in Collections Online.
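A minimal sketch of the "day as conceptual entity" idea above, not the actual EMu or CIDOC CRM schema. The class names, field names and the `hypothetical-diary-001` identifier are assumptions made up for illustration; the sample values come from the Wed. Apr. 23 entry.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class PhysicalObject:
    """A physical item a day links to (assumed shape, not EMu's)."""
    kind: str        # "diary", "photograph" or "album"
    identifier: str  # hypothetical identifier, for illustration only


@dataclass
class Day:
    """A conceptual entity: one diary day, linked to things and terms."""
    label: str
    transcription: str = ""
    physical: List[PhysicalObject] = field(default_factory=list)
    people: List[str] = field(default_factory=list)
    places: List[str] = field(default_factory=list)
    topics: List[str] = field(default_factory=list)


def days_mentioning(days: List[Day], place: str) -> List[Day]:
    """Return the days whose place links include the given place."""
    return [day for day in days if place in day.places]


entry = Day(
    label="Wed. Apr. 23.",
    transcription="Worked at Swamp–Cow p[addock] fence.",
    physical=[PhysicalObject("diary", "hypothetical-diary-001")],
    people=["Bulliman"],
    places=["Levin"],
)

print([day.label for day in days_mentioning([entry], "Levin")])
```

The point of the sketch is that the day holds no physical content itself; it only carries links, which is what lets one diary page, several photographs and a set of authority terms hang off the same entry.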
What we’ve learnt
• So much content, so much data
• More than just one man’s story, a huge data source on NZ life
• So much potential for a number of fields of research
• Our existing data structure works really well
• Transcription only one part
• To get most out of the content, need the links, need the rich conceptual model
• Context needed, or at least useful, for the reader
• Existing display not so hot
• Enlivens the collection, a step beyond just digitisation and transcription
Issues
• Size of the project is daunting, but the transcription seems manageable to do through crowdsourcing
• There are a number of existing platforms that look great, but how to deal with matching to our structure, vocabularies, authorities?
• Could use automated in-text authority mining, but would need to then match back to authorities and structure
• Beyond scope of crowdsourcing? But does that diminish the value of the “data”?
• Could come later though; are we getting too hung up on quality?
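One way the "match back to authorities" step above could look, as a rough sketch only. The authority structure (a preferred term mapped to variant spellings), the function name and the sample entries are all assumptions, not Te Papa's authority files; the sample sentence is from the Kirkcaldie + Stains diary entry.

```python
import re
from typing import Dict, List


def match_authorities(text: str, authority: Dict[str, List[str]]) -> List[str]:
    """Return preferred terms whose preferred or variant form appears in text.

    Matching is whole-word and case-insensitive. Forms that start or end
    with punctuation would need different boundary handling.
    """
    found = []
    for preferred, variants in authority.items():
        for form in [preferred, *variants]:
            pattern = r"\b" + re.escape(form) + r"\b"
            if re.search(pattern, text, flags=re.IGNORECASE):
                found.append(preferred)
                break  # one hit is enough for this preferred term
    return found


# Hypothetical authority entries, for illustration only.
authority = {
    "Kirkcaldie & Stains": ["Kirkcaldie + Stains", "Kirkcaldie and Stains"],
    "Levin": [],
}

text = "I piloted the party to Kirkcaldie + Stains where we had a good dinner"
print(match_authorities(text, authority))
```

This only finds terms that are already in the authority list. Names the list has never seen would still need a person to review them, which is part of the scope question raised above.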
Our potential crowd
• By starting it ourselves, we have some content available to promote the crowdsourcing.
• Already had unsolicited volunteers
• The content is interesting: NZ history, early 20th-century courtship, farming, geology, religion, war, politics, weather…
• Horowhenua locals interested in local history, and one of their famous sons
• History students and educators
• Bring students closer to primary material, work with cursive handwriting, highlight the importance of accuracy in relation to data, personal biography
• Learning history through a first-hand account
• Plan B is to do the war years with interns
We decided to go into town to lunch so I piloted
the party to Kirkcaldie + Stains where we had a
good dinner… Will wanted to know if one could
have all the courses for 2/-. I told him it was not
customary to indulge in more than six but that if
he wanted to tackle the lot we would have to
leave him at it. Olive ordered dishes she did not
want + Alice also got a bit mixed up.
http://collections.tepapa.govt.nz/theme.aspx?irn=4095
Where to
• Can’t do with existing (human) resource
• Transcription only one part of the project
• Need to figure out which parts need to be crowdsourced and which can’t be
• Transcription will enable adding the contextual and semantic relationships and links to other sources
• Options for automating the above
• Or, with a focussed crowd and a finite project, maybe we don’t need a new platform; could provide training and use existing tools
• Can’t crowdsource the display platform. Or can we? Crowdfund it?
• Make data available for analysis, visualisation, research, fun
• Need to formalise the project
• Lots to figure out
In evening rode down to see Maud – showed
her some books but there seemed to be a lack
of sympathy between us + the evening was a
failure.
http://collections.tepapa.govt.nz/theme.aspx?irn=4080
See
• Adkin diaries on Collections Online
• @adkin_diary on Twitter
• @life100yearsago on Twitter
Questions?
• Kirstie Ross, Curator Modern New Zealand
• Adrian Kingston, Collections Information Manager
• Philip Edgar, Manager Digital Collections and Access
