Crowdsourcing Cultural Heritage
UCL's Transcribe Bentham Project




Dr Melissa Terras
Senior Lecturer in Electronic Communication, UCL Dept of Information Studies
Deputy Director, UCL Centre for Digital Humanities
m.terras@ucl.ac.uk
Crowdsourcing Cultural Heritage



• Bentham and UCL
• Crowdsourcing
  – History and Ideas
  – Heritage and Culture
  – Features and Issues
• Transcribe Bentham
• Potentials and Problems
Jeremy Bentham (1748-1832)
• Jurist, philosopher, and legal and social reformer
• Leading theorist in Anglo-American philosophy of law
• Influenced the development of welfarism
• Advocated utilitarianism
• Animal rights
• Work on the “panopticon”

• Not founder of UCL, but...
• 60,000 folios in UCL Special Collections
• Auto-icon
The Bentham Project


             • http://www.ucl.ac.uk/Bentham-Project/
             • Since 1959
             • “aims to produce a new scholarly
               edition of the works and
               correspondence of Jeremy Bentham”
             • Twenty-six volumes of the new
               Collected Works have been published
             • Previous AHRC grant catalogued the
               manuscripts
                – http://www.benthampapers.ucl.ac.uk/
First 80 hours: 20,000 volunteers, 170,000 pages read.
Currently: 26,717 volunteers, 220,965 pages read. 237,867 to go
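As a rough worked example of the arithmetic behind these figures, the short Python sketch below derives the total page count, the share already read, and the mean number of pages per volunteer. The variable names are illustrative only, and the mean is an assumption-laden summary: it says nothing about how unevenly the work is spread across volunteers.

```python
# Figures as quoted on the slide above.
pages_read = 220_965
pages_to_go = 237_867
volunteers = 26_717

# Total workload is what has been read plus what remains.
total_pages = pages_read + pages_to_go

# Share of the workload completed so far.
fraction_done = pages_read / total_pages

# Simple mean; real contributions are likely far from uniform.
pages_per_volunteer = pages_read / volunteers

print(f"Total pages: {total_pages:,}")
print(f"Share read: {fraction_done:.1%}")
print(f"Mean pages per volunteer: {pages_per_volunteer:.1f}")
```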
Crowdsourcing



• neologistic portmanteau of “crowd” and
  “outsourcing”
• coined by Jeff Howe in a June 2006 Wired
  magazine article “The Rise of Crowdsourcing”
  – Group intelligence
  – Cheap computers + large crowds = useful
  – “It’s not outsourcing; it’s crowdsourcing.”
Technology and crowd-based research
• It is often those outside established institutions who
  have taken the lead in exploiting new technologies
   – Science in the 19th century
   – Classics, maths, black studies, astrophysics,
     oral history, women’s studies, contemporary
     history… all started outside established
     curricula
• Prizes for technological innovation
• Metal detectors/archaeology
• Binoculars/ ornithological fieldwork
• Cassette Recorders/ life history, oral history,
  language
• Telescopes/ astronomical research
Crowdsourcing tasks



•The harnessing of online activity to aid in large
scale projects that require human cognition
•Basic to complex tasks
   • Is this round or square? (yes/no)
   • Is this tag correct for this image?
   • Can you correct the OCR on this page?
Crowdsourcing: Potentials for heritage institutions

•   Achieving goals even with limited resources
•   Achieving goals faster
•   Build new virtual communities and user groups
•   Involve and engage the user community with collections
•   Utilising the knowledge, expertise and interest of the community
•   Improving the quality of data/resource (e.g. corrections), more accurate
    searching
•   Adding value to data (e.g. by addition of comments, tags, ratings, reviews).
•   Making data discoverable in different ways (e.g. by tagging).
•   Gain insight on user desires by asking and then listening to the crowd.
•   Demonstrating the value and relevance of the institution in the community
•   Strengthen and build trust and loyalty of collection users
•   Encourage a sense of public ownership and responsibility
•   Holley, R. (2010) “Crowdsourcing: How and Why Should Libraries Do It?”
    D-Lib Magazine http://www.dlib.org/dlib/march10/holley/03holley.html
Galaxy Zoo http://www.galaxyzoo.org/



• Online collaborative astronomy project
• The public assists in classifying millions of galaxies
  from digital photos taken by robots
• Released July 2007
• By August 2007 80,000 volunteers had classified
  10 million galaxies
• To date, more than 60 million galaxies classified
Australian Newspapers Digitisation Program
http://www.nla.gov.au/ndp/


• In 2007 the National Library of Australia began to
  digitise out-of-copyright newspapers
• However, the OCR quality of newsprint is poor
• Opened up the text to allow users to correct
  mistakes in the OCR
• 9000+ members of the public have so far
  corrected 12.5 million lines of newspaper text
Victoria and Albert Museum Crowdsourcing
http://collections.vam.ac.uk/crowdsourcing/


• “Search the Collections” contains 140,000 images,
  selected automatically from the database
• Many images are not the best view of an object
• Asking users to help find the best crops of images
• 28,375 images done in a year
Crowd sourced projects
• Picture Australia, National Library of Australia
   – http://www.pictureaustralia.org/
• Family Search Indexing
   – http://www.familysearch.org/eng/indexing/frameset_indexing.asp
• Free BMD
   – http://www.freebmd.org.uk/
• Distributed Proofreaders (Project Gutenberg)
   – http://www.pgdp.net/c/
• Papyri
   – Project at Oxford to use Galaxy Zoo software to help in classification of
     documentary fragments
• Wikipedia
  – http://www.wikipedia.org/
What do we know of Volunteers?
• Majority of work done by 10% of users
• Clay Shirky describes activity as 'cognitive surplus' time for
  social endeavours, rather than watching TV
• Personal interest
• Personal reward
• Community aspect
• A lot of interest from the retirement community, and from
  disabled and terminally ill individuals
• Many build up IT expertise as they volunteer
• “addictive”
• Help achieve group goal
• Like to be rewarded
Successful Crowdsourcing




Rose Holley's checklist for crowdsourcing:
http://www.dlib.org/dlib/march10/holley/03holley.html
Enter Transcribe Bentham

• 10,000 images of Bentham’s manuscripts
• Ask user community to transcribe these
  – Provide plain text
  – Or “Markup” in rudimentary TEI
     • Underline, deletions, insertions
• Generate a “Knowledge Bank” of ideas from the
  transcripts
• Link with existing catalogue and transcripts
• Make material more accessible to scholars
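The "rudimentary TEI" markup mentioned above can be sketched as follows. This is a minimal illustration, not the project's actual transcription tool: the helper function names and the sample sentence are invented for the example, and it assumes the standard TEI elements `<hi rend="underline">`, `<del>` and `<add>` for underlining, deletions and insertions.

```python
from xml.sax.saxutils import escape


def underline(text: str) -> str:
    """Mark text that is underlined in the manuscript."""
    return f'<hi rend="underline">{escape(text)}</hi>'


def deletion(text: str) -> str:
    """Mark text that the author struck through."""
    return f"<del>{escape(text)}</del>"


def insertion(text: str) -> str:
    """Mark text that the author added, e.g. above the line."""
    return f"<add>{escape(text)}</add>"


# Hypothetical sample line, made up for illustration only.
line = (
    "The greatest "
    + deletion("good")
    + insertion("happiness")
    + " of the "
    + underline("greatest number")
)

print(f"<p>{line}</p>")
```

A volunteer who prefers plain text would simply type the final reading; the markup adds a record of how the author revised the page.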
Plan



•   Soft launch end of June
•   Full launch early July
•   In process of user testing and creation of system
•   Two full time RAs working on this
    – One for user testing and promotion
    – One for user testing and technical aspects
• http://www.ucl.ac.uk/transcribe-bentham/
User Interaction



• Involving users in the design process is key
• Currently recruiting for testers
• Will be working one to one with users
  – Established textual scholars from DH community
  – Members of the public
• Will open to Beta testing to find bugs
• Then onto full launch
Issues and Outcomes



• Worst Case Scenario?
• Best Case Scenario?
• Is this task suitable for crowdsourcing?
  – Complex
• How can we gauge success?
  – Monitor and log user interaction
  – Report back on initiatives
• How can we reach a user community?
Conclude



• Latest fad?
• Should provide input into cultural and heritage
  institutions, research, and projects
• Longer term outcomes
  – Sustainability
• Good to try these things!
• http://www.ucl.ac.uk/transcribe-bentham/

Mterras 09 jun2010
