Crowdsourcing Historical Research. Claudine Chionh, Drupal Downunder 2012
Founders and Survivors
Goals of the project
Some research projects
Project staff
Who are our users?
Data sources
Official/formal sources
Paper databases
Informal sources
Our volunteers
How volunteers can contribute
Solutions
The Founders and Survivors database
Experimenting with Drupal
Getting data into our system
Viewing data
Data capture
XML entry for an individual convict
Prepopulated Drupal form
Community contributed content
Ships (batches of data)
Ship summary data in Google Spreadsheets
Drupal can't do everything
Where Drupal is appropriate for our project
Summary
Questions?

Editor's notes

  1. Abstract: Founders and Survivors is an Australian Research Council-funded research project to build biographies of the approx. 70,000 convicts transported to Tasmania, and their descendants. The project is a collaboration between historians, public health scientists, and a growing number of volunteer genealogists and amateur historians. Our tools include a massive and complex XML database, and social and collaboration tools built with Drupal and Google Docs. This presentation will describe the goals and challenges of the project, the motivations behind the adoption of these tools, and their implementation. I am one of two developers on the Founders and Survivors project. I will introduce the project and how we use 'crowdsourcing' methods to enrich our archival sources.
  2. Introduction to the project. Founders and Survivors: Tasmanian convicts and their descendants -- health and resilience. Collaboration between researchers from Universities of Melbourne and Tasmania and elsewhere.
  3. Digital history: 'problem' or opportunity. Historical, archival resources that have not been compiled or explored.
  4. Some examples of current research.
  5. Shoestring budget. Experimental. Lots of research questions, limited technical resources.
  6. Users of this website. Research team: diverse backgrounds, locations. Amateur historians with interest in Tasmanian history.
  7. Some existing archival material has been digitised. We want to incorporate material from lives of convicts after leaving the convict system, from a range of primary sources and family histories.
  8. The project began with digitised images of archival documents from court trials, ships and prisons, recording the physical and behavioural characteristics of convicts before and during transportation and during their sentence in Van Diemen's Land.
  9. Good starting point for quantitative history. High-quality data.
  10. Less reliable. Less accessible. Collaboration between 'professional' and 'amateur' historians.
  11. Our volunteers include family historians, retired historians, librarians and engineers ... Interest in family or local histories, or convicts in general. Varying levels of experience with technology and historical research.
  12. What happened to convicts after they left the convict system?
  13. How to collate different sources of data and incorporate new data (from volunteers and other researchers or archives). Experimentation – solutions not planned from the start.
  14. Our other developer has consolidated the different sources of tabular data into one massive XML database using the BaseX engine and a data format based on the Text Encoding Initiative.
  15. At the same time, I was experimenting with presenting some of the same data in Drupal, but it would not scale (73,000 convicts, many different source documents for each). Drupal is now used to document the project, collect some data from volunteers, and coordinate volunteer efforts.
  16. Some tabular data has been captured in Excel or CSV form. Most textual/narrative documents are yet to be transcribed and will require more human intervention to incorporate them into the master database. Unfulfilled dream about GEDCOM import.
  17. Public and staff views of consolidated convict biographies are generated using XSLT. Link between BaseX and ccc: scripts that add links to BaseX, run as cron jobs.
  18. Convict biographies are captured in Drupal. The XSLT template for a convict record includes a URL to create a new entry in a Drupal form, using the Prepopulate module to capture enough from the XML record to assist in two-way linkage (just the record ID).
  19. Automated process to incorporate community-contributed content into the master database (Perl).
  20. Consolidated source info from the XML entry and prepopulated Drupal form with link to Archive Index ID number. [NB some of our record IDs are obscure. Here: CON31/40...]
  21. What if more than one person submits info on the same convict? These will not be identical because every descendant or researcher has different (but overlapping) info. All submissions are checked by staff before being added to the master database.
  22. More committed volunteers are assigned to ships and try to trace all convicts on that ship. In addition to convict biographies in Drupal, some summary data (targeted at analysis) is captured in Google Spreadsheets, one for each ship. Prepopulated using the Perl Google Docs API.
  23. Links to XML and Drupal records.
  24. Scale: both developers started on this project around the same time, with our own experiments (Drupal and XML), and XML appeared more suitable to the scale of our dataset. That was when we had much less data than we do now. Complex nature of our data: combination of tabular, textual and image sources; XML was a more natural fit for presenting a whole individual's lifecourse. Expertise: Some staff and volunteers seemed to have difficulty navigating complex forms. For the ship project, which involved volunteers making lots of numerical entries, we decided to use spreadsheets with validation controls instead.
  25. Building a web frontend which is more than the requisite "About the project" site – interface to XML database, data capture, and volunteer forums. BaseX and Drupal live on our own servers – not dependent on Google.
  26. This model has evolved as new data has become available and new analytical questions have been proposed – we did not know exactly what we needed to do when we began 3-4 years ago.
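
Note 18 describes a link from each XML convict record to a Drupal form that arrives prepopulated with the record ID. The following is a minimal sketch of how such a URL could be built, not the project's actual XSLT. The base URL, content type name, field name and sample record ID are all hypothetical, and the `edit[...][und][0][value]` parameter layout assumes the Drupal 7 version of the Prepopulate module.

```python
from urllib.parse import urlencode

# Hypothetical placeholders, not the project's real configuration.
DRUPAL_BASE = "https://example.org"
CONTENT_TYPE = "convict-biography"
ID_FIELD = "field_record_id"


def prepopulate_url(record_id: str) -> str:
    """Build a node-creation URL that carries only the XML record ID."""
    # Prepopulate fills form defaults from edit[...] query parameters.
    # The [und][0][value] suffix is the assumed Drupal 7 field layout.
    params = {f"edit[{ID_FIELD}][und][0][value]": record_id}
    return f"{DRUPAL_BASE}/node/add/{CONTENT_TYPE}?{urlencode(params)}"


if __name__ == "__main__":
    # "REC-0001" is an invented ID used only for illustration.
    print(prepopulate_url("REC-0001"))
```

Passing only the record ID keeps the link short and leaves the XML database as the single source for everything else about the convict.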
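
Note 21 raises the case of several people submitting information on the same convict. One way to prepare such submissions for staff review is to group them by record ID and flag the IDs that received more than one. This is only an illustrative sketch: the dictionary keys and sample values are hypothetical, and the project's own pipeline is written in Perl.

```python
from collections import defaultdict


def group_for_review(submissions):
    """Group submissions by convict record ID, keeping every version."""
    grouped = defaultdict(list)
    for sub in submissions:
        grouped[sub["record_id"]].append(sub)
    return dict(grouped)


def needs_comparison(grouped):
    """Return the record IDs that received more than one submission."""
    return sorted(rid for rid, subs in grouped.items() if len(subs) > 1)


if __name__ == "__main__":
    # Invented sample data for illustration only.
    sample = [
        {"record_id": "REC-0001", "contributor": "volunteer_a", "text": "Married 1850."},
        {"record_id": "REC-0002", "contributor": "volunteer_b", "text": "Died in Hobart."},
        {"record_id": "REC-0001", "contributor": "volunteer_c", "text": "Three children."},
    ]
    grouped = group_for_review(sample)
    print(needs_comparison(grouped))
```

Nothing is merged automatically here. Overlapping submissions are kept side by side so that a staff member can compare them, in line with the manual check the note describes.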
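
Note 22 mentions one prepopulated spreadsheet per ship. The project does this with the Perl Google Docs API. The sketch below swaps in plain CSV output from the Python standard library to show the idea of pre-filling one row per convict, and it does not touch Google Spreadsheets. The column names and sample records are hypothetical.

```python
import csv
import io

# Hypothetical column layout for a per-ship summary sheet.
COLUMNS = ["record_id", "name", "ship", "notes"]


def ship_sheet(convicts, ship):
    """Return CSV text with one pre-filled row per convict on the ship."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    for convict in convicts:
        if convict["ship"] == ship:
            writer.writerow({
                "record_id": convict["record_id"],
                "name": convict["name"],
                "ship": ship,
                "notes": "",  # left blank for the volunteer to fill in
            })
    return buf.getvalue()


if __name__ == "__main__":
    # Invented sample data for illustration only.
    sample = [
        {"record_id": "REC-0001", "name": "Person A", "ship": "Ship A"},
        {"record_id": "REC-0002", "name": "Person B", "ship": "Ship B"},
    ]
    print(ship_sheet(sample, "Ship A"))
```

Pre-filling the identifying columns means volunteers only enter new findings, which reduces retyping errors in the record IDs used for linkage.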