SlideShare ist ein Scribd-Unternehmen logo
1 von 31
@mattlease
ml@utexas.edu
Beyond Mechanical Turk: An Analysis of
Paid Crowd Work Platforms
Donna Vakharia & Matthew Lease
University of Texas at Austin
Talk slides: www.slideshare.net/mattlease
Roadmap
• A Brief History of Amazon’s Mechanical Turk
– Its rise to fame, enduring popularity, & strengths & limitations
• A capability analysis of 7 alternative crowd work platforms
– Ascertain distinguishing capabilities not found in MTurk
Matt Lease <ml@utexas.edu>
Take-away: Predominant use of Mechanical Turk in research
on paid crowd work has let its unique vagaries & limitations
unduly shape the broader crowdsourcing research agenda.
Greater awareness of alternative crowd work platforms can
accelerate practice, conceptual understanding, & progress.
2/20
Amazon Mechanical Turk (MTurk)
• Online marketplace for paid crowd work (micro-tasks)
• Human labor on-demand as a programmatic service
• Launched in November 2005
Matt Lease <ml@utexas.edu> 3/20
The Rise of Mechanical Turk
J. Pontin. Artificial Intelligence, With Help From the
Humans. New York Times, March 2007
Su et al., WWW 2007: “a web-based human data
collection system that we [call] ‘System M’ ”
Matt Lease <ml@utexas.edu> 4/20
2008: Gold Rush begins in Comp. Sci.
Snow et al, EMNLP
• Annotating human language
• 22,000 labels for only US $26 !!!
• Crowd’s consensus labels can
replace traditional expert labels
“Discovery” sparks rush for “gold” data across areas
• Alonso et al., SIGIR Forum (Information Retrieval)
• Kittur et al., CHI (Human-Computer Interaction)
• Sorokin and Forsythe, CVPR (Computer Vision)
Matt Lease <ml@utexas.edu> 5/20
MTurk for Social & Behavioral Sciences
• A Guide to Behavioral Experiments
on Mechanical Turk
– W. Mason and S. Suri (2010). SSRN online.
• Crowdsourcing for Human Subjects Research
– L. Schmidt (CrowdConf 2010)
• Crowdsourcing Content Analysis for Behavioral Research:
Insights from Mechanical Turk
– Conley & Tosti-Kharas (2010). Academy of Management
• Amazon's Mechanical Turk : A New Source of
Inexpensive, Yet High-Quality, Data?
– M. Buhrmester et al. (2011). Perspectives… 6(1):3-5.
– see also: Amazon Mechanical Turk Guide for Social Scientists
Matt Lease <ml@utexas.edu> 6/20
MTurk Strengths
• On-demand, 24-7, vast U.S. & Indian workforce
• Traditional economic friction and geographic
work barriers dramatically streamlined
– Cheap & fast recruitment & job execution
– Hiring/firing, communication, transfer, payment
• Programmatic control via intermediary API
• Worker pseudonyms and diverse participant
pool amenable to human subjects research
Matt Lease <ml@utexas.edu> 7/20
But also challenges (& much research)
• Reliability & data quality
– Risk & uncertainty may preclude adoption;
management overhead may offset savings
– Limited reputation system (no ratings/ reviews)
– Real & perceived fraud in marketplace
• Basic usage difficulties (eg. posting & search)
• How to accomplish non-trivial tasks?
– Lack of worker profiles (skills & interests)
– interaction, collaboration, & workflow design
• How to process sensitive data?
– Private records, trade secrets, military use
• How to assess and navigate ethics?
Matt Lease <ml@utexas.edu> 8/20
Amazon Mechanical Turk (MTurk)
• Online marketplace for paid crowd work (micro-tasks)
• Human labor on-demand as a programmatic service
• Launched in Nov. 2005, remains in “beta” in 2015
Matt Lease <ml@utexas.edu> 9/20
Matt Lease <ml@utexas.edu>
www.behind-the-enemy-lines.com/2010/10/plea-to-amazon-fix-mechanical-turk.html
10/20
Why Eytan Adar hates MTurk Research
(CHI 2011 CHC Workshop)
• Overly-narrow research focus on MTurk
– Distinguish general vs. platform-specific problems
– Distinguish research vs. industry concerns
• Should researchers really focus on…
– “...writing the user’s manual for MTurk ...”?
– “…struggl[ing] against the limits of the platform...”?
“…by rewarding quick demonstrations of the tool’s
use, we fail to attain a deeper understanding of the
problems to which it is applied…”
Matt Lease <ml@utexas.edu> 11/20
Crowdsourcing ≠ Mechanical Turk!
Matt Lease <ml@utexas.edu> 12/20
What about volunteer crowdsourcing?
Matt Lease <ml@utexas.edu> 13/20
Many Crowd Work Platforms Today
And More!
JobBoy, microWorkers, MiniFreelance,
MiniJobz, MinuteWorkers, MyEasyTask,
OpTask, ShortTask, SimpleWorkers
Paid Crowd Work ≠ Mechanical Turk!
Matt Lease <ml@utexas.edu> 14/20
The Future of Crowd Work
Paper @ ACM CSCW 2013 by
Kittur, Nickerson, Bernstein,
Gerber, Shaw, Zimmerman
Lease, & Horton
Matt Lease <ml@utexas.edu>
With diminished visibility and communication…
workers may be treated as exchangeable and
untrustworthy, having …. strong motivations to
shirk. Workers may become equally cynical…”
“…we surveyed a number of … popular crowd work
platforms... Mechanical Turk, oDesk, Freelancer,
Crowdflower, MobileWorks, ManPower…”
But our platform coverage was pretty superficial…
15/20
Here: Qualitative Study of 7 Platforms
Matt Lease <ml@utexas.edu>
We shared preliminary findings with personnel from
all platforms, requesting & incorporating feedback
16/20
Criteria for Assessing
Platform Capabilities
Matt Lease <ml@utexas.edu> 17/20
Capabilities Beyond MTurk (1 of 2)
• Basic platform operations mature beyond “beta”
– e.g. effective task search, preventing task starvation
• Payment models beyond per-task completed
– e.g., hourly pay or stronger models of contract work
• Rich worker profiles & reputation systems
– Some rigorously curate and assess workforce
– Work quality, desired demographics, task routing
• Rich interaction & collaboration support
– Support for a variety of specific task verticals
Matt Lease <ml@utexas.edu> 18/20
Capabilities Beyond MTurk (2 of 2)
• Truly global workforce & deeper geographic foci
– Still accepting new international workers…
• Supporting confidential & bring-your-own crowds
– Ability to crowdsource sensitive data
• “Fair trade” models for work with career ladders
– Explicit missions & designs for ethical work practices
• Legal terms-of-use beyond marketplace broker
– e.g., avoid risk of independent contractors vs.
employees debate (Wolfson & Lease, ASIS&T’11)
Matt Lease <ml@utexas.edu> 19/20
Summary
• Qualitative study of 7 platforms for crowd work
– Detailed analysis & comparison of current state-of-the-art
– Highlighted key capabilities available beyond MTurk
• Further inform current practice & future research
Matt Lease <ml@utexas.edu>
Take-away: Predominant use of Mechanical Turk in research
on paid crowd work has let its unique vagaries & limitations
unduly shape the broader crowdsourcing research agenda.
Greater awareness of alternative crowd work platforms can
accelerate practice, conceptual understanding, & progress.
20/20
Thank You!
ir.ischool.utexas.eduSlides: www.slideshare.net/mattlease
Matt Lease <ml@utexas.edu> 22/20
AAAI HCOMP 2013 Industry Panel
Anand Kulkarni: “How do we
dramatically reduce the complexity of
getting work done with the crowd?”
Greg Little: How can we post a task and
with 98% confidence know we’ll get a
quality result?
Matt Lease <ml@utexas.edu> 23/20
Why Paid Crowd Work?
• Why can’t we just…
– …ask people to volunteer?
– …create fun games which produce work as a by-product?
– …design services that produce work invisibly (e.g. re-Captcha)?
• Established models of paying work exist for a reason
– Even volunteers need to work to survive
– Demand will likely always outstrip volunteer supply
– There will always be forms of uninteresting but needed work
• Volunteer approaches bring their own set of challenges…
• But what about ethics?
– Are volunteering approaches truly more ethical?
– Since paying work isn’t going away, promote awareness & study
Matt Lease <ml@utexas.edu> 24/20
CACM August, 2013
Paul Hyman. Communications of the ACM, Vol. 56 No. 8, Pages 19-21, August 2013.
Matt Lease <ml@utexas.edu> 25/20
What about ethics?
• Silberman, Irani, and Ross (2010)
– “How should we… conceptualize the role of these
people who we ask to power our computing?”
– Power dynamics between parties
• What are the consequences for a worker
when your actions harm their reputation?
– “Abstraction hides detail”
• Fort, Adda, and Cohen (2011)
– “…opportunities for our community to deliberately
value ethics above cost savings.”
Matt Lease <ml@utexas.edu> 26/20
What about Human Subjects Research?
•
“What are the characteristics of MTurk workers?... the MTurk
system is set up to strictly protect workers’ anonymity….”
Matt Lease <ml@utexas.edu> 27/20
`
A MTurk worker’s ID is
also their customer
ID on Amazon. Public
profile pages can link
worker ID to name.
Lease et al., SSRN’13
Matt Lease <ml@utexas.edu> 28/20
Crowdsourcing
• Take a task traditionally
performed by a known agent
(often an employee)
• Advertise it to an undefined,
generally large group of
people via an open call
Matt Lease <ml@utexas.edu> 29/20
The Unreasonable Effectiveness of Data
Banko and Brill (2001)
An AI system’s effectiveness in practice is often
limited by the amount of available training data
Matt Lease <ml@utexas.edu> 30/20
Matt Lease <ml@utexas.edu> 31/20

Weitere ähnliche Inhalte

Was ist angesagt?

Service thinking cases 20130702 v1
Service thinking cases 20130702 v1Service thinking cases 20130702 v1
Service thinking cases 20130702 v1ISSIP
 
Future of learning 20180425 v1
Future of learning 20180425 v1Future of learning 20180425 v1
Future of learning 20180425 v1ISSIP
 
20220203 jim spohrer uidp v11
20220203 jim spohrer uidp v1120220203 jim spohrer uidp v11
20220203 jim spohrer uidp v11ISSIP
 
20220203 jim spohrer purdue v12
20220203 jim spohrer purdue v1220220203 jim spohrer purdue v12
20220203 jim spohrer purdue v12ISSIP
 
Colombia 20140326 v1
Colombia 20140326 v1Colombia 20140326 v1
Colombia 20140326 v1ISSIP
 
WWWW 2010 Conference - ICT4D 2.0
WWWW 2010 Conference - ICT4D 2.0 WWWW 2010 Conference - ICT4D 2.0
WWWW 2010 Conference - ICT4D 2.0 Craig Blewett
 
Data urban service science 20130617 v2
Data urban service science 20130617 v2Data urban service science 20130617 v2
Data urban service science 20130617 v2ISSIP
 
The Search for Truth in Objective & Subject Crowdsourcing
The Search for Truth in Objective & Subject CrowdsourcingThe Search for Truth in Objective & Subject Crowdsourcing
The Search for Truth in Objective & Subject CrowdsourcingMatthew Lease
 
Toward Effective and Sustainable Online Crowd Work
Toward Effective and Sustainable Online Crowd WorkToward Effective and Sustainable Online Crowd Work
Toward Effective and Sustainable Online Crowd WorkMatthew Lease
 
Cases for chesbrough 201304122 v2
Cases for chesbrough 201304122 v2Cases for chesbrough 201304122 v2
Cases for chesbrough 201304122 v2ISSIP
 
Issip 2014028 v3
Issip 2014028 v3Issip 2014028 v3
Issip 2014028 v3ISSIP
 
Aspen forum 20140619 v5
Aspen forum 20140619 v5Aspen forum 20140619 v5
Aspen forum 20140619 v5ISSIP
 
China ibm 20110522 v1
China ibm 20110522 v1China ibm 20110522 v1
China ibm 20110522 v1ISSIP
 
Weaving technology into the fabric of the classroom: Future jobs
Weaving technology into the fabric of the classroom: Future jobsWeaving technology into the fabric of the classroom: Future jobs
Weaving technology into the fabric of the classroom: Future jobsBond University
 
Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1ISSIP
 
Ibm welcome and cognitive 20170322 v7
Ibm welcome and cognitive 20170322 v7Ibm welcome and cognitive 20170322 v7
Ibm welcome and cognitive 20170322 v7ISSIP
 
Big Data and Social Machines
Big Data and Social MachinesBig Data and Social Machines
Big Data and Social MachinesDavid De Roure
 
Artificial Intelligence in E-learning (AI-Ed): Current and future applications
Artificial Intelligence in E-learning (AI-Ed): Current and future applicationsArtificial Intelligence in E-learning (AI-Ed): Current and future applications
Artificial Intelligence in E-learning (AI-Ed): Current and future applicationsRoy Clariana
 

Was ist angesagt? (20)

Service thinking cases 20130702 v1
Service thinking cases 20130702 v1Service thinking cases 20130702 v1
Service thinking cases 20130702 v1
 
Future of learning 20180425 v1
Future of learning 20180425 v1Future of learning 20180425 v1
Future of learning 20180425 v1
 
20220203 jim spohrer uidp v11
20220203 jim spohrer uidp v1120220203 jim spohrer uidp v11
20220203 jim spohrer uidp v11
 
20220203 jim spohrer purdue v12
20220203 jim spohrer purdue v1220220203 jim spohrer purdue v12
20220203 jim spohrer purdue v12
 
Colombia 20140326 v1
Colombia 20140326 v1Colombia 20140326 v1
Colombia 20140326 v1
 
WWWW 2010 Conference - ICT4D 2.0
WWWW 2010 Conference - ICT4D 2.0 WWWW 2010 Conference - ICT4D 2.0
WWWW 2010 Conference - ICT4D 2.0
 
Data urban service science 20130617 v2
Data urban service science 20130617 v2Data urban service science 20130617 v2
Data urban service science 20130617 v2
 
The Search for Truth in Objective & Subject Crowdsourcing
The Search for Truth in Objective & Subject CrowdsourcingThe Search for Truth in Objective & Subject Crowdsourcing
The Search for Truth in Objective & Subject Crowdsourcing
 
Toward Effective and Sustainable Online Crowd Work
Toward Effective and Sustainable Online Crowd WorkToward Effective and Sustainable Online Crowd Work
Toward Effective and Sustainable Online Crowd Work
 
Cases for chesbrough 201304122 v2
Cases for chesbrough 201304122 v2Cases for chesbrough 201304122 v2
Cases for chesbrough 201304122 v2
 
Issip 2014028 v3
Issip 2014028 v3Issip 2014028 v3
Issip 2014028 v3
 
CSE509 Lecture 1
CSE509 Lecture 1CSE509 Lecture 1
CSE509 Lecture 1
 
Aspen forum 20140619 v5
Aspen forum 20140619 v5Aspen forum 20140619 v5
Aspen forum 20140619 v5
 
China ibm 20110522 v1
China ibm 20110522 v1China ibm 20110522 v1
China ibm 20110522 v1
 
Machine Learning & Law
Machine Learning & LawMachine Learning & Law
Machine Learning & Law
 
Weaving technology into the fabric of the classroom: Future jobs
Weaving technology into the fabric of the classroom: Future jobsWeaving technology into the fabric of the classroom: Future jobs
Weaving technology into the fabric of the classroom: Future jobs
 
Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1
 
Ibm welcome and cognitive 20170322 v7
Ibm welcome and cognitive 20170322 v7Ibm welcome and cognitive 20170322 v7
Ibm welcome and cognitive 20170322 v7
 
Big Data and Social Machines
Big Data and Social MachinesBig Data and Social Machines
Big Data and Social Machines
 
Artificial Intelligence in E-learning (AI-Ed): Current and future applications
Artificial Intelligence in E-learning (AI-Ed): Current and future applicationsArtificial Intelligence in E-learning (AI-Ed): Current and future applications
Artificial Intelligence in E-learning (AI-Ed): Current and future applications
 

Ähnlich wie Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms

AI & Work, with Transparency & the Crowd
AI & Work, with Transparency & the Crowd AI & Work, with Transparency & the Crowd
AI & Work, with Transparency & the Crowd Matthew Lease
 
Crowdsourcing: From Aggregation to Search Engine Evaluation
Crowdsourcing: From Aggregation to Search Engine EvaluationCrowdsourcing: From Aggregation to Search Engine Evaluation
Crowdsourcing: From Aggregation to Search Engine EvaluationMatthew Lease
 
UT Dallas CS - Rise of Crowd Computing
UT Dallas CS - Rise of Crowd ComputingUT Dallas CS - Rise of Crowd Computing
UT Dallas CS - Rise of Crowd ComputingMatthew Lease
 
Metrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-ComputingMetrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-ComputingMatthew Lease
 
Explainable Fact Checking with Humans in-the-loop
Explainable Fact Checking with Humans in-the-loopExplainable Fact Checking with Humans in-the-loop
Explainable Fact Checking with Humans in-the-loopMatthew Lease
 
Crowdsourcing for Information Retrieval: From Statistics to Ethics
Crowdsourcing for Information Retrieval: From Statistics to EthicsCrowdsourcing for Information Retrieval: From Statistics to Ethics
Crowdsourcing for Information Retrieval: From Statistics to EthicsMatthew Lease
 
Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)Matthew Lease
 
Picmet 20130801 v2
Picmet 20130801 v2Picmet 20130801 v2
Picmet 20130801 v2ISSIP
 
Spohrer PHD_ICT_KES 20230316 v10.pptx
Spohrer PHD_ICT_KES 20230316 v10.pptxSpohrer PHD_ICT_KES 20230316 v10.pptx
Spohrer PHD_ICT_KES 20230316 v10.pptxISSIP
 
Seminar 20221027 v4.pptx
Seminar 20221027 v4.pptxSeminar 20221027 v4.pptx
Seminar 20221027 v4.pptxISSIP
 
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...Matthew Lease
 
Towards a classification framework for social machines
Towards a classification  framework for social machinesTowards a classification  framework for social machines
Towards a classification framework for social machinesSOCIAM Project
 
Towards a classification framework for social machines
Towards a classification framework for social machinesTowards a classification framework for social machines
Towards a classification framework for social machinesElena Simperl
 
Ten reasons 20130621 v3
Ten reasons 20130621 v3Ten reasons 20130621 v3
Ten reasons 20130621 v3ISSIP
 
Keynote 20221028 v5.pptx
Keynote 20221028 v5.pptxKeynote 20221028 v5.pptx
Keynote 20221028 v5.pptxISSIP
 
NHH 20221023 v3.pptx
NHH 20221023 v3.pptxNHH 20221023 v3.pptx
NHH 20221023 v3.pptxISSIP
 
UCSC-SV 20220825 v1.pptx
UCSC-SV 20220825 v1.pptxUCSC-SV 20220825 v1.pptx
UCSC-SV 20220825 v1.pptxISSIP
 
2021004 jim spohrer alan hartman_retirement v3
2021004 jim spohrer alan hartman_retirement v32021004 jim spohrer alan hartman_retirement v3
2021004 jim spohrer alan hartman_retirement v3ISSIP
 
Data Center Computing for Data Science: an evolution of machines, middleware,...
Data Center Computing for Data Science: an evolution of machines, middleware,...Data Center Computing for Data Science: an evolution of machines, middleware,...
Data Center Computing for Data Science: an evolution of machines, middleware,...Paco Nathan
 
Historical Perspective Business Use Of The Internet
Historical Perspective Business Use Of The InternetHistorical Perspective Business Use Of The Internet
Historical Perspective Business Use Of The InternetAnn Treacy
 

Ähnlich wie Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms (20)

AI & Work, with Transparency & the Crowd
AI & Work, with Transparency & the Crowd AI & Work, with Transparency & the Crowd
AI & Work, with Transparency & the Crowd
 
Crowdsourcing: From Aggregation to Search Engine Evaluation
Crowdsourcing: From Aggregation to Search Engine EvaluationCrowdsourcing: From Aggregation to Search Engine Evaluation
Crowdsourcing: From Aggregation to Search Engine Evaluation
 
UT Dallas CS - Rise of Crowd Computing
UT Dallas CS - Rise of Crowd ComputingUT Dallas CS - Rise of Crowd Computing
UT Dallas CS - Rise of Crowd Computing
 
Metrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-ComputingMetrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-Computing
 
Explainable Fact Checking with Humans in-the-loop
Explainable Fact Checking with Humans in-the-loopExplainable Fact Checking with Humans in-the-loop
Explainable Fact Checking with Humans in-the-loop
 
Crowdsourcing for Information Retrieval: From Statistics to Ethics
Crowdsourcing for Information Retrieval: From Statistics to EthicsCrowdsourcing for Information Retrieval: From Statistics to Ethics
Crowdsourcing for Information Retrieval: From Statistics to Ethics
 
Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)
 
Picmet 20130801 v2
Picmet 20130801 v2Picmet 20130801 v2
Picmet 20130801 v2
 
Spohrer PHD_ICT_KES 20230316 v10.pptx
Spohrer PHD_ICT_KES 20230316 v10.pptxSpohrer PHD_ICT_KES 20230316 v10.pptx
Spohrer PHD_ICT_KES 20230316 v10.pptx
 
Seminar 20221027 v4.pptx
Seminar 20221027 v4.pptxSeminar 20221027 v4.pptx
Seminar 20221027 v4.pptx
 
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
 
Towards a classification framework for social machines
Towards a classification  framework for social machinesTowards a classification  framework for social machines
Towards a classification framework for social machines
 
Towards a classification framework for social machines
Towards a classification framework for social machinesTowards a classification framework for social machines
Towards a classification framework for social machines
 
Ten reasons 20130621 v3
Ten reasons 20130621 v3Ten reasons 20130621 v3
Ten reasons 20130621 v3
 
Keynote 20221028 v5.pptx
Keynote 20221028 v5.pptxKeynote 20221028 v5.pptx
Keynote 20221028 v5.pptx
 
NHH 20221023 v3.pptx
NHH 20221023 v3.pptxNHH 20221023 v3.pptx
NHH 20221023 v3.pptx
 
UCSC-SV 20220825 v1.pptx
UCSC-SV 20220825 v1.pptxUCSC-SV 20220825 v1.pptx
UCSC-SV 20220825 v1.pptx
 
2021004 jim spohrer alan hartman_retirement v3
2021004 jim spohrer alan hartman_retirement v32021004 jim spohrer alan hartman_retirement v3
2021004 jim spohrer alan hartman_retirement v3
 
Data Center Computing for Data Science: an evolution of machines, middleware,...
Data Center Computing for Data Science: an evolution of machines, middleware,...Data Center Computing for Data Science: an evolution of machines, middleware,...
Data Center Computing for Data Science: an evolution of machines, middleware,...
 
Historical Perspective Business Use Of The Internet
Historical Perspective Business Use Of The InternetHistorical Perspective Business Use Of The Internet
Historical Perspective Business Use Of The Internet
 

Mehr von Matthew Lease

Automated Models for Quantifying Centrality of Survey Responses
Automated Models for Quantifying Centrality of Survey ResponsesAutomated Models for Quantifying Centrality of Survey Responses
Automated Models for Quantifying Centrality of Survey ResponsesMatthew Lease
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Matthew Lease
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Matthew Lease
 
Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation Matthew Lease
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Matthew Lease
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Matthew Lease
 
Fact Checking & Information Retrieval
Fact Checking & Information RetrievalFact Checking & Information Retrieval
Fact Checking & Information RetrievalMatthew Lease
 
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Matthew Lease
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesMatthew Lease
 
Systematic Review is e-Discovery in Doctor’s Clothing
Systematic Review is e-Discovery in Doctor’s ClothingSystematic Review is e-Discovery in Doctor’s Clothing
Systematic Review is e-Discovery in Doctor’s ClothingMatthew Lease
 
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...Matthew Lease
 
Crowdsourcing Transcription Beyond Mechanical Turk
Crowdsourcing Transcription Beyond Mechanical TurkCrowdsourcing Transcription Beyond Mechanical Turk
Crowdsourcing Transcription Beyond Mechanical TurkMatthew Lease
 
Crowdsourcing & ethics: a few thoughts and refences.
Crowdsourcing & ethics: a few thoughts and refences. Crowdsourcing & ethics: a few thoughts and refences.
Crowdsourcing & ethics: a few thoughts and refences. Matthew Lease
 
Crowdsourcing & Human Computation Labeling Data & Building Hybrid Systems
Crowdsourcing & Human Computation Labeling Data & Building Hybrid SystemsCrowdsourcing & Human Computation Labeling Data & Building Hybrid Systems
Crowdsourcing & Human Computation Labeling Data & Building Hybrid SystemsMatthew Lease
 
Mechanical Turk is Not Anonymous
Mechanical Turk is Not AnonymousMechanical Turk is Not Anonymous
Mechanical Turk is Not AnonymousMatthew Lease
 
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...Matthew Lease
 

Mehr von Matthew Lease (16)

Automated Models for Quantifying Centrality of Survey Responses
Automated Models for Quantifying Centrality of Survey ResponsesAutomated Models for Quantifying Centrality of Survey Responses
Automated Models for Quantifying Centrality of Survey Responses
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
 
Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
 
Fact Checking & Information Retrieval
Fact Checking & Information RetrievalFact Checking & Information Retrieval
Fact Checking & Information Retrieval
 
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
 
Systematic Review is e-Discovery in Doctor’s Clothing
Systematic Review is e-Discovery in Doctor’s ClothingSystematic Review is e-Discovery in Doctor’s Clothing
Systematic Review is e-Discovery in Doctor’s Clothing
 
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...
Multidimensional Relevance Modeling via Psychometrics & Crowdsourcing: ACM SI...
 
Crowdsourcing Transcription Beyond Mechanical Turk
Crowdsourcing Transcription Beyond Mechanical TurkCrowdsourcing Transcription Beyond Mechanical Turk
Crowdsourcing Transcription Beyond Mechanical Turk
 
Crowdsourcing & ethics: a few thoughts and refences.
Crowdsourcing & ethics: a few thoughts and refences. Crowdsourcing & ethics: a few thoughts and refences.
Crowdsourcing & ethics: a few thoughts and refences.
 
Crowdsourcing & Human Computation Labeling Data & Building Hybrid Systems
Crowdsourcing & Human Computation Labeling Data & Building Hybrid SystemsCrowdsourcing & Human Computation Labeling Data & Building Hybrid Systems
Crowdsourcing & Human Computation Labeling Data & Building Hybrid Systems
 
Mechanical Turk is Not Anonymous
Mechanical Turk is Not AnonymousMechanical Turk is Not Anonymous
Mechanical Turk is Not Anonymous
 
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...
UT Austin @ TREC 2012 Crowdsourcing Track: Image Relevance Assessment Task (I...
 

Kürzlich hochgeladen

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms

  • 1. @mattlease ml@utexas.edu Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms Donna Vakharia & Matthew Lease University of Texas at Austin Talk slides: www.slideshare.net/mattlease
  • 2. Roadmap • A Brief History of Amazon’s Mechanical Turk – Its rise to fame, enduring popularity, & strengths & limitations • A capability analysis of 7 alternative crowd work platforms – Ascertain distinguishing capabilities not found in MTurk Matt Lease <ml@utexas.edu> Take-away: Predominant use of Mechanical Turk in research on paid crowd work has let its unique vagaries & limitations unduly shape the broader crowdsourcing research agenda. Greater awareness of alternative crowd work platforms can accelerate practice, conceptual understanding, & progress. 2/20
  • 3. Amazon Mechanical Turk (MTurk) • Online marketplace for paid crowd work (micro-tasks) • Human labor on-demand as a programmatic service • Launched in November 2005 Matt Lease <ml@utexas.edu> 3/20
  • 4. The Rise of Mechanical Turk J. Pontin. Artificial Intelligence, With Help From the Humans. New York Times, March 2007 Su et al., WWW 2007: “a web-based human data collection system that we [call] ‘System M’ ” Matt Lease <ml@utexas.edu> 4/20
  • 5. 2008: Gold Rush begins in Comp. Sci. Snow et al, EMNLP • Annotating human language • 22,000 labels for only US $26 !!! • Crowd’s consensus labels can replace traditional expert labels “Discovery” sparks rush for “gold” data across areas • Alonso et al., SIGIR Forum (Information Retrieval) • Kittur et al., CHI (Human-Computer Interaction) • Sorokin and Forsythe, CVPR (Computer Vision) Matt Lease <ml@utexas.edu> 5/20
  • 6. MTurk for Social & Behavioral Sciences • A Guide to Behavioral Experiments on Mechanical Turk – W. Mason and S. Suri (2010). SSRN online. • Crowdsourcing for Human Subjects Research – L. Schmidt (CrowdConf 2010) • Crowdsourcing Content Analysis for Behavioral Research: Insights from Mechanical Turk – Conley & Tosti-Kharas (2010). Academy of Management • Amazon's Mechanical Turk : A New Source of Inexpensive, Yet High-Quality, Data? – M. Buhrmester et al. (2011). Perspectives… 6(1):3-5. – see also: Amazon Mechanical Turk Guide for Social Scientists Matt Lease <ml@utexas.edu> 6/20
  • 7. MTurk Strengths • On-demand, 24-7, vast U.S. & Indian workforce • Traditional economic friction and geographic work barriers dramatically streamlined – Cheap & fast recruitment & job execution – Hiring/firing, communication, transfer, payment • Programmatic control via intermediary API • Worker pseudonyms and diverse participant pool amenable to human subjects research Matt Lease <ml@utexas.edu> 7/20
  • 8. But also challenges (& much research) • Reliability & data quality – Risk & uncertainty may preclude adoption; management overhead may offset savings – Limited reputation system (no ratings/ reviews) – Real & perceived fraud in marketplace • Basic usage difficulties (eg. posting & search) • How to accomplish non-trivial tasks? – Lack of worker profiles (skills & interests) – interaction, collaboration, & workflow design • How to process sensitive data? – Private records, trade secrets, military use • How to assess and navigate ethics? Matt Lease <ml@utexas.edu> 8/20
  • 9. Amazon Mechanical Turk (MTurk) • Online marketplace for paid crowd work (micro-tasks) • Human labor on-demand as a programmatic service • Launched in Nov. 2005, remains in “beta” in 2015 Matt Lease <ml@utexas.edu> 9/20
  • 11. Why Eytan Adar hates MTurk Research (CHI 2011 CHC Workshop) • Overly-narrow research focus on MTurk – Distinguish general vs. platform-specific problems – Distinguish research vs. industry concerns • Should researchers really focus on… – “...writing the user’s manual for MTurk ...”? – “…struggl[ing] against the limits of the platform...”? “…by rewarding quick demonstrations of the tool’s use, we fail to attain a deeper understanding of the problems to which it is applied…” Matt Lease <ml@utexas.edu> 11/20
  • 12. Crowdsourcing ≠ Mechanical Turk! Matt Lease <ml@utexas.edu> 12/20
  • 13. What about volunteer crowdsourcing? Matt Lease <ml@utexas.edu> 13/20
  • 14. Many Crowd Work Platforms Today And More! JobBoy, microWorkers, MiniFreelance, MiniJobz, MinuteWorkers, MyEasyTask, OpTask, ShortTask, SimpleWorkers Paid Crowd Work ≠ Mechanical Turk! Matt Lease <ml@utexas.edu> 14/20
  • 15. The Future of Crowd Work Paper @ ACM CSCW 2013 by Kittur, Nickerson, Bernstein, Gerber, Shaw, Zimmerman Lease, & Horton Matt Lease <ml@utexas.edu> With diminished visibility and communication… workers may be treated as exchangeable and untrustworthy, having …. strong motivations to shirk. Workers may become equally cynical…” “…we surveyed a number of … popular crowd work platforms... Mechanical Turk, oDesk, Freelancer, Crowdflower, MobileWorks, ManPower…” But our platform coverage was pretty superficial… 15/20
  • 16. Here: Qualitative Study of 7 Platforms Matt Lease <ml@utexas.edu> We shared preliminary findings with personnel from all platforms, requesting & incorporating feedback 16/20
  • 17. Criteria for Assessing Platform Capabilities Matt Lease <ml@utexas.edu> 17/20
  • 18. Capabilities Beyond MTurk (1 of 2) • Basic platform operations mature beyond “beta” – e.g. effective task search, preventing task starvation • Payment models beyond per-task completed – e.g., hourly pay or stronger models of contract work • Rich worker profiles & reputation systems – Some rigorously curate and assess workforce – Work quality, desired demographics, task routing • Rich interaction & collaboration support – Support for a variety of specific task verticals Matt Lease <ml@utexas.edu> 18/20
  • 19. Capabilities Beyond MTurk (2 of 2) • Truly global workforce & deeper geographic foci – Still accepting new international workers… • Supporting confidential & bring-your-own crowds – Ability to crowdsource sensitive data • “Fair trade” models for work with career ladders – Explicit missions & designs for ethical work practices • Legal terms-of-use beyond marketplace broker – e.g., avoid risk of independent contractors vs. employees debate (Wolfson & Lease, ASIS&T’11) Matt Lease <ml@utexas.edu> 19/20
  • 20. Summary • Qualitative study of 7 platforms for crowd work – Detailed analysis & comparison of current state-of-the-art – Highlighted key capabilities available beyond MTurk • Further inform current practice & future research Matt Lease <ml@utexas.edu> Take-away: Predominant use of Mechanical Turk in research on paid crowd work has let its unique vagaries & limitations unduly shape the broader crowdsourcing research agenda. Greater awareness of alternative crowd work platforms can accelerate practice, conceptual understanding, & progress. 20/20
  • 23. AAAI HCOMP 2013 Industry Panel Anand Kulkarni: “How do we dramatically reduce the complexity of getting work done with the crowd?” Greg Little: How can we post a task and with 98% confidence know we’ll get a quality result? Matt Lease <ml@utexas.edu> 23/20
  • 24. Why Paid Crowd Work? • Why can’t we just… – …ask people to volunteer? – …create fun games which produce work as a by-product? – …design services that produce work invisibly (e.g. re-Captcha)? • Established models of paying work exist for a reason – Even volunteers need to work to survive – Demand will likely always outstrip volunteer supply – There will always be forms of uninteresting but needed work • Volunteer approaches bring their own set of challenges… • But what about ethics? – Are volunteering approaches truly more ethical? – Since paying work isn’t going away, promote awareness & study Matt Lease <ml@utexas.edu> 24/20
  • 25. CACM August, 2013 Paul Hyman. Communications of the ACM, Vol. 56 No. 8, Pages 19-21, August 2013. Matt Lease <ml@utexas.edu> 25/20
  • 26. What about ethics? • Silberman, Irani, and Ross (2010) – “How should we… conceptualize the role of these people who we ask to power our computing?” – Power dynamics between parties • What are the consequences for a worker when your actions harm their reputation? – “Abstraction hides detail” • Fort, Adda, and Cohen (2011) – “…opportunities for our community to deliberately value ethics above cost savings.” Matt Lease <ml@utexas.edu> 26/20
  • 27. What about Human Subjects Research? • “What are the characteristics of MTurk workers?... the MTurk system is set up to strictly protect workers’ anonymity….” Matt Lease <ml@utexas.edu> 27/20
  • 28. ` A MTurk worker’s ID is also their customer ID on Amazon. Public profile pages can link worker ID to name. Lease et al., SSRN’13 Matt Lease <ml@utexas.edu> 28/20
  • 29. Crowdsourcing • Take a task traditionally performed by a known agent (often an employee) • Advertise it to an undefined, generally large group of people via an open call Matt Lease <ml@utexas.edu> 29/20
  • 30. The Unreasonable Effectiveness of Data Banko and Brill (2001) An AI system’s effectiveness in practice is often limited by the amount of available training data Matt Lease <ml@utexas.edu> 30/20

Hinweis der Redaktion

  1. Show number of citations in google Scholar
  2. Ipeirotis Plea to amazon – fix mechanical turk ------------ Recent chronicle article on ethics
  3. Show number of citations in google Scholar
  4. Mindwire, new IMLS initiative
  5. Innocentive