SlideShare a Scribd company logo
1 of 17
AN EXPLORATIVE APPROACH FOR
CROWDSOURCING TASKS DESIGN
Marco Brambilla
Stefano Ceri
Andrea Mauri
Riccardo Volonterio
Introduction
• OBJECTIVE: selecting the best execution strategy for the
specific human computation task
• ISSUE 1: Dealing with crowds introduces many
concurring objectives and constraints
• ISSUE 2: Very large datasets, high costs of selecting the
wrong strategy
• Performers
• Selection
• Rewarding
• Cost
• Object specific or global
• Time
• Quality
• Convergence criteria
SOCM'15, Monday, May 18 2An explorative approach for Crowdsourcing tasks design
Current approaches
• Tool to simplify the configuration
• Do not provide support on PROs and
CONs of alternatives in settings definition
• Define a mathematical formulation of the problem
• small set of decisions
• NP-hard classes
SOCM'15, Monday, May 18 3An explorative approach for Crowdsourcing tasks design
Our Approach to strategy selection
• We propose a domain-independent, explorative design
method
• Rapid prototyping and execution in the small in order to
select the design parameters to be used for big datasets
SOCM'15, Monday, May 18 4An explorative approach for Crowdsourcing tasks design
Define a
representative
set of
execution
strategies
Execute them
on a small
dataset
Collect quality
measures
Decide the
strategy to be
used with the
complete
dataset
Conceptual Model
SOCM'15, Monday, May 18 5An explorative approach for Crowdsourcing tasks design
Conceptual Model (2)
• Platform: where the task will be executed
• Cardinality: the number of object shown to the performer
• Reward: e.g., the cost of a HIT on Amazon Mechanical
Turk, or game rewards
• Agreement: e.g., majority based decision for each object
This list can be extended in order to satisfy specific user
needs
SOCM'15, Monday, May 18 6An explorative approach for Crowdsourcing tasks design
Candidate Strategy
• Each candidate strategies is thus represented by a set of
parameters describing the model instance considered
S = {s1, s2, . . . , sn} where n is the number of
considered parameters
• Example:
• an execution on Amazon Mechanical Turk
• 3 objects per HIT,
• “2 workers over 3” agreement
• 0.01$ per answer
Sexample = [“AMT”, 3, 2/3,0.01]
SOCM'15, Monday, May 18 7An explorative approach for Crowdsourcing tasks design
Quality measures
Strategies need to be evaluated by using a set of quality
measures
• Cohen’s kappa coefficient: a statistical measure of inter-
annotator agreement for categorical annotation tasks
• Precision of responses: percent of correct responses
• Execution time: the elapsed time needed to complete the
whole task.
• Cost: the total amount of money spent or impact on the
social network cause by our activity.
SOCM'15, Monday, May 18 8An explorative approach for Crowdsourcing tasks design
Evaluation of the strategies
Split the
dataset in 2
(small and
large)
Run all the
strategies on
the small
dataset
Collect the
quality
measure(s)
Select the
“best”
strategy
SOCM'15, Monday, May 18 9An explorative approach for Crowdsourcing tasks design
With |small| << |large|
Experiment
Two main assumptions
1. The execution of a strategy on the small and large
datasets are correlated
2. The cost of performing all experiments in the small
followed by one (the best) experiment in the large is
affordable
SOCM'15, Monday, May 18 10An explorative approach for Crowdsourcing tasks design
Experiment (2)
• We designed an image labeling crowdsourcing task in
which we ask the crowd to classify pictures related to
actor.
• Design dimensions
• Number of images shown in
each microtask
• Agreement level for each picture
• Cost of each AMT HIT
• Dataset
• 900 images related to actors retrieved from Google Images
• Subselection of 90 random images as small dataset
SOCM'15, Monday, May 18 11An explorative approach for Crowdsourcing tasks design
Experiment (3)
• Then we selected 8 different strategies and we ran them
on both the small and large dataset
(to validate correlation hyp.)
SOCM'15, Monday, May 18 12An explorative approach for Crowdsourcing tasks design
Experiment (4)
• We calculated all quality measures of the strategies
• Selection of best strategy depends on weight given to the
measures
• E.g., in the example we compared the strategies wrt the trade-off
between precision and cost
SOCM'15, Monday, May 18 13An explorative approach for Crowdsourcing tasks design
Results
• First assumption:
• we calculated the Pearson correlation coefficient, for each design
dimension
SOCM'15, Monday, May 18 14An explorative approach for Crowdsourcing tasks design
Cost Precision Agreement Duration
Pearson 0.999 0.619 0.707 0.915
Results (2)
• Second assumption:
• Cost for executing all the 8 strategies on the small dataset: $22.49
• Cost for executing the selected strategy: $16.86
• Total: 39.95$
• The difference between the cost of experiments in the
small and in the large increases a lot with big input data
• Hint: in real scenarios (tens of K of objects), numerosity of small vs.
big >= 2 orders of magnitude
• If you selected a random strategy, you may have found
worst quality and higher cost
SOCM'15, Monday, May 18 15An explorative approach for Crowdsourcing tasks design
Conclusion
• Our method is applicable and can lead to quantifiable
advantages of cost and quality
• Trade-off between the additional cost and the added value
is affordable
Future Works
• Formalizing the process for selecting candidate strategies
and the “best” one (currently empirical selection)
• Iterative tuning: multi-level or separate dimensions
• Testing on bigger datasets and with more design
dimensions
SOCM'15, Monday, May 18 16An explorative approach for Crowdsourcing tasks design
Thanks for your attention
Any Questions?
Stefano Ceri stefano.ceri@polimi.it
Marco Brambilla marco.brambilla@polimi.it
Andrea Mauri andrea.mauri@polimi.it
Riccardo Volonterio riccardo.volonterio@polimi.it
SOCM'15, Monday, May 18 17An explorative approach for Crowdsourcing tasks design

More Related Content

Similar to An explorative approach for Crowdsourcing tasks design

Overcoming the 5 Most Common PCM Challenges
Overcoming the 5 Most Common PCM Challenges Overcoming the 5 Most Common PCM Challenges
Overcoming the 5 Most Common PCM Challenges Michelle Scifers, MBA
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsTasktop
 
chapter 1.pptx
chapter 1.pptxchapter 1.pptx
chapter 1.pptxDejeneDay
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgnirudra Sikdar
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyyaPMI2011
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02PMI_IREP_TP
 
Pragmatic Approaches to Project Costs Estimation
Pragmatic Approaches to Project Costs EstimationPragmatic Approaches to Project Costs Estimation
Pragmatic Approaches to Project Costs EstimationChristopher Akinlade
 
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...OECD Governance
 
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier Selection
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier SelectionIRJET-An Entropy-Weight Based TOPSIS Approach for Supplier Selection
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier SelectionIRJET Journal
 
Capital Expenditure Estimating Cost of Project
Capital Expenditure Estimating Cost of ProjectCapital Expenditure Estimating Cost of Project
Capital Expenditure Estimating Cost of ProjectArvinThamsir1
 
Capital Cost Expenditure Estimating .ppt
Capital Cost Expenditure Estimating .pptCapital Cost Expenditure Estimating .ppt
Capital Cost Expenditure Estimating .pptArvinThamsir1
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
Parametric Estimation in a nutshell
Parametric Estimation in a nutshellParametric Estimation in a nutshell
Parametric Estimation in a nutshellPlanisware
 
Biomedical engineering design
Biomedical engineering designBiomedical engineering design
Biomedical engineering designBrian Matovu
 
A Case Study: Measuring Scrum Implementation
A Case Study: Measuring Scrum ImplementationA Case Study: Measuring Scrum Implementation
A Case Study: Measuring Scrum ImplementationBurakcan Ekici
 

Similar to An explorative approach for Crowdsourcing tasks design (20)

Overcoming the 5 Most Common PCM Challenges
Overcoming the 5 Most Common PCM Challenges Overcoming the 5 Most Common PCM Challenges
Overcoming the 5 Most Common PCM Challenges
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
 
chapter 1.pptx
chapter 1.pptxchapter 1.pptx
chapter 1.pptx
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
 
Ch09
Ch09Ch09
Ch09
 
Pragmatic Approaches to Project Costs Estimation
Pragmatic Approaches to Project Costs EstimationPragmatic Approaches to Project Costs Estimation
Pragmatic Approaches to Project Costs Estimation
 
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...
Procurement strategy in major infrastructure: The AS-IS and STEPS - D. Makovš...
 
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier Selection
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier SelectionIRJET-An Entropy-Weight Based TOPSIS Approach for Supplier Selection
IRJET-An Entropy-Weight Based TOPSIS Approach for Supplier Selection
 
Lesson1.2.pptx.pdf
Lesson1.2.pptx.pdfLesson1.2.pptx.pdf
Lesson1.2.pptx.pdf
 
Capital Expenditure Estimating Cost of Project
Capital Expenditure Estimating Cost of ProjectCapital Expenditure Estimating Cost of Project
Capital Expenditure Estimating Cost of Project
 
Capital Cost Expenditure Estimating .ppt
Capital Cost Expenditure Estimating .pptCapital Cost Expenditure Estimating .ppt
Capital Cost Expenditure Estimating .ppt
 
Ds for finance day 3
Ds for finance day 3Ds for finance day 3
Ds for finance day 3
 
Chapt 12
Chapt 12Chapt 12
Chapt 12
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
Parametric Estimation in a nutshell
Parametric Estimation in a nutshellParametric Estimation in a nutshell
Parametric Estimation in a nutshell
 
Unit 1.pptx
Unit 1.pptxUnit 1.pptx
Unit 1.pptx
 
Biomedical engineering design
Biomedical engineering designBiomedical engineering design
Biomedical engineering design
 
A Case Study: Measuring Scrum Implementation
A Case Study: Measuring Scrum ImplementationA Case Study: Measuring Scrum Implementation
A Case Study: Measuring Scrum Implementation
 

More from Andrea Mauri

A Big Data Analysis Framework for Model-Based Web User Behavior Analytics
A Big Data Analysis Framework for Model-Based Web User Behavior AnalyticsA Big Data Analysis Framework for Model-Based Web User Behavior Analytics
A Big Data Analysis Framework for Model-Based Web User Behavior AnalyticsAndrea Mauri
 
TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebAndrea Mauri
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsAndrea Mauri
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsAndrea Mauri
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesAndrea Mauri
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based CrowdsourcingAndrea Mauri
 

More from Andrea Mauri (6)

A Big Data Analysis Framework for Model-Based Web User Behavior Analytics
A Big Data Analysis Framework for Model-Based Web User Behavior AnalyticsA Big Data Analysis Framework for Model-Based Web User Behavior Analytics
A Big Data Analysis Framework for Model-Based Web User Behavior Analytics
 
TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the Web
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring Applications
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applications
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing Queries
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based Crowdsourcing
 

Recently uploaded

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 

Recently uploaded (20)

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 

An explorative approach for Crowdsourcing tasks design

  • 1. AN EXPLORATIVE APPROACH FOR CROWDSOURCING TASKS DESIGN Marco Brambilla Stefano Ceri Andrea Mauri Riccardo Volonterio
  • 2. Introduction • OBJECTIVE: selecting the best execution strategy for the specific human computation task • ISSUE 1: Dealing with crowds introduces many concurring objectives and constraints • ISSUE 2: Very large datasets, high costs of selecting the wrong strategy • Performers • Selection • Rewarding • Cost • Object specific or global • Time • Quality • Convergence criteria SOCM'15, Monday, May 18 2An explorative approach for Crowdsourcing tasks design
  • 3. Current approaches • Tool to simplify the configuration • Do not provide support on PROs and CONs of alternatives in settings definition • Define a mathematical formulation of the problem • small set of decisions • NP-hard classes SOCM'15, Monday, May 18 3An explorative approach for Crowdsourcing tasks design
  • 4. Our Approach to strategy selection • We propose a domain-independent, explorative design method • Rapid prototyping and execution in the small in order to select the design parameters to be used for big datasets SOCM'15, Monday, May 18 4An explorative approach for Crowdsourcing tasks design Define a representative set of execution strategies Execute them on a small dataset Collect quality measures Decide the strategy to be used with the complete dataset
  • 5. Conceptual Model SOCM'15, Monday, May 18 5An explorative approach for Crowdsourcing tasks design
  • 6. Conceptual Model (2) • Platform: where the task will be executed • Cardinality: the number of object shown to the performer • Reward: e.g., the cost of a HIT on Amazon Mechanical Turk, or game rewards • Agreement: e.g., majority based decision for each object This list can be extended in order to satisfy specific user needs SOCM'15, Monday, May 18 6An explorative approach for Crowdsourcing tasks design
  • 7. Candidate Strategy • Each candidate strategies is thus represented by a set of parameters describing the model instance considered S = {s1, s2, . . . , sn} where n is the number of considered parameters • Example: • an execution on Amazon Mechanical Turk • 3 objects per HIT, • “2 workers over 3” agreement • 0.01$ per answer Sexample = [“AMT”, 3, 2/3,0.01] SOCM'15, Monday, May 18 7An explorative approach for Crowdsourcing tasks design
  • 8. Quality measures Strategies need to be evaluated by using a set of quality measures • Cohen’s kappa coefficient: a statistical measure of inter- annotator agreement for categorical annotation tasks • Precision of responses: percent of correct responses • Execution time: the elapsed time needed to complete the whole task. • Cost: the total amount of money spent or impact on the social network cause by our activity. SOCM'15, Monday, May 18 8An explorative approach for Crowdsourcing tasks design
  • 9. Evaluation of the strategies Split the dataset in 2 (small and large) Run all the strategies on the small dataset Collect the quality measure(s) Select the “best” strategy SOCM'15, Monday, May 18 9An explorative approach for Crowdsourcing tasks design With |small| << |large|
  • 10. Experiment Two main assumptions 1. The execution of a strategy on the small and large datasets are correlated 2. The cost of performing all experiments in the small followed by one (the best) experiment in the large is affordable SOCM'15, Monday, May 18 10An explorative approach for Crowdsourcing tasks design
  • 11. Experiment (2) • We designed an image labeling crowdsourcing task in which we ask the crowd to classify pictures related to actor. • Design dimensions • Number of images shown in each microtask • Agreement level for each picture • Cost of each AMT HIT • Dataset • 900 images related to actors retrieved from Google Images • Subselection of 90 random images as small dataset SOCM'15, Monday, May 18 11An explorative approach for Crowdsourcing tasks design
  • 12. Experiment (3) • Then we selected 8 different strategies and we ran them on both the small and large dataset (to validate correlation hyp.) SOCM'15, Monday, May 18 12An explorative approach for Crowdsourcing tasks design
  • 13. Experiment (4) • We calculated all quality measures of the strategies • Selection of best strategy depends on weight given to the measures • E.g., in the example we compared the strategies wrt the trade-off between precision and cost SOCM'15, Monday, May 18 13An explorative approach for Crowdsourcing tasks design
  • 14. Results • First assumption: • we calculated the Pearson correlation coefficient, for each design dimension SOCM'15, Monday, May 18 14An explorative approach for Crowdsourcing tasks design Cost Precision Agreement Duration Pearson 0.999 0.619 0.707 0.915
  • 15. Results (2) • Second assumption: • Cost for executing all the 8 strategies on the small dataset: $22.49 • Cost for executing the selected strategy: $16.86 • Total: 39.95$ • The difference between the cost of experiments in the small and in the large increases a lot with big input data • Hint: in real scenarios (tens of K of objects), numerosity of small vs. big >= 2 orders of magnitude • If you selected a random strategy, you may have found worst quality and higher cost SOCM'15, Monday, May 18 15An explorative approach for Crowdsourcing tasks design
  • 16. Conclusion • Our method is applicable and can lead to quantifiable advantages of cost and quality • Trade-off between the additional cost and the added value is affordable Future Works • Formalizing the process for selecting candidate strategies and the “best” one (currently empirical selection) • Iterative tuning: multi-level or separate dimensions • Testing on bigger datasets and with more design dimensions SOCM'15, Monday, May 18 16An explorative approach for Crowdsourcing tasks design
  • 17. Thanks for your attention Any Questions? Stefano Ceri stefano.ceri@polimi.it Marco Brambilla marco.brambilla@polimi.it Andrea Mauri andrea.mauri@polimi.it Riccardo Volonterio riccardo.volonterio@polimi.it SOCM'15, Monday, May 18 17An explorative approach for Crowdsourcing tasks design

Editor's Notes

  1. We propose a domain independent, explorative method that is based on rapid prototyping and execution on small set of data in order to select the design parameter. The overall process is summarized in this flow…
  2. Our approach refers to a simple concept model shown in, which describes how each elementary execution of a crowdsourcing step, called Execution, is referred to an underlying operation (e.g. classifying, tagging, labeling, liking, commenting) called Task, to a specific Platform (that can be either a crowdsourcing marketplace or a social network), to a specific Object of a given collection, and to a specific Performer who executes
  3. These concepts, in turn, are characterized by a set of properties that will define the design space.
  4. Then quality measures are needed for evaluating the strategies Precision: can be aggregated at the level of object, performer, platform, or whole task
  5. The we can perform the actual evaluation….
  6. Our approach has two main assumptions that need to be verified…
  7. We designed an experiment composed by a classification task..
  8. We built the matrix by composing different combination of each parameter Selected a subset, bcause is not feasble to test all of them (for some case it does not even make any sense) (for example avoiding to pay a lot of money for classifying a few objects)
  9. In particular, based upon the small-scale experiment, we selected Strategy 6, which ap- pears to have enough precision (0.864) associated with a low cost (1.92) The designer’s choice is anyway driven by cost-benefit analy- sis, that however is performed in the small, e.g. the designer will be able to decide if a difference in precision from .811 of case 3 to .856 of case 5 is justified by an increase in costs from 1.40 to 4.77.
  10. As one can see, correlation is almost one for the cost, that can be obtained just by considering the scale factor between small and large; but correlation is quite good also for du- ration, performer agreement and precision. Note that du- rations are longer for the small experiments than for the long ones. This reflects a known behaviour of the crowd, which tends to select tasks with higher number of executions to perform (also due to the bias introduced by crowd platforms, which show the biggest tasks first).