Estimating the Completion Time
    of Crowdsourced Tasks using Survival Analysis


                   Jing Wang, New York University
           Siamak Faridani, University of California, Berkeley
                 Panos Ipeirotis, New York University




Crowdsourcing: Pricing and Time to completion?


     Many firms use crowdsourcing for a variety of tasks


     Still unclear how to price
        Prior results indicate that price does not affect quality
         (Mason and Watts, 2009)
        …but it does affect completion time

     Unclear how long it will take for a task to finish




Data Set: Mechanical Turk Tracker
                   (http://www.mturk-tracker.com)


     Crawled Amazon Mechanical Turk hourly (now every minute)
     Captured full market state (content, position, and 
      characteristics of all available HITs).


     15 months of data (now >24 months)
     165,368 HIT groups
     6,701,406 HIT assignments from 9,436 requesters
     Value of the HITs: $529,259 [guesstimate ~10% of actual value]
     Missing very short tasks (posted and disappeared in <1hr)
     Do not observe HIT redundancy

Completion Times: Power‐laws




      HIT completion time: Time_last_seen – Time_first_posted

Completion Times: Power‐laws and Censoring
                         Censoring Effects

                      Jumps/Outliers: Expiration

                      Different slope: Requesters taking down HITs




      HIT completion time: Time_last_seen – Time_first_posted
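
The completion-time definition above, combined with the censoring effects, can be sketched in a few lines. This is only an illustration, not the tracker's actual pipeline: the field names (`first_posted`, `last_seen`, `expires_at`), the `crawl_end` cutoff, and the one-hour tolerance are hypothetical. A HIT group is treated as right-censored when it was still listed at the final crawl or vanished at its expiration time, because in both cases the true completion time is unknown. Takedowns by the requester cannot be detected this way.

```python
from datetime import datetime, timedelta

def observed_duration(first_posted, last_seen, expires_at, crawl_end,
                      tolerance=timedelta(hours=1)):
    """Return (duration_in_hours, completed) for one HIT group.

    completed is False when the observation is right-censored.
    All field names and the tolerance are assumptions for illustration.
    """
    duration = (last_seen - first_posted).total_seconds() / 3600.0
    still_listed = crawl_end - last_seen < tolerance      # visible at final crawl
    expired = abs(expires_at - last_seen) <= tolerance    # vanished at expiration
    return duration, not (still_listed or expired)

crawl_end = datetime(2011, 1, 31, 0, 0)  # hypothetical end of the crawl
finished = observed_duration(
    datetime(2011, 1, 1, 9, 0), datetime(2011, 1, 3, 9, 0),
    datetime(2011, 1, 8, 9, 0), crawl_end)
expired = observed_duration(
    datetime(2011, 1, 1, 9, 0), datetime(2011, 1, 8, 9, 0),
    datetime(2011, 1, 8, 9, 0), crawl_end)
```

The `completed` flag is what a survival model needs alongside each duration, so that censored HITs are not mistaken for finished ones.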

Parameter estimation

 Maximum Likelihood Estimation, controlling for censored data
     Power‐law parameter α~1.5
     Power‐laws with α<2 do not have well‐defined mean value
     Sample average increases as sample size increases
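
One standard way to fit the exponent with censored observations is the closed-form maximum likelihood estimate for a continuous power law. The sketch below assumes a density proportional to x^(-α) above a cutoff `x_min`; the slides do not say which estimator or cutoff was used, so the function name, `x_min`, and the synthetic data are assumptions.

```python
import math

def powerlaw_alpha_mle(durations, completed, x_min):
    """Censored MLE of alpha for a density proportional to x^(-alpha), x >= x_min.

    Completed tasks contribute the density, censored tasks the survival
    function (x / x_min)^(-(alpha - 1)). Maximizing the log-likelihood
    gives alpha = 1 + (#completed) / sum(ln(x_i / x_min)) over ALL tasks.
    """
    pairs = [(x, c) for x, c in zip(durations, completed) if x >= x_min]
    n_completed = sum(1 for _, c in pairs if c)
    log_sum = sum(math.log(x / x_min) for x, _ in pairs)
    if n_completed == 0 or log_sum == 0:
        raise ValueError("need at least one completed task above x_min")
    return 1.0 + n_completed / log_sum

# Synthetic, uncensored data placed at the quantiles of a power law
# with alpha = 1.5 (not the tracker data).
n, true_alpha, x_min = 1000, 1.5, 1.0
sample = [x_min * (1 - (i + 0.5) / n) ** (-1 / (true_alpha - 1)) for i in range(n)]
alpha_hat = powerlaw_alpha_mle(sample, [True] * n, x_min)
```

On this synthetic sample the estimate lands close to 1.5. With an exponent below 2 the fitted law has no finite mean, which is why the sample average keeps growing with the sample size.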




Why Power‐laws?

     Queuing theory model (Cobham, 1954):
        If workers pick tasks from two priority queues, 
         completion time follows a power‐law with α=1.5
        Chilton et al., HCOMP 2010: workers rank either by 
         “most recently posted” or by “most HITs available”
        Result: Inherent unpredictability of completion time
        Real solution: Amazon should change the interface


        But let’s see how other factors affect completion time



Survival Analysis

 Examine and model the time it takes for events to occur
 In our case: Event = HIT gets completed


 Survival function S(t):  
     Probability that tasks will last longer than t


 Used stratified Cox Proportional Hazards Model
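
To make the survival function concrete, here is a minimal sketch of the non-parametric Kaplan-Meier estimate of S(t). It is not the stratified Cox model used in the talk, only the simplest estimator that handles censored tasks correctly. The example lifetimes are hypothetical.

```python
def kaplan_meier(durations, completed):
    """Return [(t, S(t))] at each distinct time where a task completed.

    durations: observed lifetimes; completed: False marks a censored task.
    """
    at_risk = len(durations)
    survival, curve = 1.0, []
    for t in sorted(set(durations)):
        events = sum(1 for d, c in zip(durations, completed) if d == t and c)
        leaving = sum(1 for d in durations if d == t)
        if events:
            # Multiply by the fraction of at-risk tasks that survive past t.
            survival *= 1.0 - events / at_risk
            curve.append((t, survival))
        at_risk -= leaving  # completed and censored tasks both leave the risk set
    return curve

# Hypothetical lifetimes in hours; the task observed for 5 hours is censored.
curve = kaplan_meier([2, 3, 5, 8], [True, True, False, True])
```

The censored task never lowers the curve itself, but it does shrink the risk set, so the completion at 8 hours counts against a single remaining task.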




Covariates Examined

      HIT Characteristics
         Monetary reward
         Number of HITs
         Length in characters
         HIT topic (based on Latent Dirichlet Allocation analysis)

      Market Characteristics
         Day of the week (when HIT was first posted)
         Time of the day (when HIT was first posted)

      Requester Characteristics
         Activities of requester until time of submission
         Existing lifetime of requester



Effect of Price: Mostly monotonic

                     h(t) = 1.035^price
                     40% speedup for 10x price


      Half‐life for $0.025 reward ~ 2 days
      Half‐life for $1 reward ~ 12 hours
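
As a reading aid for the `1.035^price` shorthand: in a Cox proportional hazards model, the hazard for a task with covariates x is a baseline hazard scaled by an exponential term, so each coefficient acts as a multiplier on the completion rate. The block below writes this out under the assumption that 1.035 is the exponentiated price coefficient. The slide does not state the unit of price, so p_1 and p_2 are rewards in that unspecified unit.

```latex
h(t \mid x) = h_0(t)\, \exp\!\left(\beta^{\top} x\right),
\qquad
\frac{h(t \mid \text{price} = p_2)}{h(t \mid \text{price} = p_1)}
  = e^{\beta_{\text{price}} (p_2 - p_1)}
  = 1.035^{\,p_2 - p_1}
```

A ratio above 1 means tasks complete at a higher rate. For example, a 10-unit increase gives 1.035^10 ≈ 1.41. That would be in line with the 40% figure if the unit were one cent, but that is a guess rather than something the slide confirms.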

Effect of #HITs: Monotonic, but sublinear

                   h(t) = 0.998^#HITs




 10 HITs → 2% slower than 1 HIT
 100 HITs → 19% slower than 1 HIT
 1000 HITs → 87% slower than 1 HIT
  or, 1 group of 1000 → 7 times faster than 1000 sequential groups of 1
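
The slowdown figures follow from compounding the per-HIT multiplier. The sketch below assumes that "slower than 1 HIT" means the hazard ratio between a group of n HITs and a group of one, i.e. 0.998^(n-1); that reading and the function name are assumptions. It gives roughly 2%, 18% and 86%, close to the slide's 2%, 19% and 87%. The small gap is presumably due to the coefficient being rounded to three decimals.

```python
HAZARD_MULTIPLIER_PER_HIT = 0.998  # coefficient quoted on the slide

def slowdown_vs_single_hit(n_hits):
    """Fractional drop in completion rate of an n-HIT group vs. a 1-HIT group."""
    return 1.0 - HAZARD_MULTIPLIER_PER_HIT ** (n_hits - 1)

for n in (10, 100, 1000):
    print(f"{n:>5} HITs: {slowdown_vs_single_hit(n):.1%} slower")
```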
Covariates Examined

      HIT Characteristics
        Monetary reward
        Number of HITs
        Length in characters (increases lifetime)
        HIT topic (based on Latent Dirichlet Allocation analysis)

      Market Characteristics
        Day of the week (when HIT was first posted)
        Time of the day (when HIT was first posted)

      Requester Characteristics
        Activities of requester until time of submission
        Existing lifetime of requester



HIT Topics
topic 1:  cw castingwords  podcast  transcribe  english  mp3  edit  confirm  snippet  grade

topic 2:  data  collection  search  image  entry  listings  website  review  survey  opinion

topic 3:  categorization  product  video  page  smartsheet web  comment  website  opinion

topic 4:  easy  quick  survey  money  research  fast  simple  form  answers  link

topic 5:  question  answer  nanonano dinkle article  write  writing  review  blog  articles

topic 6:  writing  answer  article  question  opinion  short  advice  editing  rewriting  paul

topic 7:  transcribe  transcription  improve  retranscribe edit  answerly voicemail  answer




Effect of Topic: The CastingWords Effect




      topic 1:  cw castingwords  podcast  transcribe  english  mp3  edit  confirm  snippet  grade
      topic 2:  data  collection  search  image  entry  listings  website  review  survey  opinion
      topic 3:  categorization  product  video  page  smartsheet web  comment  website  opinion
      topic 4:  easy  quick  survey  money  research  fast  simple  form  answers  link
      topic 5:  question  answer  nanonano dinkle article  write  writing  review  blog  articles
      topic 6:  writing  answer  article  question  opinion  short  advice  editing  rewriting  paul
      topic 7:  transcribe  transcription  improve  retranscribe edit  answerly voicemail  query  question  answer


Effect of Topic: Surveys=fast (even with redundancy!)




       topic 1:  cw castingwords  podcast  transcribe  english  mp3  edit  confirm  snippet  grade
       topic 2:  data  collection  search  image  entry  listings  website  review  survey  opinion
       topic 3:  categorization  product  video  page  smartsheet web  comment  website  opinion
       topic 4:  easy  quick  survey  money  research  fast  simple  form  answers  link
       topic 5:  question  answer  nanonano dinkle article  write  writing  review  blog  articles
       topic 6:  writing  answer  article  question  opinion  short  advice  editing  rewriting  paul
       topic 7:  transcribe  transcription  improve  retranscribe edit  answerly voicemail  query  question  answer


Effect of Topic: Writing takes time




       topic 1:  cw castingwords  podcast  transcribe  english  mp3  edit  confirm  snippet  grade
       topic 2:  data  collection  search  image  entry  listings  website  review  survey  opinion
       topic 3:  categorization  product  video  page  smartsheet web  comment  website  opinion
       topic 4:  easy  quick  survey  money  research  fast  simple  form  answers  link
       topic 5:  question  answer  nanonano dinkle article  write  writing  review  blog  articles
       topic 6:  writing  answer  article  question  opinion  short  advice  editing  rewriting  paul
       topic 7:  transcribe  transcription  improve  retranscribe edit  answerly voicemail  query  question  answer


Covariates Examined

  HIT Characteristics
      Monetary reward
      Number of HITs
      Length in characters (increases lifetime)
      HIT topic (based on Latent Dirichlet Allocation analysis)

  Market Characteristics: Not affecting completion time
      Day of the week (when HIT was first posted)
      Time of the day (when HIT was first posted)

  Requester Characteristics
      Activities of requester until time of submission
      Existing lifetime of requester (1yr ~ 50% speedup)



Covariates Examined

  HIT Characteristics
      Monetary reward
      Number of HITs
      Length in characters (increases lifetime)
      HIT topic (based on Latent Dirichlet Allocation analysis)

  Market Characteristics: Not affecting completion time
      Why? We look at long‐running HITs until completion…
      Day of the week (when HIT was first posted)
      Time of the day (when HIT was first posted)

  Requester Characteristics
      Activities of requester until time of submission
      Existing lifetime of requester



Conclusions

      Completion times for tasks in Amazon Mechanical Turk follow a 
       heavy-tailed distribution. (Paper studying MicroTasks.com has similar conclusions.)


      Sample averages cannot be used to predict the expected completion 
       time of a task.


      By fitting a Cox proportional hazards regression model to the data 
       collected from AMT, we showed the effect of various HIT parameters 
       on the completion time of the task


      “Base survival function” still a power‐law → Still difficult to predict



Lessons Learned and Future Work

      Current survival analysis too naive:
          Ignores many interactions across variables
          Need time‐dependent covariates (market changes over time)
          More frequent crawling does not change the results
      Important: Analysis ignores “refilling” of HITs


     TODO:
      Better to model the HIT assignment disappearance rate directly 
       (how many #HITs done per minute)
      Use queuing theory models 
      Use hierarchical version of LDA and dynamic models (#topics and 
       shifts in topics over time)
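
The disappearance-rate idea from the TODO list can be sketched as follows. This is an illustration only: the snapshot format and the function name are hypothetical, and treating any increase in availability as a refill is a simplifying assumption (completions that happen in the same interval as a refill are undercounted).

```python
def completion_rates(snapshots):
    """Estimate HITs completed per minute between consecutive crawls.

    snapshots: list of (minute, hits_available) for one HIT group, sorted
    by time. An increase in availability is treated as a refill by the
    requester and contributes a rate of zero rather than a negative one.
    """
    rates = []
    for (t0, n0), (t1, n1) in zip(snapshots, snapshots[1:]):
        completed = max(n0 - n1, 0)
        rates.append((t1, completed / (t1 - t0)))
    return rates

# Hypothetical crawl of one group: (minute, HITs still available).
rates = completion_rates([(0, 100), (60, 70), (120, 90), (180, 60)])
```

Working with rates instead of whole-group lifetimes sidesteps part of the refilling problem, since each interval is judged on its own.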

Any Questions?

Estimating Completion Time for Crowdsourced Tasks Using Survival Analysis Models

  • 1. Estimating the Completion Time of Crowdsourced Tasks using Survival Analysis Jing Wang, New York University Siamak Faridani, University of California, Berkeley Panos Ipeirotis, New York Univesity 1
  • 2. Crowdsourcing: Pricing and Time to completion?  Many firms use crowdsourcing for a variety of tasks y g y  Still unclear how to price p  Prior results indicate that price does not affect quality (Mason and Watts, 2009)  …but it does affect completion time U l Unclear how long it will take for a task to finish h l it ill t k f t k t fi i h 2
  • 3. Data Set:  Mechanical Turk Tracker  (http://www.mturk‐tracker.com)  Crawled Amazon Mechanical Turk hourly (now every min) y( y )  Captured full market state (content, position, and  characteristics of all available HITs).  15 months of data (now >24 months)  165,368 HIT groups  6,701,406 HIT assignments from 9,436 requesters  Value of the HITs: $529,259 [guesstimate ~10% of actual value]  Missing very short tasks (posted and disappeared in <1hr)  Do not observe HIT redundancy 3
  • 4. Completion Times: Power‐laws HIT completion time: Time_last_seen – Time_first_posted 4
  • 5. Completion Times: Power‐laws and Censoring Censoring Effects Jumps/Outliers: Expiration Different slope: Requesters  taking down HITs taking down HITs HIT completion time: Time_last_seen – Time_first_posted 5
  • 6. Parameter estimation  Maximum Likelihood Estimation, controlling for censored data  Power‐law parameter α~1.5  Power‐laws with α<2 do not have well‐defined mean value  Sample average increases as sample size increases Sample average increases as sample size increases 6
  • 7. Why Power‐laws?  Queuing theory model by (Cobham, 1954):  If workers pick tasks from two priority queues,  completion time follows power‐law with α=1.5  Chilton et al, HCOMP 2010: workers rank either by  “most recently posted” or by “most HITs available”  Result Inherent unpredictability of completion time Result: Inherent unpredictability of completion time  Real solution: Amazon should change the interface  But let’s see how other factors affect completion time 7
  • 8. Survival Analysis  Examine and model the time it takes for events to occur  In our case: Event = HIT gets completed  Survival function S(t):    Probability that tasks will last longer than t  Used stratified Cox Proportional Hazards Model 8
  • 9. Covariates Examined  HIT Characteristics  Monetary reward Monetary reward  Number of HITs  Length in characters  HIT topic (based on Latent Dirichlet Allocation analysis)  Market Characteristics  Day of the week (when HIT was first posted)  Time of the day (when HIT was first posted)  Requester Characteristics Requester Characteristics  Activities of requester until time of submission  Existing lifetime of requester 9
  • 10. Effect of Price: Mostly monotonic
      - h(t) = 1.035^price
      - 40% speedup for 10x price
      - Half-life for $0.025 reward ~ 2 days
      - Half-life for $1 reward ~ 12 hours
  • 11. Covariates Examined (same covariate list as slide 9)
  • 12. Effect of #HITs: Monotonic, but sublinear
      - h(t) = 0.998^#HITs
      - 10 HITs → 2% slower than 1 HIT
      - 100 HITs → 19% slower than 1 HIT
      - 1000 HITs → 87% slower than 1 HIT
      - or, 1 group of 1000 → 7 times faster than 1000 sequential groups of 1
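The percentages on this slide can be checked with a few lines of arithmetic. This sketch reads 0.998^#HITs as a multiplier on the baseline hazard (the completion rate), which is an interpretation of the slide rather than something it states; `hazard_multiplier` is a hypothetical name. With the coefficient rounded to 0.998, the slowdowns come out at roughly 2%, 18%, and 86%, close to but not exactly the slide's figures.

```python
def hazard_multiplier(num_hits, per_hit_ratio=0.998):
    """Hazard multiplier for a group of num_hits HITs (0.998 ** #HITs)."""
    return per_hit_ratio ** num_hits


for n in (10, 100, 1000):
    slowdown = 1.0 - hazard_multiplier(n)
    print(f"{n} HITs: completion rate about {slowdown:.0%} lower")
```

Because the effect compounds per HIT, the slowdown grows far less than proportionally with group size, which is the sublinear behavior the slide title refers to.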
  • 13. Covariates Examined
      - HIT Characteristics
          - Monetary reward
          - Number of HITs
          - Length in characters (increases lifetime)
          - HIT topic (based on Latent Dirichlet Allocation analysis)
      - Market Characteristics
          - Day of the week (when HIT was first posted)
          - Time of the day (when HIT was first posted)
      - Requester Characteristics
          - Activities of requester until time of submission
          - Existing lifetime of requester
  • 14. HIT Topics
      - topic 1: cw castingwords podcast transcribe english mp3 edit confirm snippet grade
      - topic 2: data collection search image entry listings website review survey opinion
      - topic 3: categorization product video page smartsheet web comment website opinion
      - topic 4: easy quick survey money research fast simple form answers link
      - topic 5: question answer nanonano dinkle article write writing review blog articles
      - topic 6: writing answer article question opinion short advice editing rewriting paul
      - topic 7: transcribe transcription improve retranscribe edit answerly voicemail query question answer
  • 15. Effect of Topic: The CastingWords Effect (topic list as on slide 14)
  • 16. Effect of Topic: Surveys = fast (even with redundancy!) (topic list as on slide 14)
  • 17. Effect of Topic: Writing takes time (topic list as on slide 14)
  • 18. Covariates Examined
      - HIT Characteristics
          - Monetary reward
          - Number of HITs
          - Length in characters (increases lifetime)
          - HIT topic (based on Latent Dirichlet Allocation analysis)
      - Market Characteristics: Not affecting
          - Day of the week (when HIT was first posted)
          - Time of the day (when HIT was first posted)
      - Requester Characteristics
          - Activities of requester until time of submission
          - Existing lifetime of requester (1 yr ~ 50% speedup)
  • 19. Covariates Examined
      - HIT Characteristics
          - Monetary reward
          - Number of HITs
          - Length in characters (increases lifetime)
          - HIT topic (based on Latent Dirichlet Allocation analysis)
      - Market Characteristics: Not affecting (Why? We look at long-running HITs until completion…)
          - Day of the week (when HIT was first posted)
          - Time of the day (when HIT was first posted)
      - Requester Characteristics
          - Activities of requester until time of submission
          - Existing lifetime of requester
  • 20. Covariates Examined (same covariate list and annotations as slide 18)
  • 21. Conclusions
      - Completion times for tasks in Amazon Mechanical Turk follow a heavy-tailed distribution. (A paper studying MicroTasks.com has similar conclusions.)
      - Sample averages cannot be used to predict the expected completion time of a task.
      - By fitting a Cox proportional hazards regression model to the data collected from AMT, we showed the effect of various HIT parameters on the completion time of the task.
      - “Base survival function” is still a power-law
      - Still difficult to predict
  • 22. Lessons Learned and Future Work
      - Current survival analysis is too naive:
          - Ignores many interactions across variables
          - Needs time-dependent covariates (market changes over time)
      - More frequent crawling does not change the results
      - Important: analysis ignores “refilling” of HITs
      - TODO:
          - Better to model directly the HIT assignment disappearance rate (how many #HITs done per minute)
          - Use queuing model theories
          - Use hierarchical version of LDA and dynamic models (#topics and shifts in topics over time)