[Karger+ NIPS11] Iterative Learning for Reliable Crowdsourcing Systems


Shuyo Nakatani


[Karger+] Iterative Learning for
Reliable Crowdsourcing Systems

2012/04/08 #NIPSreading
Nakatani Shuyo

Crowdsourcing
• Outsource work to an unspecified, open crowd
– Most workers are not experts
– Some workers may be spammers
• Amazon Mechanical Turk
– Splits a large task into microtasks
– Workers earn a few cents per microtask


Spammer and Hammer
• Spam/Spammer
– submits arbitrary answers to collect the fee
• Ham/Hammer
– answers questions correctly
• It is difficult to distinguish spammers from hammers
– The requester doesn't have a gold standard
– Workers are neither persistent nor identifiable

Questions
• How to ensure reliability of workers
– Is this worker a spammer or a hammer?
• How to minimize total price
– ∝ number of task assignments
• How to predict answers
– majority voting? EMA?
• How to bound the error rate
– estimate an upper bound


Setting
• $t_i$: tasks, $i = 1, \dots, m$
• $w_j$: workers, $j = 1, \dots, n$
• $(l, r)$-regular bipartite graph between tasks $t_1, \dots, t_m$ and workers $w_1, \dots, w_n$
– Each task is assigned to $l$ workers.
– Each worker is assigned $r$ tasks.
• Given $m$ and $r$, how to select $l$?
– Since $ml = nr$, the number of workers $n = \frac{ml}{r}$ is then determined.
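The assignment graph above can be sketched in code. The following is a minimal sketch, assuming the graph is drawn by randomly matching task "stubs" to worker "stubs" (a configuration-model style construction, which the slide itself does not spell out). The function name `regular_bipartite_edges` is hypothetical.

```python
import numpy as np

def regular_bipartite_edges(m, l, r, rng):
    """Sample edges so each task has l workers and each worker has r tasks.

    Assumption: a random matching of stubs. It can produce repeated
    (task, worker) pairs, which this sketch does not remove.
    """
    if (m * l) % r != 0:
        raise ValueError("m * l must be divisible by r")
    n = m * l // r                               # from m l = n r
    task_stubs = np.repeat(np.arange(m), l)      # each task appears l times
    worker_stubs = np.repeat(np.arange(n), r)    # each worker appears r times
    rng.shuffle(worker_stubs)                    # random matching of stubs
    return n, np.column_stack([task_stubs, worker_stubs])

rng = np.random.default_rng(0)
n, edges = regular_bipartite_edges(m=12, l=3, r=4, rng=rng)
```

Each row of `edges` is a `(task, worker)` pair. The degree constraints hold by construction, whatever the shuffle produces.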


Model
• $s_i = \pm 1$: correct answer of $t_i$ (unobserved)
• $A_{ij}$: answer of $w_j$ to $t_i$ (observed)
• $p_j = P(A_{ij} = s_i)$ for all $i$: reliability of worker $w_j$
– It is assumed to be independent of the task
• $\mathbf{E}\left[(2p_j - 1)^2\right] = q$: average quality parameter
– $q \in [0, 1]$; $q$ close to 1 indicates that most workers are diligent
– $q$ is set to 0.3 in the later experiment


Example: spammer-hammer model
• For a given $q \in [0, 1]$,
• $p_j = 1$ with probability $q$
– $w_j$ is a perfect hammer (all answers correct).
• $p_j = 1/2$ with probability $1 - q$
– $w_j$ is a spammer (random answers)
• Then $\mathbf{E}\left[(2p_j - 1)^2\right] = q \times 1 + (1 - q) \times 0 = q$
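The spammer-hammer model is easy to simulate. This is a small sketch, with hypothetical helper names `sample_spammer_hammer` and `sample_answers`, that draws worker reliabilities and checks empirically that the mean of $(2p_j - 1)^2$ is close to $q$.

```python
import numpy as np

def sample_spammer_hammer(n, q, rng):
    """Return p_j for n workers: 1.0 (hammer) w.p. q, 0.5 (spammer) otherwise."""
    is_hammer = rng.random(n) < q
    return np.where(is_hammer, 1.0, 0.5)

def sample_answers(s, p, edges, rng):
    """For each edge (i, j), return A_ij = s_i with probability p_j, else -s_i."""
    i, j = edges[:, 0], edges[:, 1]
    correct = rng.random(len(edges)) < p[j]
    return np.where(correct, s[i], -s[i])

rng = np.random.default_rng(0)
p = sample_spammer_hammer(n=100_000, q=0.3, rng=rng)
q_hat = np.mean((2 * p - 1) ** 2)   # empirical E[(2 p_j - 1)^2]
```

Here `edges` is assumed to be an array of `(task, worker)` index pairs. A hammer contributes 1 to the average and a spammer contributes 0, so `q_hat` is simply the fraction of hammers.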


Iterative Inference
• $x_{i \to j}$: real-valued task messages from $t_i$ to $w_j$
• $y_{j \to i}$: worker messages from $w_j$ to $t_i$

(figure from [Karger+ NIPS11])
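The message passing can be sketched as follows. The update rules are not preserved in this transcript, so the sketch rests on stated assumptions. Each task message sums $A_{ij'} y_{j' \to i}$ over the other workers of the task. Each worker message sums $A_{i'j} x_{i' \to j}$ over the other tasks of the worker. Worker messages start as i.i.d. $N(1, 1)$ draws. The dense-matrix layout, the rescaling step, and the `y_init` and `seed` parameters are conveniences of this sketch, not something taken from the slides.

```python
import numpy as np

def iterative_inference(A, k_max=10, y_init=None, seed=0):
    """Estimate task labels from a dense answer matrix A of shape (m, n).

    A[i, j] is +1 or -1 if worker j answered task i, and 0 otherwise.
    """
    A = np.asarray(A, dtype=float)
    mask = A != 0
    if y_init is None:
        # Assumed initialization: i.i.d. N(1, 1) on every edge.
        y_init = np.random.default_rng(seed).normal(1.0, 1.0, size=A.shape)
    Y = np.where(mask, y_init, 0.0)          # Y[i, j] holds y_{j->i}
    for _ in range(k_max):
        AY = A * Y
        # x_{i->j}: sum of A_ij' * y_{j'->i} over the other workers j' of task i
        X = np.where(mask, AY.sum(axis=1, keepdims=True) - AY, 0.0)
        AX = A * X
        # y_{j->i}: sum of A_i'j * x_{i'->j} over the other tasks i' of worker j
        Y = np.where(mask, AX.sum(axis=0, keepdims=True) - AX, 0.0)
        scale = np.abs(Y).max()
        if scale > 0:
            Y /= scale                       # rescaling keeps signs, avoids overflow
    x = (A * Y).sum(axis=1)                  # decision statistic per task
    return np.where(x >= 0, 1, -1)
```

Subtracting the edge's own term from the row or column sum excludes the recipient from each message. Workers that disagree with the consensus end up with negative weights, so their answers are flipped rather than ignored.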

Prediction
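• predicted answer:

$\hat{s}_i\left(\{A_{ij}\}_{(i,j) \in E}\right) = \mathrm{sign}\left(\sum_{j \in \partial i} A_{ij}\, y_{j \to i}\right)$

– where $\partial i$: neighborhood of $t_i$
• error rate:

$\limsup_{m \to \infty} \frac{1}{m} \sum_{i=1}^{m} P\left(s_i \neq \hat{s}_i\left(\{A_{ij}\}_{(i,j) \in E}\right)\right)$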

Performance Guarantee

Theorem 2.1
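• For given $l > 1$, $r > 1$, $q \in [0, 1]$, let $\hat{l} = l - 1$, $\hat{r} = r - 1$.
• Assume $m$ tasks are assigned to $n = ml/r$ workers according to an $(l, r)$-regular bipartite graph
• Estimate from $k$ iterations of the iterative algorithm
• If $\mu \equiv \mathbf{E}[2p_j - 1] > 0$ and $q^2 > 1/(\hat{l}\hat{r})$, then

$\limsup_{m \to \infty} \frac{1}{m} \sum_{i=1}^{m} P\left(s_i \neq \hat{s}_i\left(\{A_{ij}\}_{(i,j) \in E}\right)\right) \le e^{-lq/(2\rho_k^2)}$

– where $\rho_k^2$ is the quantity defined in [Karger+ NIPS11] (its formula was an image on the slide)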


Corollary 2.2
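• Under the hypotheses of Theorem 2.1,

$\limsup_{k \to \infty} \limsup_{m \to \infty} \frac{1}{m} \sum_{i=1}^{m} P\left(s_i \neq \hat{s}_i\left(\{A_{ij}\}_{(i,j) \in E}\right)\right) \le e^{-lq/(2\rho_\infty^2)}$

• where $\rho_\infty^2$ is the quantity defined in [Karger+ NIPS11] (its formula was an image on the slide)
– For $q = 0.3$, $l = r = 25$, the r.h.s. is 0.31
– For $q = 0.5$, $l = 25$, $r = 10$, the r.h.s. is 0.15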
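The two numbers above can be checked numerically. The closed form for $\rho_\infty^2$ is not preserved in these slides, so the sketch below assumes $\rho_\infty^2 = \left(3 + \frac{1}{q\hat{r}}\right) \frac{q^2 \hat{l}\hat{r}}{q^2 \hat{l}\hat{r} - 1}$, as recalled from the paper's corollary. Treat that expression as an assumption. It does reproduce both quoted values to two decimal places. The function name `error_bound_limit` is hypothetical.

```python
import math

def error_bound_limit(q, l, r):
    """Evaluate exp(-l q / (2 rho_inf^2)) under the assumed closed form."""
    l_hat, r_hat = l - 1, r - 1
    gain = q * q * l_hat * r_hat             # condition q^2 > 1/(l_hat r_hat) means gain > 1
    if gain <= 1:
        raise ValueError("requires q^2 > 1 / ((l - 1) * (r - 1))")
    # Assumed expression for rho_inf^2 (not present in the slide text).
    rho_inf_sq = (3 + 1 / (q * r_hat)) * gain / (gain - 1)
    return math.exp(-l * q / (2 * rho_inf_sq))

bound_a = error_bound_limit(q=0.3, l=25, r=25)
bound_b = error_bound_limit(q=0.5, l=25, r=10)
```

Rounded to two decimals, `bound_a` and `bound_b` agree with the 0.31 and 0.15 quoted on the slide.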


Experiments
• $m = n = 1000$, $l = r$
• left: $q = 0.3$, $l \in [1, 30]$
• right: $l = 25$, $q \in [0, 0.4]$

(figures from [Karger+ NIPS11])
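A single point of this experiment ($m = n = 1000$, $l = r = 25$, $q = 0.3$) can be approximated with a small simulation. This is only a sketch under assumptions: workers follow the spammer-hammer model, the graph comes from a random stub matching, and the message updates are the ones assumed for the iterative algorithm, not code from the paper. The function name `simulate` is hypothetical, and the result is one random instance, not the averaged curves of the figure.

```python
import numpy as np

def simulate(m=1000, l=25, q=0.3, k_max=10, seed=0):
    """Return (majority-vote error, iterative error) for one instance with l = r."""
    rng = np.random.default_rng(seed)
    n = m                                              # l = r implies n = m
    s = rng.choice([-1, 1], size=m)                    # hidden true answers
    p = np.where(rng.random(n) < q, 1.0, 0.5)          # spammer-hammer reliabilities
    tasks = np.repeat(np.arange(m), l)
    workers = rng.permutation(np.repeat(np.arange(n), l))
    correct = rng.random(m * l) < p[workers]
    A = np.zeros((m, n))
    # Repeated (task, worker) pairs collapse into one entry in this dense layout.
    A[tasks, workers] = np.where(correct, s[tasks], -s[tasks])
    mask = A != 0
    majority = np.where(A.sum(axis=1) >= 0, 1, -1)     # baseline: majority voting
    Y = np.where(mask, rng.normal(1.0, 1.0, size=A.shape), 0.0)
    for _ in range(k_max):
        AY = A * Y
        X = np.where(mask, AY.sum(axis=1, keepdims=True) - AY, 0.0)
        AX = A * X
        Y = np.where(mask, AX.sum(axis=0, keepdims=True) - AX, 0.0)
        Y /= max(np.abs(Y).max(), 1e-300)              # rescale to avoid overflow
    iterative = np.where((A * Y).sum(axis=1) >= 0, 1, -1)
    return np.mean(majority != s), np.mean(iterative != s)

err_majority, err_iterative = simulate()
```

Sweeping `l` or `q` over the ranges listed above and averaging over several seeds would give curves comparable in spirit to the left and right panels.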

Lower Bound
