In this talk, I discuss several interactive crowd-powered systems
that help people address real-world problems. For instance, VizWiz
sends questions blind people have about their visual environment to
the crowd, Legion allows outsourcing of desktop tasks to the crowd,
and Scribe allows the crowd to caption audio in real time. Thousands
of people have engaged with these systems, providing an
interesting look at how end users want to interact with crowd work.
Collectively, these systems illustrate a new approach to human
computation in which the dynamic crowd is provided the computational
support needed to act as a single, high-quality agent. The classic
advantage of the crowd has been its wisdom, but our systems are
beginning to show how crowd agents can surpass even expert individuals
on motor and cognitive performance tasks.
Crowd Agents: Interactive Crowd-Powered Systems in the Real World
1. Crowd Agents
Interactive Crowd-Powered Systems in the Real World
Jeffrey P. Bigham
University of Rochester
University of Rochester Human-Computer Interaction Jeffrey P. Bigham
3. Introduction VizWiz Crowd Agents Scribe
4. Human Assistance in History
What the Disability Community Can Teach Us About Interactive Crowdsourcing. Jeffrey P. Bigham and Richard Ladner. interactions magazine. July 2011.
5. Connectivity
6. Courtesy of John Brabyn
7. Remote Assistance
Video Relay Services
Real-Time Captioning
8. Connectivity -> Crowd
Mechanical Turk
Friends and Family on Social Networks
9. VizWiz
Bigham et al. Nearly Real-Time Answers to Visual Questions. UIST 2010.
10. Access Technology
• Optical Character Recognition
• Color Recognizers
• Talking GPS
• …
Problems
1. Limited Scope
2. Unacceptable Error Rate
3. $$$
4. Not Exactly What Users Want
11. Releasing VizWiz
• Released on May 31, 2011
– 5000 users asked more than 50,000 questions
– answers in less than a minute
12. Recruiting Crowd Quickly
How many workers do we need?
- number of current workers
- likelihood of needing more workers
quikTurkit posts or removes jobs as needed; Turkers answer multiple questions per session.
At $4/hr, latency drops to under 30 seconds from start to finish.
quikturkit.googlecode.com
Bigham et al. Nearly Real-Time Answers to Visual Questions. UIST 2010.
13. Characterization of the Crowd
- Workers Come and Go
- Some May Do the Wrong Thing
14. Supporting a Continuous Interaction?
Where’s the coffee?
Walk to the end of this hall, turn right.
Turn right into the kitchen.
Soda on the left, coffee on the right.
How do I use this machine?
15. Model for Crowd Agents
16. Model for Crowd Agents
Input Mediation
Learning
17. Model for Crowd Agents
• What interface is being controlled?
• How is input mediation done?
• Role of automated agents?
18. Chorus
19. (Example conversation with the Chorus crowd agent)
20. Legion: Control of Any Interface
(Architecture diagram: the Legion client sends a video stream, task description, and crowd agreement/payment info through a Flash Media Server; quikTurkit recruits workers; worker input, key presses and mouse clicks, passes through the input mediators back to the Legion client as mediated input. The worker interface explains the controls, gives feedback reflecting the worker’s last key press and whether the interface last followed the crowd or the worker, and shows the current bonus level, tied to crowd agreement. Results chart: completion time in seconds and success rates out of 10 trials for the Solo, Mob, Vote, Active, and Leader input mediators.)
W. Lasecki, S. White, K. Murray, R. Miller, and J.P. Bigham “Real-Time Control of
Existing Interfaces.” UIST 2011.
21. (Legion demos: copying a whiteboard table into a spreadsheet; driving a webcam robot by natural language commands)
22. Crowd Memory
W.S. Lasecki, S.C. White, K.I. Murray and J.P. Bigham. “Crowd Memory: Learning in
the Collective.” Collective Intelligence 2012.
23. Crowd Memory
24. Deployable Activity Recognition
W.S. Lasecki, Y. Song, H. Kautz, and J.P. Bigham. “Real-Time Activity Labeling for
Deployable Activity Recognition.” Submitted to CSCW 2012. Pervasive 2012 (poster)
25. Legion: Scribe
Real-Time Captions by Groups of Non-Experts
26. Real-Time Captioning
Problem: produce a text transcript of speech with less than 5 seconds of latency
Stenographers: expensive, difficult to schedule, lack domain expertise, pretty accurate.
ASR: cheap, available on demand, can be trained for new vocab, does not work*.
(“Can I help?” “NO, you are worse than ASR.”)
* in real settings, from an unknown mic, with a speaker who hasn’t trained the ASR
27. Real-Time Captioning
W. Lasecki, C. Miller, A. Sadilek, A. Abumoussa, D. Borrello, R. Kushalnagar, J.P.
Bigham. “Real-Time Captioning by Groups of Non-Experts.” UIST 2012.
28. Input Mediator
Multiple Sequence Alignment, Online Version
(Figure: the alignment graph grows in stages over time as words arrive.)
Worker 1: open the file now
Worker 2: the java fiel
Worker 3: open java file up and
Baseline: open the java file now and
W.S. Lasecki, C.D. Miller, D. Borrello and J.P. Bigham. “Online Sequence Alignment
for Real-Time Audio Transcription by Non-Experts.” AAAI 2012 (poster).
29. Scribe Interface
Encourages:
- real-time input
- global coverage
- short sequences
Co-evolution of Interface and Algorithm
30. Coverage Graph
31. Tradeoff
Failures:
“n-factorial”
“in pectoral”
32. Interesting Qualities
• Captionists can be experts
– not at captioning but in the subject
• Low cost
– $30/hour on MTurk (did not optimize)
– or free (impossible before)
• Recruited on demand
– for only as long as needed
33. Scribe vs. ASR
Scribe: Web prefetching is 1 technique that ressearchers rely on history based to the non history based technique the downloaded pages will be scanned and hyperlinks will be…
ASR: A lactate fencing is one thinking that and etc. rely on to improve network. Phillipe pitching. Anything survived all incident techniques…
34. Incorporating ASR
Coverage Increase: 28% to 55%
(single worker case)
35. Conclusions
General Lessons, Science, and the Future
36.
“What would it take for me
to be proud of my daughter
being a crowd worker?”
- Niki Kittur @ CrowdCamp
Currence Bigham after her first running race.
37. Do Good
Connect to help and support.
Do Better
Do better work than anyone could alone.
38.
hci.cs.rochester.edu
@jeffbigham
Thanks!
Funded by: National Science Foundation Grants (#IIS-1149709, #IIS-1116051, #IIS-1049080), and Google.
Editor’s notes
Hi everyone, I’m Jeff from the University of Rochester. Over the past few years, we have been working on crowd-powered systems designed to be used in the real world, to help real people solve everyday problems. Today, I’ll tell you about some of those systems, general lessons I think we can take away from them, and how users and workers have reacted to interacting with them on real tasks.
But, before I do, I want to take a bit of a step back, in order to place our work in the context of history, and to set a foundation for my vision for the future. Since the earliest days of computer science, computer scientists have dreamed of a future world in which we work seamlessly with machines to get things done in the real world. The AI and HCI communities have in particular taken up this challenge, with slightly different focuses, but with what I believe to be often similar end goals. What I’m excited about is that I believe we’re finally at a point where we can actually build the intelligent interactive agents of our dreams. A big part of how we’ll build them is real-time human computation, which I believe requires a tight coupling of AI and HCI.
A lot of my research is in building applications targeted at helping people with disabilities, and nowhere is the long history of human assistance as readily apparent as it is there. People provide one another assistance every day. Volunteers may go to a blind person’s home to read her mail, sign language interpreters help ensure education is available to deaf students, and friends help people with physical disabilities get around. This has been true forever; what has changed is connectivity.
Connectivity means that wherever I am, whatever I need, I can now easily recruit a person to help me with it. I needn’t rely on having someone nearby or technology that is itself intelligent enough to help me.
And, people with disabilities were some of the first to leverage what we today might call human computation. This sketch from the early 90s illustrates a service developed by the Smith-Kettlewell Eye Institute, in which a blind person has scanned a frozen dinner and is talking to a remote supporter to find out more about it. I especially like this picture because the blind person is being assisted remotely by a person in a wheelchair.
As technology improved, so did the services available to people with disabilities. By 2000 or so, deaf people were connecting to video relay services that allowed them to sign to hearing folks on the phone, and they connected to remote real-time captionists who could convert speech to written text. These were huge advances, but because they required experts who needed to be available for a long time, they were very expensive, in the range of $100 to $200 an hour.
I’m excited because increasing connectivity now means that anyone can help: workers on Mechanical Turk, volunteers, and friends and family. Potentially, this makes the market for assistance much more elastic.
A few years ago, we explored this potential through an iPhone application that we developed called VizWiz. VizWiz lets blind people take a picture, speak a question, and get an answer back in a few seconds from people out on the web.
There is already a lot of great access technology that serves as sensors onto an inaccessible world for people with disabilities. OCR recognizes text, color recognizers can help people coordinate outfits, and talking GPS units can help people find their way. Unfortunately, despite its promise, this technology remains limited in the scope of problems it can reliably solve, and it still has unacceptable error rates for real applications. The technology is expensive, costing 100s to 1000s of dollars, and in the end often isn’t exactly what users would want anyway. In fact, we as technologists often don’t really know what users really want.
And so, in the course of running what we call a deployable Wizard-of-Oz experiment, we released VizWiz on the app store about a year ago, to pretty dramatic results: 5000 users have asked more than 50,000 questions. This provides us an unprecedented look at what blind people might actually want to know about their visual environments.
So, how do we get answers back quickly for VizWiz? On the backend, we run a service called quikTurkit. The goal of quikTurkit is to keep workers around to answer questions. It can either be used on demand (when a question is received) or keep a pool of workers around at all times to further reduce latency. To help improve on-demand response times, the VizWiz application lets quikTurkit know when someone has started to interact with it (that is, took a picture), so it can begin recruiting workers. An interesting result that came from our initial work is that time to answer is very much dependent on how difficult the work is to do; in this case, VizWiz questions are all answered pretty quickly, but they are answered most quickly when the question can actually be answered from the photo and the question could be automatically converted to text using speech recognition. It turns out keeping a steady pool around isn’t that expensive, and doing so further reduces latency to under 30 seconds from when a question is received to when an answer is sent.
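The pooling logic described in this note can be sketched roughly as follows. The sizing rule, function names, and the three-answers-per-worker assumption are my illustrative guesses, not quikTurkit’s actual implementation:

```python
# Illustrative sketch of quikTurkit-style worker pooling (assumed
# logic, not the real quikTurkit code).

def target_pool_size(active_questions, expected_new, answers_per_worker=3):
    """Estimate how many workers to keep recruited: enough to cover
    current questions plus a cushion for likely new arrivals."""
    demand = active_questions + expected_new
    # Each recruited worker answers several questions per session,
    # so fewer workers are needed than outstanding questions.
    return max(1, -(-demand // answers_per_worker))  # ceiling division

def jobs_to_post(current_workers, active_questions, expected_new):
    """Post more jobs when the pool is short; a negative result
    means jobs can be removed because the pool is oversized."""
    return target_pool_size(active_questions, expected_new) - current_workers
```

The key design point from the note is that recruiting begins as soon as the user takes a picture, before the question even arrives, so the pool is warm when it is needed.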
Our experience with VizWiz led us to characterize the crowd that is easy to recruit online as follows: the crowd is dynamic, which means that workers come and go. And, some workers may do the wrong thing.
So, given that characterization of the crowd, imagine that we wanted to support a richer, continuous interaction like this one. How could that work with the crowds that we have?
We could imagine recruiting a single worker from the crowd, who could chat with the user much like they would on IM. This has definite advantages. For instance, by using existing interfaces, we can leverage all that we know about making these usable, and we can leverage the experience that people have using them: turkers know how to use instant messenger, and so do blind people. But doing this naively fails under our model of the crowd; in particular, what if a worker provides bad input, or what if a worker disappears entirely? To accommodate this, we add in more workers, all controlling IM as they know how to do. But now we have another problem: the user’s interaction is not what they’re accustomed to; namely, they’re being expected to hold multiple conversations at once.
To address this, we introduce an input mediation layer that takes all the input that it receives and condenses it to a single stream that is forwarded on. This layer could be powered by an automatic algorithm, or also powered by the crowd. We might also introduce learning into the pipeline, so that the system can learn to serve as one of its own workers, thus, for instance, allowing the crowd to take on the difficult bit of adapting to new environments, after which the automatic agents take over.
This model is what we mean by crowd agents: crowd workers acting as one. And so, the questions that define a particular crowd agent are: What interface is being controlled? How is input mediation done? And, what is the role of automated agents?
Our Chorus system demonstrates how this works for chatting with the crowd agent. Each crowd worker chats in an interface that looks a lot like instant messenger. To maintain consistency, they are provided a space for shared memory. The crowd mediates its own inputs by voting responses through.
This is an example conversation; in this case, the user chats with the crowd agent about a place to eat in Los Angeles. It seems as though this real-time chat is happening with a real person. Behind the scenes, Chorus is making sure that happens. Workers propose messages, and only those that receive enough votes are forwarded on. In our experiments, the crowd agent was able to reliably carry on a conversation with the user, answering nearly all questions in a reasonable way. Even though the crowd is comprised of people, issues like consistency and memory make a Crowd Turing Test something reasonable we might explore in the future.
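The vote-through mediation described here can be sketched minimally as below; the data structures and the two-vote threshold are assumptions for illustration, not Chorus’s actual parameters:

```python
# Minimal sketch of Chorus-style input mediation: workers propose chat
# messages and vote on proposals; only sufficiently endorsed messages
# are forwarded to the end user. Threshold is an assumed parameter.

from collections import Counter

def mediate(proposals, votes, min_votes=2):
    """proposals: candidate messages from workers; votes: list of
    messages workers voted for. Returns the messages that cleared
    the threshold, most-endorsed first."""
    tally = Counter(votes)
    passed = [m for m in proposals if tally[m] >= min_votes]
    return sorted(passed, key=lambda m: -tally[m])
```

Filtering by agreement is what lets a dynamic, partly unreliable crowd present a single consistent conversational voice to the user.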
Legion is another system that we created. In this case, we put the crowd agent in control of an existing desktop interface via VNC (remote desktop). Crowd workers send their commands (key presses or mouse clicks), and the Legion input mediators decide how to forward them on. The most basic strategy one might try is to divide time into windows and just take a vote, but it turns out this is slow and leads to thrashing. What worked best for us in this case was to use the vote not to decide what to do next, but to elect leaders who would temporarily assume full control. Over a number of trials on different tasks, the leader input mediator showed the best compromise between speed and successful task completion.
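One way the leader election might work is sketched below, under the assumption that workers are scored by how often their recent inputs matched the crowd’s per-step majority; the paper’s exact election criterion may differ:

```python
# Sketch of a Legion-style "leader" input mediator: rather than voting
# on every key press, the crowd's recent agreement elects one worker
# who temporarily gets full control. Scoring rule is an assumption.

from collections import Counter

def elect_leader(recent_inputs):
    """recent_inputs: {worker_id: [inputs in the last time window]}.
    Score each worker by agreement with the per-step majority input;
    the best-agreeing worker becomes leader."""
    steps = max(len(v) for v in recent_inputs.values())
    scores = {w: 0 for w in recent_inputs}
    for t in range(steps):
        step = [v[t] for v in recent_inputs.values() if t < len(v)]
        majority, _ = Counter(step).most_common(1)[0]
        for w, v in recent_inputs.items():
            if t < len(v) and v[t] == majority:
                scores[w] += 1
    return max(scores, key=scores.get)
```

Electing a leader avoids the thrashing of per-action voting while still grounding control in crowd agreement.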
Legion can be used for all sorts of tasks. In this example, we used it to copy a table we drew on a whiteboard into a spreadsheet. We even drove a robot around with it, in this case turning a cheap mobile webcam into a robot that followed natural language commands.
We also used Legion to investigate properties of our crowd; specifically, with people coming and going, would the crowd learn from each other? We set up a simple board in a first-person shooter in which players needed to press one of two buttons to progress through the game (either a white or a black button). We told the first generation of crowd workers which button to press, and then let them loose.
Over the course of an hour-long experiment, the crowd completely turned over several times, but they continued to press only the white button, presumably because they were learning from each other. We relate this back to the concept of Organizational Learning, which is one construct that helps to explain how culture and traditions are passed down from generation to generation at organizations ranging in size from families to nations. Of course, the time scales of the crowd are much shorter.
We also created a system for more deployable activity recognition using this model. The idea is that while automated systems can do a decent job at recognizing activities, they struggle in new environments or when someone does an activity in a new way. In our system, when the automated system, in this case an HMM-based activity recognizer, is not confident about a label, it sends the video out to the crowd. Each crowd worker inputs activity labels, and other crowd workers serve as the input mediator to decide what is forwarded. The labels get sent along with the sensor stream to train the system to work better next time. As an interesting side note, automated suggestions serve a dual purpose. Clearly, they can be used directly when they are correct, but they also help tune the crowd to the desired granularity of response; for instance, if the suggested label is “making breakfast,” workers are less likely to suggest and choose lower-level actions like “raising spoon” or “closing bag of cereal.”
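The confidence-gated loop in this note can be sketched roughly as below; the recognizer and crowd interfaces are hypothetical stand-ins, and the 0.8 threshold is an assumed parameter:

```python
# Sketch of the deployable activity-recognition loop described above:
# the automatic recognizer labels on its own when confident, defers to
# the crowd otherwise, and banks crowd labels as new training data.
# The recognizer/ask_crowd callables here are assumed stand-ins.

def label_stream(frames, recognizer, ask_crowd, threshold=0.8):
    labels, new_training = [], []
    for frame in frames:
        label, confidence = recognizer(frame)
        if confidence >= threshold:
            labels.append(label)          # trust the automatic label
        else:
            crowd_label = ask_crowd(frame)  # mediated crowd answer
            labels.append(crowd_label)
            new_training.append((frame, crowd_label))  # retrain later
    return labels, new_training
```

The crowd thus handles exactly the cases the recognizer cannot, and its answers shrink that set over time.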
The final system I’ll describe is called Legion: Scribe, which allows groups of non-experts to caption speech in real time for deaf and hard of hearing students.
Real-time captioning is the problem of producing a text version of speech with less than 5 seconds of latency. Currently, there are two main approaches to real-time captioning, and they both have drawbacks. The first is to employ professional stenographers: they are pretty accurate, but expensive, difficult to schedule, and often lacking domain expertise, which makes it difficult to caption advanced technical material. The second is Automatic Speech Recognition: it’s cheap, available on demand, and able to be adapted to new vocab. Unfortunately, despite impressive advances over the past few decades, it does not work… which is only a slight exaggeration; it does not work in novel contexts, such as when a deaf student shows up to a classroom and pulls out her iPhone. So, that led me to ask whether I could help. I type pretty fast, I know about computer science, maybe I could at least help caption our courses. Unfortunately, I can’t. In fact, by some metrics, I’m worse than ASR because I just can’t type fast enough.
So, we built a system that allows me to help. It’s called Scribe. A traditional stenographer setup looks like this: you stream audio to someone, they type what they hear, and the digital text is forwarded back to you. Unfortunately, if that person is me, I can’t type the 225 wpm or so necessary to keep up with natural speaking rates. So, instead, we distribute the audio to multiple people, they all type, and then we merge the text they type together to form a single output. Making this work well has two main components: the computer interface side, which encourages workers to type what they hear and to type different parts of the speech; and the algorithm side, which takes these pieces and stitches them back together. First, the algorithm:
It turns out that our problem is sort of similar to one encountered in computational biology. In particular, in shotgun sequencing, DNA is broken into multiple short strands that can be more easily sequenced. These sequences are then merged back together in order by multiple sequence alignment (MSA), by computing the best alignment. To use MSA, we replaced the mutation model for nucleotides with a natural language model. MSA is usually an offline procedure. To do alignment online, we perform a greedy search on a dependency graph that we create, in which edges join words that appeared next to each other in the crowd input.
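A highly simplified toy version of the dependency-graph idea is sketched below. The real system performs multiple sequence alignment with a natural language model; this sketch only illustrates the intuition of joining words that appeared next to each other in worker input and greedily following the best-supported edges:

```python
# Toy sketch of merging partial captions via a word-adjacency graph.
# This is NOT the actual Scribe alignment algorithm, just the
# dependency-graph intuition: edges join adjacent words from worker
# input, and a greedy walk follows the most-supported edges.

from collections import Counter, defaultdict

def merge_partial_captions(partials, max_len=20):
    edges = defaultdict(Counter)   # word -> Counter of next words
    starts = Counter()
    for words in partials:
        if words:
            starts[words[0]] += 1
        for a, b in zip(words, words[1:]):
            edges[a][b] += 1
    if not starts:
        return []
    merged, seen = [starts.most_common(1)[0][0]], set()
    while merged[-1] in edges and len(merged) < max_len:
        nxt = edges[merged[-1]].most_common(1)[0][0]
        if nxt in seen:            # avoid cycling on repeated words
            break
        seen.add(nxt)
        merged.append(nxt)
    return merged
```

Even this toy version shows how overlapping fragments from slow typists can reconstruct a longer utterance none of them typed in full.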
Unfortunately, this is only half the story, because it turns out the interface design for this task was non-trivial. The task is actually pretty difficult and, by its nature, frustrating, because you really can’t do it perfectly. Our interface encourages real-time input with feedback to captionists, and encourages global coverage by systematically varying the volume of the clip. The algorithm only works well with continuous sequences, and so the interface rewards workers for typing a few words in a row. Each word a worker types is more latent than the last, so the interface stops rewarding workers after sequences of about eight words. Scribe required us to carefully consider both the interface and algorithm at once so we could make up for a deficiency in one with the other, and so we describe this process as one of co-evolution.
So, we ran some experiments with the system with a bunch of workers, both local undergrads and turk workers, captioning some technical lectures from courses drawn from MIT X. The first thing you’d want to know is whether our workers can actually even type all of the words that they hear in aggregate. This graph shows they can, at about 7 workers, although it’s important to point out that these workers were complete novices. As expected, Scribe quickly outperforms both ASR and a single worker.
Here’s a precision vs. coverage graph; in this case, coverage is roughly recall. We can get pretty close to CART, although metrics in this space are tricky, because not all errors are created equally. Because of how the computer systems stenographers use to convert phonemes to text work, they often make homonym errors. These errors are compounded when the captionist is not a domain expert. So, for instance, when transcribing an Electrical Engineering lecture, CART transcribed “n-factorial” as “in pectoral,” whereas our workers got it right.
Believe it or not, deaf and hard of hearing people often actually prefer our captions. Here’s a quick video that illustrates one of the reasons why. First, you’ll notice that while our captions aren’t perfect, the errors make much more sense than ASR’s. This is one reason that even while ASR seems competitive with individuals on automated metrics, in practice it is much worse.
This is one reason why incorporating ASR back into the overall system is difficult. Nevertheless, doing so does increase coverage substantially, from 28% to 55%, showing there is information there that could be leveraged. I’m most excited about the work that we’re beginning that will use real-time crowd captions to train ASR on the fly.
So, I am done with the majority of my talk. But I want to end with a challenge, a partial solution, and another challenge.
The first challenge is not mine. It comes from Niki Kittur at CrowdCamp at CHI this year. He asks, “What would it take for me to be proud of my daughter for being a crowd worker?” And I think this is a very interesting question. So much of our research in human computation is about how to get the crowd to do work we don’t want to do, and how to compensate for the low-quality work they often provide, that I think we are missing an enormous opportunity to leverage the crowd to do work that we can be proud of, that we would be proud for our sons and daughters to do, that we would be proud to do ourselves.
So, part of my answer to Niki’s question is to pursue systems that allow us to come together to Do Good. I think VizWiz is a great example of this. Spend a few seconds, and help a blind person go about their day more independently. I would be proud of my daughter for doing that. Eventually, I think we can build interactive, crowd-powered systems that provide real value to all of us during our everyday lives. But I think we can also Do Better. One of the reasons why I am excited about Scribe is that it allows me to do something as part of a crowd that I simply could not do alone. Real-time captioning requires motor and cognitive performance at the outer limits of what humans can do. The challenge we are currently pursuing is to better understand the capabilities of crowd agents, both through the development of new applications that leverage them and their potentially super-human abilities, and through the development of a basic science of crowd motor and cognitive performance modeled on what we have for individual humans. Collectively, I hope these directions will allow crowdsourcing work to transition from work we don’t want to do to work we can be proud to do.
The content of this talk is the result of the hard work of a whole bunch of collaborators, some of whom are shown here, and of generous funding by the National Science Foundation and Google.